An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments - PubMed (original) (raw)
doi: 10.1038/nbt717. Epub 2002 Jul 8.
Affiliations
- PMID: 12101404
- DOI: 10.1038/nbt717
An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments
X Shirley Liu et al. Nat Biotechnol. 2002 Aug.
Abstract
Chromatin immunoprecipitation followed by cDNA microarray hybridization (ChIP-array) has become a popular procedure for studying genome-wide protein-DNA interactions and transcription regulation. However, it can only map the probable protein-DNA interaction loci within 1-2 kilobases resolution. To pinpoint interaction sites down to the base-pair level, we introduce a computational method, Motif Discovery scan (MDscan), that examines the ChIP-array-selected sequences and searches for DNA sequence motifs representing the protein-DNA interaction sites. MDscan combines the advantages of two widely adopted motif search strategies, word enumeration and position-specific weight matrix updating, and incorporates the ChIP-array ranking information to accelerate searches and enhance their success rates. MDscan correctly identified all the experimentally verified motifs from published ChIP-array experiments in yeast (STE12, GAL4, RAP1, SCB, MCB, MCM1, SFF, and SWI5), and predicted two motif patterns for the differential binding of Rap1 protein in telomere regions. In our studies, the method was faster and more accurate than several established motif-finding algorithms. MDscan can be used to find DNA motifs not only in ChIP-array experiments but also in other experiments in which a subgroup of the sequences can be inferred to contain relatively abundant motif sites. The MDscan web server can be accessed at http://BioProspector.stanford.edu/MDscan/.
Similar articles
- A hidden Markov model for analyzing ChIP-chip experiments on genome tiling arrays and its application to p53 binding sequences.
Li W, Meyer CA, Liu XS. Li W, et al. Bioinformatics. 2005 Jun;21 Suppl 1:i274-82. doi: 10.1093/bioinformatics/bti1046. Bioinformatics. 2005. PMID: 15961467 - Design of a combinatorial DNA microarray for protein-DNA interaction studies.
Mintseris J, Eisen MB. Mintseris J, et al. BMC Bioinformatics. 2006 Oct 3;7:429. doi: 10.1186/1471-2105-7-429. BMC Bioinformatics. 2006. PMID: 17018151 Free PMC article. - Computer-assisted identification of cell cycle-related genes: new targets for E2F transcription factors.
Kel AE, Kel-Margoulis OV, Farnham PJ, Bartley SM, Wingender E, Zhang MQ. Kel AE, et al. J Mol Biol. 2001 May 25;309(1):99-120. doi: 10.1006/jmbi.2001.4650. J Mol Biol. 2001. PMID: 11491305 - Visualizing and characterizing in vivo DNA-binding events and direct target genes of plant transcription factors.
Muiño JM, Angenent GC, Kaufmann K. Muiño JM, et al. Methods Mol Biol. 2011;754:293-305. doi: 10.1007/978-1-61779-154-3_17. Methods Mol Biol. 2011. PMID: 21720960 Review. - Location analysis of DNA-bound proteins at the whole-genome level: untangling transcriptional regulatory networks.
Nal B, Mohr E, Ferrier P. Nal B, et al. Bioessays. 2001 Jun;23(6):473-6. doi: 10.1002/bies.1066. Bioessays. 2001. PMID: 11385626 Review.
Cited by
- Novel synthetic inducible promoters controlling gene expression during water-deficit stress with green tissue specificity in transgenic poplar.
Yang Y, Chaffin TA, Shao Y, Balasubramanian VK, Markillie M, Mitchell H, Rubio-Wilhelmi MM, Ahkami AH, Blumwald E, Neal Stewart C Jr. Yang Y, et al. Plant Biotechnol J. 2024 Jun;22(6):1596-1609. doi: 10.1111/pbi.14289. Epub 2024 Jan 17. Plant Biotechnol J. 2024. PMID: 38232002 Free PMC article. - MicrosatNavigator: exploring nonrandom distribution and lineage-specificity of microsatellite repeat motifs on vertebrate sex chromosomes across 186 whole genomes.
Rasoarahona R, Wattanadilokchatkun P, Panthum T, Jaisamut K, Lisachov A, Thong T, Singchat W, Ahmad SF, Han K, Kraichak E, Muangmai N, Koga A, Duengkae P, Antunes A, Srikulnath K. Rasoarahona R, et al. Chromosome Res. 2023 Sep 30;31(4):29. doi: 10.1007/s10577-023-09738-4. Chromosome Res. 2023. PMID: 37775555 - A survey on algorithms to characterize transcription factor binding sites.
Tognon M, Giugno R, Pinello L. Tognon M, et al. Brief Bioinform. 2023 May 19;24(3):bbad156. doi: 10.1093/bib/bbad156. Brief Bioinform. 2023. PMID: 37099664 Free PMC article. Review. - Designing artificial synthetic promoters for accurate, smart, and versatile gene expression in plants.
Yasmeen E, Wang J, Riaz M, Zhang L, Zuo K. Yasmeen E, et al. Plant Commun. 2023 Jul 10;4(4):100558. doi: 10.1016/j.xplc.2023.100558. Epub 2023 Feb 9. Plant Commun. 2023. PMID: 36760129 Free PMC article. Review. - Biogenesis of telomerase RNA from a protein-coding mRNA precursor.
Logeswaran D, Li Y, Akhter K, Podlevsky JD, Olson TL, Forsberg K, Chen JJ. Logeswaran D, et al. Proc Natl Acad Sci U S A. 2022 Oct 11;119(41):e2204636119. doi: 10.1073/pnas.2204636119. Epub 2022 Oct 5. Proc Natl Acad Sci U S A. 2022. PMID: 36197996 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases