Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data - PubMed (original) (raw)
Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data
Anton Valouev et al. Nat Methods. 2008 Sep.
Abstract
Molecular interactions between protein complexes and DNA mediate essential gene-regulatory functions. Uncovering such interactions by chromatin immunoprecipitation coupled with massively parallel sequencing (ChIP-Seq) has recently become the focus of intense interest. We here introduce quantitative enrichment of sequence tags (QuEST), a powerful statistical framework based on the kernel density estimation approach, which uses ChIP-Seq data to determine positions where protein complexes contact DNA. Using QuEST, we discovered several thousand binding sites for the human transcription factors SRF, GABP and NRSF at an average resolution of about 20 base pairs. MEME motif-discovery tool-based analyses of the QuEST-identified sequences revealed DNA binding by cofactors of SRF, providing evidence that cofactor binding specificity can be obtained from ChIP-Seq data. By combining QuEST analyses with Gene Ontology (GO) annotations and expression data, we illustrate how general functions of transcription factors can be inferred.
Figures
Figure 1
QuEST’s representation of ChIP-Seq data using density profiles.. (A) GABP ChIP-Seq reads from the promoter and CpG island of the Nitric oxide synthase interacting protein gene. Hypothetical GABP binding in five cells and the corresponding DNA fragments with sequencing reads. Below, actual read data. Forward reads are displayed as small blue bands and reverse reads as small maroon bands. (B) Forward (blue) and reverse (maroon) Read Density Profiles derived from the read data contribute to the Combined Density Profile (orange). The zero x-coordinate corresponds to coordinate 54775300 of human Chromosome 19, NCBI build 36.
Figure 2
Reproducibility and robustness of QuEST results assessed by comparison between two independent NRSF data sets. (A) Correlation between NRSF polyclonal and NRSF monoclonal peak scores (rho = 0.97) with the inset expanding the portion near the graph origin. (B) Bar chart of the distance between NRSF polyclonal and NRSF monoclonal peak call positions.
Figure 3
Resolution of QuEST as quantified by the distance between QuEST peak calls and TFBS motif centers. Histograms in each panel represent the distribution of peak distances to the nearest high-scoring motif.
Figure 4
Motif analysis results. Each panel displays significantly overrepresented motif Weblogos for each of the three transcription factors. Pie-charts show the fraction of peaks with motifs in close proximity to the peak (< 100 bps). Histograms show the distribution of the motif number within 100 bps of the peak.
Similar articles
- Genome-wide identification of in vivo protein-DNA binding sites from ChIP-Seq data.
Jothi R, Cuddapah S, Barski A, Cui K, Zhao K. Jothi R, et al. Nucleic Acids Res. 2008 Sep;36(16):5221-31. doi: 10.1093/nar/gkn488. Epub 2008 Aug 6. Nucleic Acids Res. 2008. PMID: 18684996 Free PMC article. - De novo motif identification improves the accuracy of predicting transcription factor binding sites in ChIP-Seq data analysis.
Boeva V, Surdez D, Guillon N, Tirode F, Fejes AP, Delattre O, Barillot E. Boeva V, et al. Nucleic Acids Res. 2010 Jun;38(11):e126. doi: 10.1093/nar/gkq217. Epub 2010 Apr 7. Nucleic Acids Res. 2010. PMID: 20375099 Free PMC article. - MEME-ChIP: motif analysis of large DNA datasets.
Machanick P, Bailey TL. Machanick P, et al. Bioinformatics. 2011 Jun 15;27(12):1696-7. doi: 10.1093/bioinformatics/btr189. Epub 2011 Apr 12. Bioinformatics. 2011. PMID: 21486936 Free PMC article. - Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond.
Mundade R, Ozer HG, Wei H, Prabhu L, Lu T. Mundade R, et al. Cell Cycle. 2014;13(18):2847-52. doi: 10.4161/15384101.2014.949201. Cell Cycle. 2014. PMID: 25486472 Free PMC article. Review. - Genome Wide Approaches to Identify Protein-DNA Interactions.
Ma T, Ye Z, Wang L. Ma T, et al. Curr Med Chem. 2019;26(42):7641-7654. doi: 10.2174/0929867325666180530115711. Curr Med Chem. 2019. PMID: 29848263 Review.
Cited by
- GATA-1 genome-wide occupancy associates with distinct epigenetic profiles in mouse fetal liver erythropoiesis.
Papadopoulos GL, Karkoulia E, Tsamardinos I, Porcher C, Ragoussis J, Bungert J, Strouboulis J. Papadopoulos GL, et al. Nucleic Acids Res. 2013 May;41(9):4938-48. doi: 10.1093/nar/gkt167. Epub 2013 Mar 21. Nucleic Acids Res. 2013. PMID: 23519611 Free PMC article. - Dynamic change of chromatin conformation in response to hypoxia enhances the expression of GLUT3 (SLC2A3) by cooperative interaction of hypoxia-inducible factor 1 and KDM3A.
Mimura I, Nangaku M, Kanki Y, Tsutsumi S, Inoue T, Kohro T, Yamamoto S, Fujita T, Shimamura T, Suehiro J, Taguchi A, Kobayashi M, Tanimura K, Inagaki T, Tanaka T, Hamakubo T, Sakai J, Aburatani H, Kodama T, Wada Y. Mimura I, et al. Mol Cell Biol. 2012 Aug;32(15):3018-32. doi: 10.1128/MCB.06643-11. Epub 2012 May 29. Mol Cell Biol. 2012. PMID: 22645302 Free PMC article. - Reassessment of Piwi binding to the genome and Piwi impact on RNA polymerase II distribution.
Lin H, Chen M, Kundaje A, Valouev A, Yin H, Liu N, Neuenkirchen N, Zhong M, Snyder M. Lin H, et al. Dev Cell. 2015 Mar 23;32(6):772-4. doi: 10.1016/j.devcel.2015.03.004. Dev Cell. 2015. PMID: 25805139 Free PMC article. - Reciprocal regulation of the basic helix-loop-helix/Per-Arnt-Sim partner proteins, Arnt and Arnt2, during neuronal differentiation.
Hao N, Bhakti VL, Peet DJ, Whitelaw ML. Hao N, et al. Nucleic Acids Res. 2013 Jun;41(11):5626-38. doi: 10.1093/nar/gkt206. Epub 2013 Apr 17. Nucleic Acids Res. 2013. PMID: 23599003 Free PMC article. - Composition and organization of active centromere sequences in complex genomes.
Hayden KE, Willard HF. Hayden KE, et al. BMC Genomics. 2012 Jul 20;13:324. doi: 10.1186/1471-2164-13-324. BMC Genomics. 2012. PMID: 22817545 Free PMC article.
References
- Cawley S, Bekiranov S, Ng HH, Kapranov P, Sekinger EA, Kampa D, Piccolboni A, Sementchenko V, Cheng J, Williams AJ, Wheeler R, Wong B, Drenkow J, Yamanaka M, Patel S, Brubaker S, Tammana H, Helt G, Struhl K, Gingeras TR. Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell. 2004 Feb 20;116(4):499–509. 2004. - PubMed
- Pokholok DK, Zeitlinger J, Hannett NM, Reynolds DB, Young RA. Activated Signal Transduction Kinases Frequently Occupy Target Genes. Science. 2006 Jul 28;Vol. 313.(no. 5786):533–536. - PubMed
- Lieb JD. Genome-wide mapping of protein-DNA interactions by chromatin immunoprecipitation and DNA microarray hybridization. Methods Mol Biol. 2003;224:99–109. - PubMed
- Johnson DS, Mortazavi A, Myers RM, Wold B. Genome-wide mapping of in vivo protein-DNA interactions. Science. 2007 Jun 8;316(5830):1497–1502. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- 5 U01 HG003162/HG/NHGRI NIH HHS/United States
- U01 HG003162-01/HG/NHGRI NIH HHS/United States
- 1 U54-HG004576/HG/NHGRI NIH HHS/United States
- U01 HG003162/HG/NHGRI NIH HHS/United States
- U54 HG004576-01/HG/NHGRI NIH HHS/United States
- U54 HG004576/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous