DNase I sensitivity QTLs are a major determinant of human expression variation (original) (raw)

Nature volume 482, pages 390–394 (2012)Cite this article

Subjects

Abstract

The mapping of expression quantitative trait loci (eQTLs) has emerged as an important tool for linking genetic variation to changes in gene regulation1,2,3,4,5. However, it remains difficult to identify the causal variants underlying eQTLs, and little is known about the regulatory mechanisms by which they act. Here we show that genetic variants that modify chromatin accessibility and transcription factor binding are a major mechanism through which genetic variation leads to gene expression differences among humans. We used DNase I sequencing to measure chromatin accessibility in 70 Yoruba lymphoblastoid cell lines, for which genome-wide genotypes and estimates of gene expression levels are also available6,7,8. We obtained a total of 2.7 billion uniquely mapped DNase I-sequencing (DNase-seq) reads, which allowed us to produce genome-wide maps of chromatin accessibility for each individual. We identified 8,902 locations at which the DNase-seq read depth correlated significantly with genotype at a nearby single nucleotide polymorphism or insertion/deletion (false discovery rate = 10%). We call such variants ‘DNase I sensitivity quantitative trait loci’ (dsQTLs). We found that dsQTLs are strongly enriched within inferred transcription factor binding sites and are frequently associated with allele-specific changes in transcription factor binding. A substantial fraction (16%) of dsQTLs are also associated with variation in the expression levels of nearby genes (that is, these loci are also classified as eQTLs). Conversely, we estimate that as many as 55% of eQTL single nucleotide polymorphisms are also dsQTLs. Our observations indicate that dsQTLs are highly abundant in the human genome and are likely to be important contributors to phenotypic variation.

This is a preview of subscription content, access via your institution

Access options

Subscribe to this journal

Receive 51 print issues and online access

$199.00 per year

only $3.90 per issue

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Additional access options:

Similar content being viewed by others

Accession codes

Primary accessions

Gene Expression Omnibus

Data deposits

All raw data and tables of all dsQTLs are deposited in GEO under accession number GSE31388 and at http://eqtl.uchicago.edu.

References

  1. Brem, R. B., Yvert, G., Clinton, R. & Kruglyak, L. Genetic dissection of transcriptional regulation in budding yeast. Science 296, 752–755 (2002)
    Article ADS CAS Google Scholar
  2. Cheung, V. G. et al. Mapping determinants of human gene expression by regional and genome-wide association. Nature 437, 1365–1369 (2005)
    Article ADS CAS Google Scholar
  3. Nicolae, D. L. et al. Trait-associated SNPs are more likely to be eQTLs: annotation to enhance discovery from GWAS. PLoS Genet. 6, e1000888 (2010)
    Article Google Scholar
  4. Nica, A. C. et al. Candidate causal regulatory effects by integration of expression QTLs with complex trait genetic associations. PLoS Genet. 6, e1000895 (2010)
    Article Google Scholar
  5. Lango Allen, H. et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 467, 832–838 (2010)
    Article ADS CAS Google Scholar
  6. Frazer, K. A. et al. A second generation human haplotype map of over 3.1 million SNPs. Nature 449, 851–861 (2007)
    Article ADS CAS Google Scholar
  7. Durbin, R. M. et al. A map of human genome variation from population-scale sequencing. Nature 467, 1061–1073 (2010)
    Article ADS CAS Google Scholar
  8. Pickrell, J. K. et al. Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature 464, 768–772 (2010)
    Article ADS CAS Google Scholar
  9. Gaulton, K. J. et al. A map of open chromatin in human pancreatic islets. Nature Genet. 42, 255–259 (2010)
    Article CAS Google Scholar
  10. Kasowski, M. et al. Variation in transcription factor binding among humans. Science 328, 232–235 (2010)
    Article ADS CAS Google Scholar
  11. McDaniell, R. et al. Heritable individual-specific and allele-specific chromatin signatures in humans. Science 328, 235–239 (2010)
    Article ADS CAS Google Scholar
  12. Zheng, W., Zhao, H., Mancera, E., Steinmetz, L. M. & Snyder, M. Genetic analysis of variation in transcription factor binding in yeast. Nature 464, 1187–1191 (2010)
    Article ADS CAS Google Scholar
  13. Galas, D. & Schmitz, A. DNAse footprinting: a simple method for the detection of protein-DNA binding specificity. Nucleic Acids Res. 5, 3157–3170 (1978)
    Article CAS Google Scholar
  14. Boyle, A. P. et al. High-resolution mapping and characterization of open chromatin across the genome. Cell 132, 311–322 (2008)
    Article CAS Google Scholar
  15. Hesselberth, J. R. et al. Global mapping of protein-DNA interactions in vivo by digital genomic footprinting. Nature Methods 6, 283–289 (2009)
    Article CAS Google Scholar
  16. Boyle, A. P. et al. High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells. Genome Res. 21, 456–464 (2011)
    Article CAS Google Scholar
  17. Pique-Regi, R. et al. Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data. Genome Res. 21, 447–455 (2011)
    Article CAS Google Scholar
  18. Ernst, J. et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 473, 43–49 (2011)
    Article ADS CAS Google Scholar
  19. Veyrieras, J. B. et al. High-resolution mapping of expression-QTLs yields insight into human gene regulation. PLoS Genet. 4, e1000214 (2008)
    Article Google Scholar
  20. Mirny, L. A. Nucleosome-mediated cooperativity between transcription factors. Proc. Natl Acad. Sci. USA 107, 22534–22539 (2010)
    Article ADS CAS Google Scholar
  21. Wasson, T. & Hartemink, A. J. An ensemble model of competitive multi-factor binding of the genome. Genome Res. 19, 2101–2112 (2009)
    Article CAS Google Scholar
  22. Raveh-Sadka, T., Levo, M. & Segal, E. Incorporating nucleosomes into thermodynamic models of transcription regulation. Genome Res. 19, 1480–1496 (2009)
    Article CAS Google Scholar
  23. Myers, R. M. et al. A user’s guide to the encyclopedia of DNA elements (ENCODE). PLoS Biol. 9, e1001046 (2011)
    Article CAS Google Scholar
  24. Storey, J. D., Taylor, J. E. & Siegmund, D. Strong control, conservative point estimation, and simultaneous conservative consistency of false discovery rates: a unified approach. J. R. Stat. Soc., B 66, 187–205 (2004)
    Article MathSciNet Google Scholar
  25. Dixon, A. L. et al. A genome-wide association study of global gene expression. Nature Genet. 39, 1202–1207 (2007)
    Article CAS Google Scholar
  26. Stranger, B. E. et al. Population genomics of human gene expression. Nature Genet. 39, 1217–1224 (2007)
    Article CAS Google Scholar
  27. Bell, J. T. et al. DNA methylation patterns associate with genetic and gene expression variation in HapMap cell lines. Genome Biol. 12, R10 (2011)
    Article CAS Google Scholar
  28. Visel, A. et al. ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature 457, 854–858 (2009)
    Article ADS CAS Google Scholar
  29. Song, L. & Crawford, G. E. DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. Cold Spring Harb. Protocols 10.1101/pdb.prot5384 (2010)
  30. Guan, Y. & Stephens, M. Practical issues in imputation-based association mapping. PLoS Genet. 4, e1000279 (2008)
    Article Google Scholar

Download references

Acknowledgements

We thank members of the Pritchard, Przeworski, Stephens and Gilad laboratories for many helpful comments or discussions, and the ENCODE Project for publicly available ChIP-seq data. This work was supported by grants from the National Institutes of Health to Y.G. (HG006123) and J.K.P. (MH084703 and MH090951), by the Howard Hughes Medical Institute, by the Chicago Fellows Program (to R.P.R.), by the American Heart Association (to A.A.P.), and by the NIH Genetics and Regulation Training grant (A.A.P. and J.F.D.).

Author information

Author notes

  1. Jacob F. Degner, Athma A. Pai and Roger Pique-Regi: These authors contributed equally to this work.

Authors and Affiliations

  1. Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA,
    Jacob F. Degner, Athma A. Pai, Roger Pique-Regi, Jean-Baptiste Veyrieras, Daniel J. Gaffney, Joseph K. Pickrell, Matthew Stephens, Yoav Gilad & Jonathan K. Pritchard
  2. Committee on Genetics, Genomics and Systems Biology, University of Chicago, Chicago, Illinois 60637, USA,
    Jacob F. Degner
  3. BioMiningLabs, 69001 Lyon, France,
    Jean-Baptiste Veyrieras
  4. Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois 60637, USA,
    Daniel J. Gaffney, Sherryl De Leon, Katelyn Michelini, Noah Lewellen & Jonathan K. Pritchard
  5. Institute for Genome Sciences and Policy, Duke University, Durham, North Carolina 27708, USA,
    Gregory E. Crawford
  6. Department of Pediatrics, Division of Medical Genetics, Duke University School of Medicine, Durham, North Carolina 27708, USA,
    Gregory E. Crawford
  7. Department of Statistics, University of Chicago, Chicago, Illinois 60637, USA,
    Matthew Stephens

Authors

  1. Jacob F. Degner
    You can also search for this author inPubMed Google Scholar
  2. Athma A. Pai
    You can also search for this author inPubMed Google Scholar
  3. Roger Pique-Regi
    You can also search for this author inPubMed Google Scholar
  4. Jean-Baptiste Veyrieras
    You can also search for this author inPubMed Google Scholar
  5. Daniel J. Gaffney
    You can also search for this author inPubMed Google Scholar
  6. Joseph K. Pickrell
    You can also search for this author inPubMed Google Scholar
  7. Sherryl De Leon
    You can also search for this author inPubMed Google Scholar
  8. Katelyn Michelini
    You can also search for this author inPubMed Google Scholar
  9. Noah Lewellen
    You can also search for this author inPubMed Google Scholar
  10. Gregory E. Crawford
    You can also search for this author inPubMed Google Scholar
  11. Matthew Stephens
    You can also search for this author inPubMed Google Scholar
  12. Yoav Gilad
    You can also search for this author inPubMed Google Scholar
  13. Jonathan K. Pritchard
    You can also search for this author inPubMed Google Scholar

Contributions

A.A.P. led the data collection with assistance from S.D.L., K.M. and N.L. The data analysis was performed jointly by J.F.D. and R.P.R., with contributions from A.A.P., J.B.V., D.J.G. and J.K.Pi. G.E.C. and M.S. provided technical assistance and discussion of methods and results. The manuscript was written by J.F.D., A.A.P., R.P.R., Y.G. and J.K.Pr. The project was jointly supervised by Y.G. and J.K.Pr.

Corresponding authors

Correspondence toYoav Gilad or Jonathan K. Pritchard.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

This file contains Supplementary Text and Data, Supplementary Figures 1-28 with legends, Supplementary Tables 1-9 and additional references. (PDF 3101 kb)

PowerPoint slides

Rights and permissions

About this article

Cite this article

Degner, J., Pai, A., Pique-Regi, R. et al. DNase I sensitivity QTLs are a major determinant of human expression variation.Nature 482, 390–394 (2012). https://doi.org/10.1038/nature10808

Download citation

This article is cited by

Editorial Summary

Mapping genetic variation

Expression quantitative trait loci (eQTLs) are stretches of DNA that regulate gene transcription and expression and contribute to a particular phenotypic trait. eQTL mapping is an important tool for linking genetic variation to changes in gene regulation, but identifying the causal variants underlying eQTLs and the regulatory mechanisms involved remains a challenge. Degner et al. used DNaseI sequencing to measure genome-wide chromatin accessibility in 70 Yoruba lymphoblastoid cell lines to produce genome-wide maps of chromatin accessibility for each individual. They identify variants that they call DNaseI sensitivity quantitative trait loci (dsQTLs). The implication is that changes in chromatin accessibility or transcription-factor binding occur at many gene loci and are likely to be important contributors to phenotypic variation.