A cross-platform analysis of 14,177 expression quantitative trait loci derived from lymphoblastoid cell lines - PubMed (original) (raw)
A cross-platform analysis of 14,177 expression quantitative trait loci derived from lymphoblastoid cell lines
Liming Liang et al. Genome Res. 2013 Apr.
Abstract
Gene expression levels can be an important link DNA between variation and phenotypic manifestations. Our previous map of global gene expression, based on ~400K single nucleotide polymorphisms (SNPs) and 50K transcripts in 400 sib pairs from the MRCA family panel, has been widely used to interpret the results of genome-wide association studies (GWASs). Here, we more than double the size of our initial data set with expression data on 550 additional individuals from the MRCE family panel using the Illumina whole-genome expression array. We have used new statistical methods for dimension reduction to account for nongenetic effects in estimates of expression levels, and we have also included SNPs imputed from the 1000 Genomes Project. Our methods reduced false-discovery rates and increased the number of expression quantitative trait loci (eQTLs) mapped either locally or at a distance (i.e., in cis or trans) from 1534 in the MRCA data set to 4452 (with <5% FDR). Imputation of 1000 Genomes SNPs further increased the number of eQTLs to 7302. Using the same methods and imputed SNPs in the newly acquired MRCE data set, we identified eQTLs for 9000 genes. The combined results identify strong local and distant effects for transcripts from 14,177 genes. Our eQTL database based on these results is freely available to help define the function of disease-associated variants.
Figures
Figure 1.
Empirical estimate of false discovery rate. (noPC) Using original expression value; (PC) adjusting nongenetic effect using the top 69 principal components; (1000G) imputation using SNPs from the 1000 Genomes project; (HapMap) imputation using HapMap2 SNPs; (300K) using autosomal SNPs from the Illumina 300K panel.
Figure 2.
Replication rate by distance from the eQTL to the transcript. (Red lines) Replication of eQTL. (A) Local effect; (B) distant syntenic effect (>1 Mb but gene and SNP on the same chromosome); and (C) distant effect (on a different chromosome). The analysis is based on autosomal SNPs from the Illumina 300K panel.
Figure 3.
Imputation accuracy by minor allele frequency (MAF; MRCA panel). (A) Correlation (_R_2) between 1000G imputed allele dosage derived from Illumina 300K arrays and true allele counts measured by Illumina 100K arrays, plotted by MAF. (B) Histogram of correlation with true allele counts by MAF in sample (upper panels) and minor allele counts in the 1000G reference haplotype. Note that there are no data in the bottom left panel following removal of singletons from the reference haplotype.
Figure 4.
Comparison of number of local eQTLs identified by directly genotyped SNPs, imputed HapMap2 SNPs, and imputed SNPs from the 1000 Genomes Project. (A) Results from Affymetrix expression data in the MRCA panel. (B) Results from Illumina expression data in the MRCE panel. (Blue bars) Original unadjusted expression; (red bars) expression values adjusted by the top principal components.
Figure 5.
Proportion of significant transcripts by overall heritability. (A) Unadjusted expression data and SNPs from the Illumina 300K panel in the MRCA subjects. (B) PC-adjusted expression data and the imputation of 1000G SNPs in the MRCA subjects. (C) Unadjusted expression data and SNPs from the Illumina 300K panel in the MRCE subjects. (D) PC-adjusted expression data and the imputation of 1000G SNPs in the MRCE subjects. The number of transcripts in each heritability category is given at the bottom of each bar.
Figure 6.
cis eQTL of the gene TIMM22. A map of association of SNPs to transcript abundance of the TIMM22 gene exemplifies the progressive increase in information for the experimental genotypes (A), experimental plus HapMap imputed SNPs (B), and experimental, HapMap imputed and 1000G imputed SNPs (C). The gray vertical bar at the gene TIMM22 indicates the position of the Affymetrix probe
Similar articles
- Integrated genome-wide analysis of expression quantitative trait loci aids interpretation of genomic association studies.
Joehanes R, Zhang X, Huan T, Yao C, Ying SX, Nguyen QT, Demirkale CY, Feolo ML, Sharopova NR, Sturcke A, Schäffer AA, Heard-Costa N, Chen H, Liu PC, Wang R, Woodhouse KA, Tanriverdi K, Freedman JE, Raghavachari N, Dupuis J, Johnson AD, O'Donnell CJ, Levy D, Munson PJ. Joehanes R, et al. Genome Biol. 2017 Jan 25;18(1):16. doi: 10.1186/s13059-016-1142-6. Genome Biol. 2017. PMID: 28122634 Free PMC article. - ncRNA-eQTL: a database to systematically evaluate the effects of SNPs on non-coding RNA expression across cancer types.
Li J, Xue Y, Amin MT, Yang Y, Yang J, Zhang W, Yang W, Niu X, Zhang HY, Gong J. Li J, et al. Nucleic Acids Res. 2020 Jan 8;48(D1):D956-D963. doi: 10.1093/nar/gkz711. Nucleic Acids Res. 2020. PMID: 31410488 Free PMC article. - Expression quantitative trait loci analysis identifies associations between genotype and gene expression in human intestine.
Kabakchiev B, Silverberg MS. Kabakchiev B, et al. Gastroenterology. 2013 Jun;144(7):1488-96, 1496.e1-3. doi: 10.1053/j.gastro.2013.03.001. Epub 2013 Mar 6. Gastroenterology. 2013. PMID: 23474282 Free PMC article. - Expression Quantitative Trait Loci Information Improves Predictive Modeling of Disease Relevance of Non-Coding Genetic Variation.
Croteau-Chonka DC, Rogers AJ, Raj T, McGeachie MJ, Qiu W, Ziniti JP, Stubbs BJ, Liang L, Martinez FD, Strunk RC, Lemanske RF Jr, Liu AH, Stranger BE, Carey VJ, Raby BA. Croteau-Chonka DC, et al. PLoS One. 2015 Oct 16;10(10):e0140758. doi: 10.1371/journal.pone.0140758. eCollection 2015. PLoS One. 2015. PMID: 26474488 Free PMC article. Review. - eQTL analysis in mice and rats.
Tesson BM, Jansen RC. Tesson BM, et al. Methods Mol Biol. 2009;573:285-309. doi: 10.1007/978-1-60761-247-6_16. Methods Mol Biol. 2009. PMID: 19763934 Review.
Cited by
- Genetics and epidemiology of mutational barcode-defined clonal hematopoiesis.
Stacey SN, Zink F, Halldorsson GH, Stefansdottir L, Gudjonsson SA, Einarsson G, Hjörleifsson G, Eiriksdottir T, Helgadottir A, Björnsdottir G, Thorgeirsson TE, Olafsdottir TA, Jonsdottir I, Gretarsdottir S, Tragante V, Magnusson MK, Jonsson H, Gudmundsson J, Olafsson S, Holm H, Gudbjartsson DF, Sulem P, Helgason A, Thorsteinsdottir U, Tryggvadottir L, Rafnar T, Melsted P, Ulfarsson MÖ, Vidarsson B, Thorleifsson G, Stefansson K. Stacey SN, et al. Nat Genet. 2023 Dec;55(12):2149-2159. doi: 10.1038/s41588-023-01555-z. Epub 2023 Nov 6. Nat Genet. 2023. PMID: 37932435 Free PMC article. - Whole genome DNA and RNA sequencing of whole blood elucidates the genetic architecture of gene expression underlying a wide range of diseases.
Liu C, Joehanes R, Ma J, Wang Y, Sun X, Keshawarz A, Sooda M, Huan T, Hwang SJ, Bui H, Tejada B, Munson PJ, Demirkale CY, Heard-Costa NL, Pitsillides AN, Peloso GM, Feolo M, Sharopova N, Vasan RS, Levy D. Liu C, et al. Sci Rep. 2022 Nov 23;12(1):20167. doi: 10.1038/s41598-022-24611-w. Sci Rep. 2022. PMID: 36424512 Free PMC article. - Whole Genome DNA and RNA Sequencing of Whole Blood Elucidates the Genetic Architecture of Gene Expression Underlying a Wide Range of Diseases.
Liu C, Joehanes R, Ma J, Wang Y, Sun X, Keshawarz A, Sooda M, Huan T, Hwang SJ, Bui H, Tejada B, Munson PJ, Cumhur D, Heard-Costa NL, Pitsillides AN, Peloso GM, Feolo M, Sharopova N, Vasan RS, Levy D. Liu C, et al. Res Sq [Preprint]. 2022 May 31:rs.3.rs-1598646. doi: 10.21203/rs.3.rs-1598646/v1. Res Sq. 2022. PMID: 35664994 Free PMC article. Updated. Preprint. - Whole Genome DNA and RNA Sequencing of Whole Blood Elucidates the Genetic Architecture of Gene Expression Underlying a Wide Range of Diseases.
Liu C, Joehanes R, Ma J, Wang Y, Sun X, Keshawarz A, Sooda M, Huan T, Hwang SJ, Bui H, Tejada B, Munson PJ, Cumhur D, Heard-Costa NL, Pitsillides AN, Peloso GM, Feolo M, Sharopova N, Vasan RS, Levy D. Liu C, et al. medRxiv [Preprint]. 2022 May 3:2022.04.13.22273841. doi: 10.1101/2022.04.13.22273841. medRxiv. 2022. PMID: 35547845 Free PMC article. Updated. Preprint. - Genetic variants associated mRNA stability in lung.
Li JR, Tang M, Li Y, Amos CI, Cheng C. Li JR, et al. BMC Genomics. 2022 Mar 11;23(1):196. doi: 10.1186/s12864-022-08405-y. BMC Genomics. 2022. PMID: 35272635 Free PMC article.
References
- Abecasis GR, Cherny SS, Cookson WO, Cardon LR 2002. Merlin–rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet 30: 97–101 - PubMed
- Akey JM, Biswas S, Leek JT, Storey JD 2007. On the design and analysis of gene expression studies in human populations. Nat Genet 39: 807–808 - PubMed
- Benjamini Y, Hochberg Y 1995. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol 57: 289–300
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources