Tissue-specific genetic control of splicing: implications for the study of complex traits - PubMed (original) (raw)

Tissue-specific genetic control of splicing: implications for the study of complex traits

Erin L Heinzen et al. PLoS Biol. 2008.

Abstract

Numerous genome-wide screens for polymorphisms that influence gene expression have provided key insights into the genetic control of transcription. Despite this work, the relevance of specific polymorphisms to in vivo expression and splicing remains unclear. We carried out the first genome-wide screen, to our knowledge, for SNPs that associate with alternative splicing and gene expression in human primary cells, evaluating 93 autopsy-collected cortical brain tissue samples with no defined neuropsychiatric condition and 80 peripheral blood mononucleated cell samples collected from living healthy donors. We identified 23 high confidence associations with total expression and 80 with alternative splicing as reflected by expression levels of specific exons. Fewer than 50% of the implicated SNPs however show effects in both tissue types, reflecting strong evidence for distinct genetic control of splicing and expression in the two tissue types. The data generated here also suggest the possibility that splicing effects may be responsible for up to 13 out of 84 reported genome-wide significant associations with human traits. These results emphasize the importance of establishing a database of polymorphisms affecting splicing and expression in primary tissue types and suggest that splicing effects may be of more phenotypic significance than overall gene expression changes.

PubMed Disclaimer

Conflict of interest statement

Competing interests. The authors have declared that no competing interests exist.

Figures

Figure 1

Figure 1. Idealized Representation of How Overall Expression and Alternative Splicing Events Are Reflected in the Exon Array Data

Top panel: In this study, all exon-level data were normalized across all exons and individuals. Transcript-level expression was reported for each transcript interrogated on the array by averaging (PLIER method) exon expression levels for all exons contained in a transcript (annotation details can be found at

http://www.affymetrix.com

). Subject 1 in this example has a higher overall transcript expression level (indicated by green line representing an average of exons A–G within subject 1) compared to subject 2 (red line). In this example, all exons contained in the transcript were expressed at approximately equal levels, suggesting that this transcript does not have alternative splice variants in either subject. Middle panel: An example of the detection of alternative splicing in which multiple transcripts are produced from a single gene through unique combination of the coding regions. Exon D in subject 2 appears to be expressed at lower quantities when compared to the other exons in the transcript (exon D expression levels lie below the average transcript line), indicating that this exon may be spliced out of the transcript in this subject (i.e., higher expression of transcript isoform II). Bottom panel: A scenario where we cannot definitively establish the combinatorial assembly of exons using these data. In this example, subject 2 has lower expression of both exons B and D. We cannot conclude that this subject has a higher proportion of transcript IV expressed compared to subject 1 or if transcript II and III are expressed at higher levels. Despite this shortcoming in situations of large heterogeneity of transcript isoforms produced with multiple alternative splicing events, these data provide a clear indicator of alternative exon composition within transcripts for a given individual. This study was specifically focused on the _cis_-acting genetic regulation of overall expression and splicing. Therefore, we were interested in identifying groups of splicing and expression patterns unique to individuals with the same genotype at certain commonly variant loci in and surrounding the transcript or exon. In cases where the expression of multiple exons was under genetic regulation, we declared it a splicing event if <40% of all exons contained in the transcript were associated with high confidence with the genotype. If >40% of exons were implicated, this was considered to be an overall expression change.

Figure 2

Figure 2. Principal Components Analysis of All Exon Expression Level Data for Both Brain and PBMC Samples

The differentiated pattern of expression suggests the need for tissue specific evaluations of alternative splicing and expression and demonstrates the added benefit of studying genetic regulation of splicing and expression in two unique and important cellular populations. A similar profile was observed for the transcript level expression values in the two tissue types suggesting the same level of tissue specificity for splicing.

Figure 3

Figure 3. Quantitative Real-Time PCR Confirmation of Selected Genetically Regulated Exon-Level Expression Changes in the Two Tissue Types

Three representative scenarios are presented. The top panel shows an sQTL that was present in both brain and PBMCs. The middle panel and bottom panel show sQTLs unique to a particular tissue type, providing unequivocal evidence for tissue specific genetic regulation.

Figure 4

Figure 4. Methodological Details Evaluating the Proximity of a Detected sQTL and Its Region of LD to a Splicing Regulatory Region

Red box represents an exon whose expression is correlated with the SNP indicated by the starred red bar. All SNPs in LD with this sQTL are shown by red bars and the height of the bar indicates the level of correlation (r 2) with the starred SNP. We assessed how often the range of LD for a given sQTL (defined by r 2 > 0.2 with the sQTL) extended into or surpassed the splicing regulatory region. This analysis was performed by evaluating all mRNA transcripts containing the exon regulated by the sQTL. The splicing regulatory region was defined as the genomic region from the start of the exon located upstream of the associated exon through the stop site of the downstream exon interrogated. If the exon was part of multiple transcripts the region including the most distal and proximal neighboring exons was defined the splicing regulatory region. If the affected exon was located at the beginning or end of the transcript then the range was truncated at the start or stop of the transcript, respectively. Finally, if a single SNP associated with more than one exon in a single transcript they were considered as a single entry in this analysis (i.e., sQTL LD needed only come in close proximity of one of the affected exons to be counted as a positive entry).

Figure 5

Figure 5. SNPExpress Database

Top panel: Output showing an example of an association between rs10876864 and a splicing change in RPS26. Software permits the input of an SNP, a gene, or a genomic region for comprehensive interrogation of associations between SNPs and exon/transcript expression levels in the regions surrounding the SNP. The blue frame indicates the SNPs genotyped on the chip, the two lighter blue frames correspond to the −log _p_-values for the brain and PBMC samples, the turquoise panel contains all Ensembl transcripts, and the bottom green panel shows all of the exons/transcripts screened for on the array. Bottom panel: This database can be applied in a SNP-centric or gene-centric approach for determining the functional and phenotypic consequences of genetic variation on the transcriptome.

Similar articles

Cited by

References

    1. Cheung VG, Spielman RS, Ewens KG, Weber TM, Morley M, et al. Mapping determinants of human gene expression by regional and genome-wide association. Nature. 2005;437:1365–1369. - PMC - PubMed
    1. Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, et al. Genome-wide associations of gene expression variation in humans. PLoS Genet. 2005;1:e78. doi: <10.1371/journal.pgen.0010078>. - DOI - PMC - PubMed
    1. Dixon AL, Liang L, Moffatt MF, Chen W, Heath S, et al. A genome-wide association study of global gene expression. Nat Genet. 2007;39:1202–1207. - PubMed
    1. Hull J, Campino S, Rowlands K, Chan MS, Copley RR, et al. Identification of common genetic variation that modulates alternative splicing. PLoS Genet. 2007;3:e99. doi: <10.1371/journal.pgen.0030099>. - DOI - PMC - PubMed
    1. Kwan T, Benovoy D, Dias C, Gurd S, Provencher C, et al. Genome-wide analysis of transcript isoform variation in humans. Nat Genet. 2008;40:225–231. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources