AltTrans: transcript pattern variants annotated for both alternative splicing and alternative polyadenylation - PubMed (original) (raw)
AltTrans: transcript pattern variants annotated for both alternative splicing and alternative polyadenylation
Vincent Le Texier et al. BMC Bioinformatics. 2006.
Abstract
Background: The three major mechanisms that regulate transcript formation involve the selection of alternative sites for transcription start (TS), splicing, and polyadenylation. Currently there are efforts that collect data & annotation individually for each of these variants. It is important to take an integrated view of these data sets and to derive a data set of alternate transcripts along with consolidated annotation. We have been developing in the past computational pipelines that generate value-added data at genome-scale on individual variant types; these include AltSplice on splicing and AltPAS on polyadenylation. We now extend these pipelines and integrate the resultant data sets to facilitate an integrated view of the contributions from splicing and polyadenylation in the formation of transcript variants.
Description: The AltSplice pipeline examines gene-transcript alignments and delineates alternative splice events and splice patterns; this pipeline is extended as AltTrans to delineate isoform transcript patterns for each of which both introns/exons and 'terminating' polyA site are delineated; EST/mRNA sequences that qualify the transcript pattern confirm both the underlying splicing and polyadenylation. The AltPAS pipeline examines gene-transcript alignments and delineates all potential polyA sites irrespective of underlying splicing patterns. Resultant polyA sites from both AltTrans and AltPAS are merged. The generated database reports data on alternative splicing, alternative polyadenylation and the resultant alternate transcript patterns; the basal data is annotated for various biological features. The data (named as integrated AltTrans data) generated for both the organisms of human and mouse is made available through the Alternate Transcript Diversity web site at http://www.ebi.ac.uk/atd/.
Conclusion: The reported data set presents alternate transcript patterns that are annotated for both alternative splicing and alternative polyadenylation. Results based on current transcriptome data indicate that the contribution of alternative splicing is larger than that of alternative polyadenylation.
Figures
Figure 1
Derivation of transcript patterns by the AltTrans pipeline from AltSplice splice patterns. Each of the gene-transcript alignments from AltSplice is examined for the following: (i) the alignment shows a 3' dangling end on the EST/mRNA; (ii) such a dangling end shows a polyA tail sequence; and (iii) a polyA signal is seen on the gene within a maximum distance of 40 nts 5' to the cleavage position. Transcripts that show these features are grouped in a manner that each class of transcripts possesses the same exon/intron organisation and the same terminating polyA site. Such derived alternate transcript patterns are described as AltTrans transcript patterns. Note: Of the three EST's, that are all grouped under one AltSplice splice pattern, the EST3 does not show a "dangling end" and hence it is not considered further in the construction of AltTrans Transcript Patterns. EST1 and EST2 form two distinct transcript patterns that differ in terminating polyA sites.
Figure 2
Illustration of the relationship between the AltTrans, AltSplice, and AltPAS pipelines/data.
Figure 3
Distribution of spacing between polyA cleavage (PAC) site and polyA signal (PAS) in human transcript patterns from AltTrans. The bottom inset uses the data set of heterogeneous polyA sites; the top inset uses the data set of representative polyA sites (Nearby heterogeneous polyA sites are grouped and a representative polyA site is chosen – see text for methods).
Figure 4
Examples of PolyA table and transcript pattern table. Locations of the polyA site and signal are as on the gene. Status of the polyA site refers to whether the site is identified by the AltTrans or AltPAS pipeline. Entry in the last column is hyperlinked to pages listing detailed information on the confirming transcript sequences.
Figure 5
Example of splice pattern table and splice event table. Locations of exons are as on the gene.Inset A: Entry in column 1 is hyperlinked to a page listing the sequence of the splice pattern. Entry in column 2 gives the coding start & end positions on the gene and the length of the translated peptide sequence and is hyperlinked to a page listing the peptide sequence. Entry in column 3 lists the structure of the splice pattern as a string of exons. Entry in column 4 is hyperlinked to pages listing detailed information on the confirming transcript sequences. Entry in column 5 is hyperlinked to pages listing EST/mRNA sequences. Entry in column 6 is hyperlinked to pages listing allele specificity of the splice pattern. Inset B: Column 1 lists the exons involved in the event (in this example cassette exon event). Column 2 indicates whether the event involves modifications in the flanking exons as well; entries are hyperlinked to pages listing detailed information on the event. Column 3 indicates the identifier of the orthologous gene and the coordinates of the exon orthologous to the one presented in column1; the entry is hyperlinked to the orthologous gene entry.
Figure 6
Inset A: Example of transcript pattern view. Exons are indicated by boxes and introns by lines. Exons/introns that are variants are indicated in blue colour. Browsing the cursor over various elements of a pattern displays pop-up's giving detailed information on the elements. The displayed pop-up in this example shows information on the polyA sites that maps to the alternate transcript pattern AT2; of these two polyA sites, the first one (located at gene position 9317) terminates the transcript pattern while the second one (located at gene position 9189) is skipped and is not used as a terminating polyA in the formation of this pattern. Inset B: Example snapshot of a portion of Ensembl gene display page to illustrate the integration of the AltTrans data in Ensembl genome annotation project.
Similar articles
- ASTD: The Alternative Splicing and Transcript Diversity database.
Koscielny G, Le Texier V, Gopalakrishnan C, Kumanduri V, Riethoven JJ, Nardone F, Stanley E, Fallsehr C, Hofmann O, Kull M, Harrington E, Boué S, Eyras E, Plass M, Lopez F, Ritchie W, Moucadel V, Ara T, Pospisil H, Herrmann A, G Reich J, Guigó R, Bork P, Doeberitz Mv, Vilo J, Hide W, Apweiler R, Thanaraj TA, Gautheret D. Koscielny G, et al. Genomics. 2009 Mar;93(3):213-20. doi: 10.1016/j.ygeno.2008.11.003. Epub 2008 Dec 24. Genomics. 2009. PMID: 19059335 - ASD: the Alternative Splicing Database.
Thanaraj TA, Stamm S, Clark F, Riethoven JJ, Le Texier V, Muilu J. Thanaraj TA, et al. Nucleic Acids Res. 2004 Jan 1;32(Database issue):D64-9. doi: 10.1093/nar/gkh030. Nucleic Acids Res. 2004. PMID: 14681360 Free PMC article. - Splicing Busts a Move: Isoform Switching Regulates Migration.
Mitra M, Lee HN, Coller HA. Mitra M, et al. Trends Cell Biol. 2020 Jan;30(1):74-85. doi: 10.1016/j.tcb.2019.10.007. Epub 2019 Dec 3. Trends Cell Biol. 2020. PMID: 31810769 Free PMC article. Review. - Transposable element-driven transcript diversification and its relevance to genetic disorders.
Ayarpadikannan S, Lee HE, Han K, Kim HS. Ayarpadikannan S, et al. Gene. 2015 Mar 10;558(2):187-94. doi: 10.1016/j.gene.2015.01.039. Epub 2015 Jan 21. Gene. 2015. PMID: 25617522 Review.
Cited by
- Alternative splicing for diseases, cancers, drugs, and databases.
Tang JY, Lee JC, Hou MF, Wang CL, Chen CC, Huang HW, Chang HW. Tang JY, et al. ScientificWorldJournal. 2013 May 22;2013:703568. doi: 10.1155/2013/703568. Print 2013. ScientificWorldJournal. 2013. PMID: 23766705 Free PMC article. Review. - A novel MCPH1 isoform complements the defective chromosome condensation of human MCPH1-deficient cells.
Gavvovidis I, Rost I, Trimborn M, Kaiser FJ, Purps J, Wiek C, Hanenberg H, Neitzel H, Schindler D. Gavvovidis I, et al. PLoS One. 2012;7(8):e40387. doi: 10.1371/journal.pone.0040387. Epub 2012 Aug 30. PLoS One. 2012. PMID: 22952573 Free PMC article. - Mining Functional Elements in Messenger RNAs: Overview, Challenges, and Perspectives.
Ahmed F, Benedito VA, Zhao PX. Ahmed F, et al. Front Plant Sci. 2011 Nov 30;2:84. doi: 10.3389/fpls.2011.00084. eCollection 2011. Front Plant Sci. 2011. PMID: 22639614 Free PMC article. - POLYAR, a new computer program for prediction of poly(A) sites in human sequences.
Akhtar MN, Bukhari SA, Fazal Z, Qamar R, Shahmuradov IA. Akhtar MN, et al. BMC Genomics. 2010 Nov 19;11:646. doi: 10.1186/1471-2164-11-646. BMC Genomics. 2010. PMID: 21092114 Free PMC article. - Discovery of novel human transcript variants by analysis of intronic single-block EST with polyadenylation site.
Wang P, Yu P, Gao P, Shi T, Ma D. Wang P, et al. BMC Genomics. 2009 Nov 12;10:518. doi: 10.1186/1471-2164-10-518. BMC Genomics. 2009. PMID: 19906316 Free PMC article.
References
- Zavolan M, Kondo S, Schönbach C, Adachi J, Hume DA, RIKEN GER Group. Members GSL, Hayashizaki Y, Gaasterland T. Impact of Alternative Initiation, Splicing, and Termination on the Diversity of the mRNA Transcripts Encoded by the Mouse Transcriptome. Genome Res. 2003;13:1290–1300. doi: 10.1101/gr.1017303. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous