Gene Model Annotations for Drosophila melanogaster: The Rule-Benders
Madeline A Crosby et al. G3 (Bethesda). 2015.
Abstract
In the context of the FlyBase annotated gene models in Drosophila melanogaster, we describe the many exceptional cases we have curated from the literature or identified in the course of FlyBase analysis. These range from atypical but common examples such as dicistronic and polycistronic transcripts, noncanonical splices, trans-spliced transcripts, noncanonical translation starts, and stop-codon readthroughs, to single exceptional cases such as ribosomal frameshifting and HAC1-type intron processing. In FlyBase, exceptional genes and transcripts are flagged with Sequence Ontology terms and/or standardized comments. Because some of the rule-benders create problems for handlers of high-throughput data, we discuss plans for flagging these cases in bulk data downloads.
Keywords: bicistronic; multiphasic exon; non-AUG translation start; shared promoter; stop-codon suppression.
Copyright © 2015 Crosby et al.
Figures
Figure 1
A dicistronic transcript isoform for primo-1 and primo-2 is produced from a stage- and tissue-specific promoter. A GBrowse view showing (top to bottom): the gene extents and the gene models; cDNAs and ESTs; transcription start site(s); unstranded RNA-Seq coverage data corresponding to a developmental series (early embryos, top, to adults, bottom); and stranded RNA-Seq coverage data (plus strand top, minus strand bottom) corresponding to testis (red), male accessory gland (magenta), ovary from virgin females (orange), and ovaries from mated females (tan). More information on data presented in GBrowse may be found at http://flybase.org/wiki/FlyBase:GBrowse_Tracks#General.
Figure 2
Noncanonical splices supported by RNA-Seq junction data. (A) Of three alternative splice acceptors for intron 6 of the bifid (bi) gene, two are noncanonical TGs, including the splice acceptor used at the highest frequency (first highlighted junction). A GBrowse view showing (top to bottom): nucleotide sequence; region of the gene model showing one intron/exon boundary; EST data; RNA-Seq junction data; and unstranded RNA-Seq coverage data corresponding to a developmental series (early embryos, top, to adults, bottom). More information on data presented in GBrowse may be found at http://flybase.org/wiki/FlyBase:GBrowse_Tracks#General. (B) Report for an RNA-Seq junction that corresponds to a noncanonical splice but is aligned to incorrect noncanonical sites, one of several cases that were slightly misaligned.
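As a rough illustration of what "noncanonical" means for the splice sites in Figure 2, the sketch below labels an intron by its first and last two nucleotides. The function name `classify_intron` and the choice to treat only GT-AG as canonical are assumptions made for this example; they are not FlyBase's annotation rules or code.

```python
def classify_intron(intron_seq: str) -> str:
    """Label an intron by its terminal dinucleotides (donor-acceptor).

    Assumption for illustration: only GT-AG is treated as canonical.
    """
    seq = intron_seq.upper()
    if len(seq) < 4:
        raise ValueError("intron sequence must be at least 4 nt long")
    donor, acceptor = seq[:2], seq[-2:]      # first and last two bases of the intron
    label = f"{donor}-{acceptor}"
    prefix = "canonical" if label == "GT-AG" else "noncanonical"
    return f"{prefix} {label}"


# Toy sequences (not taken from the bi gene): one AG acceptor, one TG acceptor.
print(classify_intron("GTAAGTTTTCAG"))
print(classify_intron("gtaagttttctg"))
```

An intron ending in TG, like the two bi acceptors described in the caption, would be reported as noncanonical under this simplified rule.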
Figure 3
Noncanonical terminal extensions of the CDS. (A) CUG start codon in Fmr1 results in a 48-aa N-terminal extension; a GBrowse view showing amino acid sequence and amino ends of annotated polypeptides. Use of this alternative start codon has been confirmed by Western blot, mutagenesis of reported constructs, and rescue constructs (Beerman and Jongens 2011). (B) For the dan gene model, a stop-codon readthrough annotated for dan-RB is supported by PhyloCSF analysis (conservation of protein signatures). A GBrowse view showing (top to bottom): the gene model; stop codons on the plus strand in each of the three open reading frames; and regions of protein conservation among the Drosophila species (tan extents at the bottom). More information on data presented in GBrowse may be found at http://flybase.org/wiki/FlyBase:GBrowse_Tracks#General.
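To make the stop-codon readthrough in Figure 3B concrete, the following sketch returns the in-frame stretch between an annotated stop codon and the next in-frame stop, which is the region a readthrough would add to the CDS. The function name `readthrough_extension`, its parameters, and the toy sequence are hypothetical; this is not the PhyloCSF analysis or any FlyBase procedure.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def readthrough_extension(seq: str, stop_start: int) -> str:
    """Return the in-frame nucleotides after the annotated stop codon at
    stop_start, up to (not including) the next in-frame stop codon."""
    seq = seq.upper()
    if seq[stop_start:stop_start + 3] not in STOP_CODONS:
        raise ValueError("no stop codon at stop_start")
    codons = []
    # Walk codon by codon, staying in the frame of the annotated stop.
    for i in range(stop_start + 3, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in STOP_CODONS:
            return "".join(codons)
        codons.append(codon)
    raise ValueError("no downstream in-frame stop codon found")


# Toy example: ATG GCC TGA | GGA CTG | TAA CC
print(readthrough_extension("ATGGCCTGAGGACTGTAACC", 6))  # GGACTG
```

The sketch only locates a candidate extension; deciding whether readthrough actually occurs requires evidence such as the conservation signal described in the caption.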
Grants and funding
- P41 HG000739/HG/NHGRI NIH HHS/United States
- U41 HG000739/HG/NHGRI NIH HHS/United States
- G1000968/Medical Research Council/United Kingdom
- U41 HG00739/HG/NHGRI NIH HHS/United States