In silico analysis of 3'-end-processing signals in Aspergillus oryzae using expressed sequence tags and genomic sequencing data - PubMed (original) (raw)
In silico analysis of 3'-end-processing signals in Aspergillus oryzae using expressed sequence tags and genomic sequencing data
Mizuki Tanaka et al. DNA Res. 2011 Jun.
Abstract
To investigate 3'-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3'-untranslated region (3' UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3' UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3' UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15-30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3'-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3'-end-processing signals are similar to those in yeast and plants, some notable differences exist between them.
Figures
Figure 1.
Profile of the A. oryzae poly(A) data set. (A) Frequency distribution of EST contigs based on the EST copy number. The EST copy number of each contig contained in the A. oryzae poly(A) data set was obtained from the A. oryzae EST database (
http://nribf2.nrib.go.jp/EST2/index.html
). Data on the total EST contigs were obtained from the study by Akao et al. (B) Gene expression levels determined by DNA microarray analysis. The fluorescence intensity of each gene was normalized to that of the histone H4 gene.
Figure 2.
Distribution of 3′ UTR lengths determined for 1065 unique EST sequences. The average length is 241 nt.
Figure 3.
Single nucleotide frequencies in the 3′ UTR and 100 nt sequence downstream of the poly(A) site. (A) Single nucleotide profile in the 3′ UTR and 100 nt sequence downstream of the poly(A) site. The poly(A) site is at position 0. The upstream sequence of the poly(A) site is designated minus and the downstream sequence is designated plus. (B) Sequence logo generated from the actual frequency of occurrence of each of the four nucleotides around the cleavage site. (C) Six regions of the 3′ UTR and 100 nt sequence downstream of the poly(A) site formed according to the single nucleotide profile. The cleavage and polyadenylation site is located between regions IV and V.
Figure 4.
Representative hexanucleotide signals in the poly(A) signal region (from −40 to −1 nt).
Figure 5.
A schematic representation of the alignment of 3′-end-processing signals in A. oryzae, yeast, and plants. The arrow indicates the cleavage and polyadenylation site.
Similar articles
- Aspergillus oryzae has two nearly identical Taka-amylase genes, each containing eight introns.
Gines MJ, Dove MJ, Seligy VL. Gines MJ, et al. Gene. 1989 Jun 30;79(1):107-17. doi: 10.1016/0378-1119(89)90096-6. Gene. 1989. PMID: 2789162 - Sequence determinants in human polyadenylation site selection.
Legendre M, Gautheret D. Legendre M, et al. BMC Genomics. 2003 Feb 25;4(1):7. doi: 10.1186/1471-2164-4-7. Epub 2003 Feb 25. BMC Genomics. 2003. PMID: 12600277 Free PMC article. - Similarities and differences of polyadenylation signals in human and fly.
Retelska D, Iseli C, Bucher P, Jongeneel CV, Naef F. Retelska D, et al. BMC Genomics. 2006 Jul 12;7:176. doi: 10.1186/1471-2164-7-176. BMC Genomics. 2006. PMID: 16836751 Free PMC article. - [Genome sequencing of Aspergillus oryzae].
Machida M. Machida M. Tanpakushitsu Kakusan Koso. 2006 May;51(5):452-6. Tanpakushitsu Kakusan Koso. 2006. PMID: 16686348 Review. Japanese. No abstract available. - Implications of polyadenylation in health and disease.
Curinha A, Oliveira Braz S, Pereira-Castro I, Cruz A, Moreira A. Curinha A, et al. Nucleus. 2014;5(6):508-19. doi: 10.4161/nucl.36360. Epub 2014 Oct 31. Nucleus. 2014. PMID: 25484187 Free PMC article. Review.
Cited by
- A highly contiguous genome assembly reveals sources of genomic novelty in the symbiotic fungus Rhizophagus irregularis.
Manley BF, Lotharukpong JS, Barrera-Redondo J, Llewellyn T, Yildirir G, Sperschneider J, Corradi N, Paszkowski U, Miska EA, Dallaire A. Manley BF, et al. G3 (Bethesda). 2023 Jun 1;13(6):jkad077. doi: 10.1093/g3journal/jkad077. G3 (Bethesda). 2023. PMID: 36999556 Free PMC article. - Widespread Polycistronic Transcripts in Fungi Revealed by Single-Molecule mRNA Sequencing.
Gordon SP, Tseng E, Salamov A, Zhang J, Meng X, Zhao Z, Kang D, Underwood J, Grigoriev IV, Figueroa M, Schilling JS, Chen F, Wang Z. Gordon SP, et al. PLoS One. 2015 Jul 15;10(7):e0132628. doi: 10.1371/journal.pone.0132628. eCollection 2015. PLoS One. 2015. PMID: 26177194 Free PMC article. - Analysis of Tc1-Mariner elements in Sclerotinia sclerotiorum suggests recent activity and flexible transposases.
Santana MF, Silva JC, Mizubuti ES, Araújo EF, Queiroz MV. Santana MF, et al. BMC Microbiol. 2014 Oct 3;14:256. doi: 10.1186/s12866-014-0256-9. BMC Microbiol. 2014. PMID: 25281292 Free PMC article. - Comparative in silico analysis of EST-SSRs in angiosperm and gymnosperm tree genera.
Ranade SS, Lin YC, Zuccolo A, Van de Peer Y, García-Gil Mdel R. Ranade SS, et al. BMC Plant Biol. 2014 Aug 21;14:220. doi: 10.1186/s12870-014-0220-8. BMC Plant Biol. 2014. PMID: 25143005 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials