The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species - PubMed (original) (raw)
The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic species
J Quackenbush et al. Nucleic Acids Res. 2001.
Abstract
While genome sequencing projects are advancing rapidly, EST sequencing and analysis remains a primary research tool for the identification and categorization of gene sequences in a wide variety of species and an important resource for annotation of genomic sequence. The TIGR Gene Indices (http://www.tigr.org/tdb/tgi. shtml) are a collection of species-specific databases that use a highly refined protocol to analyze EST sequences in an attempt to identify the genes represented by that data and to provide additional information regarding those genes. Gene Indices are constructed by first clustering, then assembling EST and annotated gene sequences from GenBank for the targeted species. This process produces a set of unique, high-fidelity virtual transcripts, or Tentative Consensus (TC) sequences. The TC sequences can be used to provide putative genes with functional annotation, to link the transcripts to mapping and genomic sequence data, to provide links between orthologous and paralogous genes and as a resource for comparative sequence analysis.
Figures
Figure 1
An example THC from the Human Gene Index. The consensus sequence is presented in FASTA format below which the locations of the gene sequences (red) and ESTs that comprise the assembly are shown with their respective locations within the assembly. Links are provided to GenBank records, internal data for all ESTs sequenced at TIGR and to clones available through the ATCC. This THC has been assigned a putative ID of ‘insulin receptor inhibitor, muscle’ as it contains a HT853 (as well as gene sequences from GenBank).
Figure 2
An example TOG from the TOGA database. The human, mouse and rat TCs all contain annotated genes; those in mouse and rat have been identified as ‘bithoraxoid-like protein’ while the human gene is simply annotated as ‘HSPC162’ and the cattle TC consists only of ESTs. The stringent overlap criteria used to construct the TOGs makes it unlikely that these matches are spurious and provides putative functional annotation for the previously unclassified human and bovine gene and EST sequences.
Figure 3
Alignment of TCs from the TIGR Plant Gene Indices with the sequence of Arabidopsis thaliana Chromsome II. The coding sequence of a putative casein kinase II catalytic subunit shows significant homology to the same gene in other plants as is evident from an alignment between the Arabidopsis genomic sequence and the various plant TCs. This gene is well conserved across both monocots and dicots. The multiple hits seen in some species may represent paralogs, gene families, alternative splice forms or partial TC assemblies.
Similar articles
- The TIGR gene indices: reconstruction and representation of expressed gene sequences.
Quackenbush J, Liang F, Holt I, Pertea G, Upton J. Quackenbush J, et al. Nucleic Acids Res. 2000 Jan 1;28(1):141-5. doi: 10.1093/nar/28.1.141. Nucleic Acids Res. 2000. PMID: 10592205 Free PMC article. - The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes.
Lee Y, Tsai J, Sunkara S, Karamycheva S, Pertea G, Sultana R, Antonescu V, Chan A, Cheung F, Quackenbush J. Lee Y, et al. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D71-4. doi: 10.1093/nar/gki064. Nucleic Acids Res. 2005. PMID: 15608288 Free PMC article. - Cross-referencing eukaryotic genomes: TIGR Orthologous Gene Alignments (TOGA).
Lee Y, Sultana R, Pertea G, Cho J, Karamycheva S, Tsai J, Parvizi B, Cheung F, Antonescu V, White J, Holt I, Liang F, Quackenbush J. Lee Y, et al. Genome Res. 2002 Mar;12(3):493-502. doi: 10.1101/gr.212002. Genome Res. 2002. PMID: 11875039 Free PMC article. - Gene identification through large-scale EST sequence processing.
Lindlöf A. Lindlöf A. Appl Bioinformatics. 2003;2(3):123-9. Appl Bioinformatics. 2003. PMID: 15130797 Review. - Identification and analysis of gene families from the duplicated genome of soybean using EST sequences.
Nelson RT, Shoemaker R. Nelson RT, et al. BMC Genomics. 2006 Aug 9;7:204. doi: 10.1186/1471-2164-7-204. BMC Genomics. 2006. PMID: 16899135 Free PMC article. Review.
Cited by
- Comparative transcriptomics of stem rust resistance in wheat NILs mediated by Sr24 rust resistance gene.
Vishwakarma G, Saini A, Bhardwaj SC, Kumar S, Das BK. Vishwakarma G, et al. PLoS One. 2023 Dec 11;18(12):e0295202. doi: 10.1371/journal.pone.0295202. eCollection 2023. PLoS One. 2023. PMID: 38079439 Free PMC article. - Induced expression modes of genes related to Toll, Imd, and JAK/STAT signaling pathway-mediated immune response in Spodoptera frugiperda infected with Beauveria bassiana.
Li J, Mao Y, Yi J, Lin M, Xu H, Cheng Y, Wu H, Liu J. Li J, et al. Front Physiol. 2023 Aug 24;14:1249662. doi: 10.3389/fphys.2023.1249662. eCollection 2023. Front Physiol. 2023. PMID: 37693000 Free PMC article. - Transcriptome Characterization and Expression Analysis of Chemosensory Genes in Chilo sacchariphagus (Lepidoptera Crambidae), a Key Pest of Sugarcane.
Liu J, Liu H, Yi J, Mao Y, Li J, Sun D, An Y, Wu H. Liu J, et al. Front Physiol. 2021 Mar 5;12:636353. doi: 10.3389/fphys.2021.636353. eCollection 2021. Front Physiol. 2021. PMID: 33762968 Free PMC article. - The 21st Century Agriculture: When Rice Research Draws Attention to Climate Variability and How Weedy Rice and Underutilized Grains Come in Handy.
Hanafiah NM, Mispan MS, Lim PE, Baisakh N, Cheng A. Hanafiah NM, et al. Plants (Basel). 2020 Mar 16;9(3):365. doi: 10.3390/plants9030365. Plants (Basel). 2020. PMID: 32188108 Free PMC article. Review. - A comparative analysis of nonhost resistance across the two Triticeae crop species wheat and barley.
Delventhal R, Rajaraman J, Stefanato FL, Rehman S, Aghnoum R, McGrann GRD, Bolger M, Usadel B, Hedley PE, Boyd L, Niks RE, Schweizer P, Schaffrath U. Delventhal R, et al. BMC Plant Biol. 2017 Dec 4;17(1):232. doi: 10.1186/s12870-017-1178-0. BMC Plant Biol. 2017. PMID: 29202692 Free PMC article.
References
- Adams M.D., Kelley,J.M., Gocayne,J.D., Dubnick,M., Polymeropoulos,M.H.M., Xiao,H., Merril,C.R., Wu,A., Olde,B., Moreno,R.F. et al. (1991) Complementary DNA sequencing: expressed sequence tags and human genome project. Science, 252, 1651–1656. - PubMed
- Boguski M.S. and Schuler,G.D. (1995) ESTablishing a human transcript map. Nature Genet., 10, 369–371. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases
Research Materials