Text-based analysis of genes, proteins, aging, and cancer - PubMed (original) (raw)
Text-based analysis of genes, proteins, aging, and cancer
Jeremy R Semeiks et al. Mech Ageing Dev. 2005 Jan.
Abstract
The diverse nature of cancer- and aging-related genes presents a challenge for large-scale studies based on molecular sequence and profiling data. An underexplored source of data for modeling and analysis is the textual descriptions and annotations present in curated gene-centered biomedical corpora. Here, 450 genes designated by surveys of the scientific literature as being associated with cancer and aging were analyzed using two complementary approaches. The first, ensemble attribute profile clustering, is a recently formulated, text-based, semi-automated data interpretation strategy that exploits ideas from statistical information retrieval to discover and characterize groups of genes with common structural and functional properties. Groups of genes with shared and unique Gene Ontology terms and protein domains were defined and examined. Human homologs of a group of known Drosphila aging-related genes are candidates for genes that may influence lifespan (hep/MAPK2K7, bsk/MAPK8, puc/LOC285193). These JNK pathway-associated proteins may specify a molecular hub that coordinates and integrates multiple intra- and extracellular processes via space- and time-dependent interactions with proteins in other pathways. The second approach, a qualitative examination of the chromosomal locations of 311 human cancer- and aging-related genes, provides anecdotal evidence for a "phenotype position effect": genes that are proximal in the linear genome often encode proteins involved in the same phenomenon. Comparative genomics was employed to enhance understanding of several genes, including open reading frames, identified as new candidates for genes with roles in aging or cancer. Overall, the results highlight fundamental molecular and mechanistic connections between progenitor/stem cell lineage determination, embryonic morphogenesis, cancer, and aging. Despite diversity in the nature of the molecular and cellular processes associated with these phenomena, they seem related to the architectural hub of tissue polarity and a need to generate and control this property in a timely manner.
Similar articles
- MILANO--custom annotation of microarray results using automatic literature searches.
Rubinstein R, Simon I. Rubinstein R, et al. BMC Bioinformatics. 2005 Jan 20;6:12. doi: 10.1186/1471-2105-6-12. BMC Bioinformatics. 2005. PMID: 15661078 Free PMC article. - How to decide which are the most pertinent overly-represented features during gene set enrichment analysis.
Barriot R, Sherman DJ, Dutour I. Barriot R, et al. BMC Bioinformatics. 2007 Sep 11;8:332. doi: 10.1186/1471-2105-8-332. BMC Bioinformatics. 2007. PMID: 17848190 Free PMC article. - Gene expression trends and protein features effectively complement each other in gene function prediction.
Wabnik K, Hvidsten TR, Kedzierska A, Van Leene J, De Jaeger G, Beemster GT, Komorowski J, Kuiper MT. Wabnik K, et al. Bioinformatics. 2009 Feb 1;25(3):322-30. doi: 10.1093/bioinformatics/btn625. Epub 2008 Dec 2. Bioinformatics. 2009. PMID: 19050035 - Computational prediction of cancer-gene function.
Hu P, Bader G, Wigle DA, Emili A. Hu P, et al. Nat Rev Cancer. 2007 Jan;7(1):23-34. doi: 10.1038/nrc2036. Epub 2006 Dec 14. Nat Rev Cancer. 2007. PMID: 17167517 Review. - How bioinformatics can help reverse engineer human aging.
de Magalhães JP, Toussaint O. de Magalhães JP, et al. Ageing Res Rev. 2004 Apr;3(2):125-41. doi: 10.1016/j.arr.2003.08.006. Ageing Res Rev. 2004. PMID: 15177050 Review.
Cited by
- Category fluency, latent semantic analysis and schizophrenia: a candidate gene approach.
Nicodemus KK, Elvevåg B, Foltz PW, Rosenstein M, Diaz-Asper C, Weinberger DR. Nicodemus KK, et al. Cortex. 2014 Jun;55:182-91. doi: 10.1016/j.cortex.2013.12.004. Epub 2013 Dec 20. Cortex. 2014. PMID: 24447899 Free PMC article. - Genome-wide patterns of gene expression during aging in the African malaria vector Anopheles gambiae.
Wang MH, Marinotti O, James AA, Walker E, Githure J, Yan G. Wang MH, et al. PLoS One. 2010 Oct 13;5(10):e13359. doi: 10.1371/journal.pone.0013359. PLoS One. 2010. PMID: 20967211 Free PMC article. - AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology.
Achcar F, Camadro JM, Mestivier D. Achcar F, et al. Nucleic Acids Res. 2009 Jul;37(Web Server issue):W63-7. doi: 10.1093/nar/gkp430. Epub 2009 May 27. Nucleic Acids Res. 2009. PMID: 19474346 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous