A computational system to select candidate genes for complex human traits - PubMed (original) (raw)
A computational system to select candidate genes for complex human traits
Kyle J Gaulton et al. Bioinformatics. 2007.
Abstract
Motivation: Identification of the genetic variation underlying complex traits is challenging. The wealth of information publicly available about the biology of complex traits and the function of individual genes permits the development of informatics-assisted methods for the selection of candidate genes for these traits.
Results: We have developed a computational system named CAESAR that ranks all annotated human genes as candidates for a complex trait by using ontologies to semantically map natural language descriptions of the trait with a variety of gene-centric information sources. In a test of its effectiveness, CAESAR successfully selected 7 out of 18 (39%) complex human trait susceptibility genes within the top 2% of ranked candidates genome-wide, a subset that represents roughly 1% of genes in the human genome and provides sufficient enrichment for an association study of several hundred human genes. This approach can be applied to any well-documented mono- or multi-factorial trait in any organism for which an annotated gene set exists.
Availability: CAESAR scripts and test data can be downloaded from http://visionlab.bio.unc.edu/caesar/
Similar articles
- TraitMap: an XML-based genetic-map database combining multigenic loci and biomolecular networks.
Heida N, Hasegawa Y, Mochizuki Y, Hirosawa K, Konagaya A, Toyoda T. Heida N, et al. Bioinformatics. 2004 Aug 4;20 Suppl 1:i152-60. doi: 10.1093/bioinformatics/bth940. Bioinformatics. 2004. PMID: 15262794 - Inter-species normalization of gene mentions with GNAT.
Hakenberg J, Plake C, Leaman R, Schroeder M, Gonzalez G. Hakenberg J, et al. Bioinformatics. 2008 Aug 15;24(16):i126-132. doi: 10.1093/bioinformatics/btn299. Bioinformatics. 2008. PMID: 18689813 - Gene symbol disambiguation using knowledge-based profiles.
Xu H, Fan JW, Hripcsak G, Mendonça EA, Markatou M, Friedman C. Xu H, et al. Bioinformatics. 2007 Apr 15;23(8):1015-22. doi: 10.1093/bioinformatics/btm056. Epub 2007 Feb 21. Bioinformatics. 2007. PMID: 17314123 - An informatics approach to systems neurogenetics.
Rosen GD, Chesler EJ, Manly KF, Williams RW. Rosen GD, et al. Methods Mol Biol. 2007;401:287-303. doi: 10.1007/978-1-59745-520-6_16. Methods Mol Biol. 2007. PMID: 18368372 Review. - Expression quantitative trait loci and the PhenoGen database.
Saba L, Hoffman PL, Hornbaker C, Bhave SV, Tabakoff B. Saba L, et al. Alcohol Res Health. 2008;31(3):272-4. Alcohol Res Health. 2008. PMID: 23584876 Free PMC article. Review.
Cited by
- Text mining in cancer gene and pathway prioritization.
Luo Y, Riedlinger G, Szolovits P. Luo Y, et al. Cancer Inform. 2014 Oct 13;13(Suppl 1):69-79. doi: 10.4137/CIN.S13874. eCollection 2014. Cancer Inform. 2014. PMID: 25392685 Free PMC article. Review. - Approaches for recognizing disease genes based on network.
Zou Q, Li J, Wang C, Zeng X. Zou Q, et al. Biomed Res Int. 2014;2014:416323. doi: 10.1155/2014/416323. Epub 2014 Feb 24. Biomed Res Int. 2014. PMID: 24707485 Free PMC article. - ProphNet: a generic prioritization method through propagation of information.
Martínez V, Cano C, Blanco A. Martínez V, et al. BMC Bioinformatics. 2014;15 Suppl 1(Suppl 1):S5. doi: 10.1186/1471-2105-15-S1-S5. Epub 2014 Jan 10. BMC Bioinformatics. 2014. PMID: 24564336 Free PMC article. - Integrating human omics data to prioritize candidate genes.
Chen Y, Wu X, Jiang R. Chen Y, et al. BMC Med Genomics. 2013 Dec 18;6:57. doi: 10.1186/1755-8794-6-57. BMC Med Genomics. 2013. PMID: 24344781 Free PMC article. - Chapter 15: disease gene prioritization.
Bromberg Y. Bromberg Y. PLoS Comput Biol. 2013 Apr;9(4):e1002902. doi: 10.1371/journal.pcbi.1002902. Epub 2013 Apr 25. PLoS Comput Biol. 2013. PMID: 23633938 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Medical