On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report
Paul D Thomas et al. PLoS Comput Biol. 2012.
Abstract
A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011) has proposed a metric for the "functional similarity" between two genes that uses only the Gene Ontology (GO) annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the "ortholog conjecture" (or, more properly, the "ortholog functional conservation hypothesis"). First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches. 
Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1) that GO annotations are often incomplete, potentially in a biased manner, and subject to an "open world assumption" (absence of an annotation does not imply absence of a function), and 2) that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the conclusions have a justifiable biological basis.
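The ascertainment-bias argument can be illustrated with a toy calculation. The sketch below is an assumption-laden illustration, not the metric of Nehrt et al.: it uses plain Jaccard overlap as a stand-in similarity, and the gene names and GO term sets are hypothetical. It shows how an overlap score can drop to zero for an orthologous pair simply because the two species were studied with different kinds of experiments, while a same-species paralog pair studied with the same assay type scores higher.

```python
def jaccard(a, b):
    """Set-overlap similarity between two annotation sets.

    A stand-in for illustration only; not the metric proposed by Nehrt et al.
    """
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical experimental GO annotations (all names invented for this sketch).
# Human genes: assumed to be characterized mainly by biochemical assays.
human_gene_a = {"protein kinase activity", "ATP binding"}
human_paralog_b = {"protein kinase activity", "protein binding"}
# Mouse ortholog of gene A: assumed to be characterized mainly by knockout phenotypes.
mouse_ortholog_a = {"heart development", "regulation of cell growth"}

paralog_score = jaccard(human_gene_a, human_paralog_b)
ortholog_score = jaccard(human_gene_a, mouse_ortholog_a)

print(f"paralog pair (same species, same assay type): {paralog_score:.2f}")
print(f"ortholog pair (different species, different assay type): {ortholog_score:.2f}")

# Under the open world assumption, a term missing from one gene's set means
# "not yet tested", not "function absent", so the zero score above says nothing
# about whether the orthologs differ in function.
```

In this toy case the two annotation sets for the orthologous pair are complementary rather than contradictory, which is the reading the abstract argues for.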
Conflict of interest statement
PDT, VW and JAB are funded in part by grants to maintain model organism databases.