Biobibliometrics: information retrieval and visualization from co-occurrences of gene names in Medline abstracts
B J Stapley et al. Pac Symp Biocomput. 2000.
Free article
Abstract
Successful information retrieval from biomedical literature databases is becoming increasingly difficult. We have developed a prototype system for retrieving and visualizing information from literature and genomic databases using gene names. The premise of our work is that, if two genes have a related biological function, the co-occurrence of two gene names (or aliases of those genes) within the biomedical literature is more likely. From a collection of Medline documents, we have extracted the number of co-occurrences of every pair of Saccharomyces cerevisiae genes. The query is automatically conflated to include gene aliases as well. In addition, the retrieved document set can be filtered by the user with a MeSH term. From this co-occurrence data we construct a matrix that contains dissimilarity measurements of every pair of genes, based on their joint and individual occurrence statistics. A graph is generated from this matrix, with node and edge inclusion being determined by a user-defined threshold. Nodes of the graph represent genes, while edge lengths are a function of the occurrence of the two genes within the literature. Nodes can be hypertext-linked to sequence databases, while edges are linked to those Medline documents that generated them. The system is a tool for efficiently exploring the biomedical information landscape and may act as an inference network.
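The pipeline described in the abstract (alias conflation, co-occurrence counting, a dissimilarity matrix, and a thresholded graph) can be illustrated with a minimal Python sketch. The abstract does not state which dissimilarity formula the authors used, so the Jaccard distance below is an assumption chosen only because it depends on joint and individual occurrence counts. The alias table, the toy documents, and all function names are likewise hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations

# Toy alias table (illustrative only): alias -> canonical gene name.
ALIASES = {"CDK1": "CDC28"}


def conflate(names):
    """Map each name to its canonical form; a gene counts once per document."""
    return {ALIASES.get(name, name) for name in names}


def cooccurrence_counts(documents):
    """Count the documents mentioning each gene and each unordered gene pair."""
    single = Counter()
    pair = Counter()
    for names in documents:
        genes = sorted(conflate(names))
        single.update(genes)
        # Sorted input means every pair key is stored in one canonical order.
        pair.update(combinations(genes, 2))
    return single, pair


def dissimilarity(single, pair, a, b):
    """Jaccard distance (an assumed measure, not confirmed by the abstract)."""
    a, b = sorted((a, b))
    joint = pair[(a, b)]
    union = single[a] + single[b] - joint
    return 1.0 if union == 0 else 1.0 - joint / union


def build_graph(single, pair, threshold):
    """Keep only edges whose dissimilarity is at or below the threshold."""
    edges = {}
    for a, b in pair:
        d = dissimilarity(single, pair, a, b)
        if d <= threshold:
            edges[(a, b)] = d
    # A node is included only if at least one of its edges survives.
    nodes = {gene for edge in edges for gene in edge}
    return nodes, edges


# Toy "abstracts", each reduced to the gene names it mentions (made-up data).
docs = [
    ["CDC28", "CLN2"],
    ["CDK1", "CLN2", "SWI4"],
    ["CDC28", "CLN2"],
    ["SWI4", "ACT1"],
]

single, pair = cooccurrence_counts(docs)
nodes, edges = build_graph(single, pair, threshold=0.5)
```

In this sketch the edge value plays the role of the edge length: gene pairs that almost always appear together get a value near 0, and raising the threshold admits weaker associations into the graph. Linking each edge back to its source documents, as the abstract describes, would require storing document identifiers alongside the pair counts.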