THINK Back: KNowledge-based Interpretation of High Throughput data - PubMed (original) (raw)
THINK Back: KNowledge-based Interpretation of High Throughput data
Fernando Farfán et al. BMC Bioinformatics. 2012.
Abstract
Results of high throughput experiments can be challenging to interpret. Current approaches have relied on bulk processing the set of expression levels, in conjunction with easily obtained external evidence, such as co-occurrence. While such techniques can be used to reason probabilistically, they are not designed to shed light on what any individual gene, or a network of genes acting together, may be doing. Our belief is that today we have the information extraction ability and the computational power to perform more sophisticated analyses that consider the individual situation of each gene. The use of such techniques should lead to qualitatively superior results. The specific aim of this project is to develop computational techniques to generate a small number of biologically meaningful hypotheses based on observed results from high throughput microarray experiments, gene sequences, and next-generation sequences. Through the use of relevant known biomedical knowledge, as represented in published literature and public databases, we can generate meaningful hypotheses that will aide biologists to interpret their experimental data. We are currently developing novel approaches that exploit the rich information encapsulated in biological pathway graphs. Our methods perform a thorough and rigorous analysis of biological pathways, using complex factors such as the topology of the pathway graph and the frequency in which genes appear on different pathways, to provide more meaningful hypotheses to describe the biological phenomena captured by high throughput experiments, when compared to other existing methods that only consider partial information captured by biological pathways.
Figures
Figure 1
Example of density analysis on biological pathways. Two example pathways with differentially expressed genes appearing in different configurations. A pathway with differentially expressed genes appearing tightly-clustered in one portion of the graph is more significant than a pathway in which the differentially expressed genes appear spread out.
Figure 2
Distribution of appearance frequency of genes in KEGG Pathway database. The x axis indicates the appearance frequency of each gene in KEGG Pathways. The y axis shows the proportion of genes with indicated appearance frequency out of all KEGG Pathway genes. The total number of genes at each appearance count is indicated.
Figure 3
THINK-Back Software Architecture. The THINK-Back suite of tools has been developed to allow scientists worldwide to use our gene set enrichment testing methods and to combine them with previously-developed methods as adjustment tools. These tools can be accessed via web services, an application programmer's interface (API), and soon we will be providing a web-based user interface as well.
Figure 4
THINK-Back Web Service Call Diagram. The sequence diagram shows the steps to call the THINK-Back Web service, request the job status, and finally obtain the results. After submitting the job for execution, the web service returns a unique identifier that should be used to check the job status. Once the status returned is DONE, the user can check for the results of the execution.
Similar articles
- Graph mining for next generation sequencing: leveraging the assembly graph for biological insights.
Warnke-Sommer J, Ali H. Warnke-Sommer J, et al. BMC Genomics. 2016 May 6;17:340. doi: 10.1186/s12864-016-2678-2. BMC Genomics. 2016. PMID: 27154001 Free PMC article. - Interpreting experimental results using gene ontologies.
Beissbarth T. Beissbarth T. Methods Enzymol. 2006;411:340-52. doi: 10.1016/S0076-6879(06)11018-6. Methods Enzymol. 2006. PMID: 16939799 Review. - The future of Cochrane Neonatal.
Soll RF, Ovelman C, McGuire W. Soll RF, et al. Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12. Early Hum Dev. 2020. PMID: 33036834 - Exploring matrix factorization techniques for significant genes identification of Alzheimer's disease microarray gene expression data.
Kong W, Mou X, Hu X. Kong W, et al. BMC Bioinformatics. 2011;12 Suppl 5(Suppl 5):S7. doi: 10.1186/1471-2105-12-S5-S7. Epub 2011 Jul 27. BMC Bioinformatics. 2011. PMID: 21989140 Free PMC article. - Systems Biology Applications to Decipher Mechanisms and Novel Biomarkers in CNS Trauma.
Yu C, Kobeissy F. Yu C, et al. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 30. In: Kobeissy FH, editor. Brain Neurotrauma: Molecular, Neuropsychological, and Rehabilitation Aspects. Boca Raton (FL): CRC Press/Taylor & Francis; 2015. Chapter 30. PMID: 26269919 Free Books & Documents. Review.
Cited by
- Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap.
Reimand J, Isserlin R, Voisin V, Kucera M, Tannus-Lopes C, Rostamianfar A, Wadi L, Meyer M, Wong J, Xu C, Merico D, Bader GD. Reimand J, et al. Nat Protoc. 2019 Feb;14(2):482-517. doi: 10.1038/s41596-018-0103-9. Nat Protoc. 2019. PMID: 30664679 Free PMC article. - MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways.
Koumakis L, Kanterakis A, Kartsaki E, Chatzimina M, Zervakis M, Tsiknakis M, Vassou D, Kafetzopoulos D, Marias K, Moustakis V, Potamias G. Koumakis L, et al. PLoS Comput Biol. 2016 Nov 10;12(11):e1005187. doi: 10.1371/journal.pcbi.1005187. eCollection 2016 Nov. PLoS Comput Biol. 2016. PMID: 27832067 Free PMC article. - NET-GE: a novel NETwork-based Gene Enrichment for detecting biological processes associated to Mendelian diseases.
Di Lena P, Martelli PL, Fariselli P, Casadio R. Di Lena P, et al. BMC Genomics. 2015;16 Suppl 8(Suppl 8):S6. doi: 10.1186/1471-2164-16-S8-S6. Epub 2015 Jun 18. BMC Genomics. 2015. PMID: 26110971 Free PMC article. - Functional and genomic context in pathway analysis of GWAS data.
Mooney MA, Nigg JT, McWeeney SK, Wilmot B. Mooney MA, et al. Trends Genet. 2014 Sep;30(9):390-400. doi: 10.1016/j.tig.2014.07.004. Epub 2014 Aug 22. Trends Genet. 2014. PMID: 25154796 Free PMC article. Review. - Methods and approaches in the topology-based analysis of biological pathways.
Mitrea C, Taghavi Z, Bokanizad B, Hanoudi S, Tagett R, Donato M, Voichiţa C, Drăghici S. Mitrea C, et al. Front Physiol. 2013 Oct 10;4:278. doi: 10.3389/fphys.2013.00278. Front Physiol. 2013. PMID: 24133454 Free PMC article. Review.
References
- Brown P, Botstein D. Exploring the new world of the genome with DNA microarrays. Nature Genetics. 1999;21(1 Suppl):33–37. - PubMed
- Smyth GK. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3:Article 3. - PubMed
Publication types
MeSH terms
Grants and funding
- P30 ES017885/ES/NIEHS NIH HHS/United States
- R01 LM010138/LM/NLM NIH HHS/United States
- 5-R01-LM-010138-02/LM/NLM NIH HHS/United States
- 1U54DA021519/DA/NIDA NIH HHS/United States
LinkOut - more resources
Full Text Sources