Optimization of neural network architecture using genetic programming improves detection and modeling of gene-gene interactions in studies of human diseases

Marylyn D Ritchie et al. BMC Bioinformatics. 2003.

Abstract

Background: Appropriate definition of neural network architecture prior to data analysis is crucial for successful data mining. This can be challenging when the underlying model of the data is unknown. The goal of this study was to determine whether optimizing neural network architecture using genetic programming as a machine learning strategy would improve the ability of neural networks to model and detect nonlinear interactions among genes in studies of common human diseases.

Results: Using simulated data, we show that a genetic programming optimized neural network approach is able to model gene-gene interactions as well as a traditional back propagation neural network. Furthermore, the genetic programming optimized neural network is better than the traditional back propagation neural network approach in terms of predictive ability and power to detect gene-gene interactions when non-functional polymorphisms are present.

Conclusion: This study suggests that a machine learning strategy for optimizing neural network architecture may be preferable to traditional trial-and-error approaches for the identification and characterization of gene-gene interactions in common, complex human diseases.
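
As a rough illustration of the strategy described in the abstract, the Python sketch below evolves candidate network architectures with a simple evolutionary loop, scoring each by classification error. All names (fitness, random_architecture, mutate_or_crossover) are hypothetical placeholders under assumed interfaces, not the authors' implementation.

    import random

    def evolve_architecture(fitness, random_architecture, mutate_or_crossover,
                            population_size=50, generations=20):
        # Start from a random population of candidate architectures.
        population = [random_architecture() for _ in range(population_size)]
        for _ in range(generations):
            # Lower classification error means better fitness.
            scored = sorted(population, key=fitness)
            parents = scored[: population_size // 2]        # truncation selection
            offspring = [mutate_or_crossover(random.sample(parents, 2))
                         for _ in range(population_size - len(parents))]
            population = parents + offspring
        return min(population, key=fitness)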

Figures

Figure 1

Binary expression tree example of a GP solution. This figure shows an example of a computer program generated by GP. Although a GP program can take virtually any form, we use a binary expression tree representation, so an example of this type is shown.
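
To make the representation concrete, here is a minimal Python sketch of a binary expression tree and how it might be evaluated; the Node class, operator set, and evaluate method are illustrative assumptions, not the paper's code.

    import operator

    OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}

    class Node:
        def __init__(self, value, left=None, right=None):
            self.value = value      # an operator symbol or a terminal (input name or constant)
            self.left = left
            self.right = right

        def evaluate(self, inputs):
            # Leaf nodes are terminals: either constants or named inputs such as "X1".
            if self.left is None and self.right is None:
                return inputs.get(self.value, self.value)
            return OPS[self.value](self.left.evaluate(inputs),
                                   self.right.evaluate(inputs))

    # Example tree: (X1 * 0.5) + (X2 * -1.2)
    tree = Node("+",
                Node("*", Node("X1"), Node(0.5)),
                Node("*", Node("X2"), Node(-1.2)))
    print(tree.evaluate({"X1": 1.0, "X2": 2.0}))   # -1.9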

Figure 2

GP crossover. This figure shows a crossover event in GP between two binary expression trees. Here, the left sub-tree of parent 1 is swapped with the left sub-tree of parent 2 to create two new trees.
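
A minimal sketch of the sub-tree swap shown in Figure 2, assuming the Node class sketched above; in practice GP picks crossover points at random rather than always swapping the left sub-trees.

    import copy

    def crossover(parent1, parent2):
        # Work on copies so the parent trees are left intact.
        child1, child2 = copy.deepcopy(parent1), copy.deepcopy(parent2)
        # Swap the left sub-trees, mirroring the example in Figure 2.
        child1.left, child2.left = child2.left, child1.left
        return child1, child2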

Figure 3

GPNN representation of a NN. This figure is an example of one NN optimized by GPNN. O is the output node, S indicates the activation function, W indicates a weight, and X1-X4 are the NN inputs.
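
As an illustration of what the pictured output node computes, the sketch below applies an activation function S to a weighted sum of the inputs X1-X4; the function names and the choice of a sigmoid activation are assumptions for the example only.

    import math

    def sigmoid(z):
        # One possible activation function S.
        return 1.0 / (1.0 + math.exp(-z))

    def output_node(weights, inputs):
        # Weighted sum of the inputs (w1*X1 + ... + w4*X4), then the activation.
        weighted_sum = sum(w * x for w, x in zip(weights, inputs))
        return sigmoid(weighted_sum)

    print(output_node([0.3, -0.7, 1.1, 0.05], [1, 0, 2, 1]))  # inputs X1..X4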

Figure 4

Feed-forward BPNN representation of the GPNN in Figure 3. To generate this NN, each weight in Figure 3 was evaluated to produce a single value.
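
A minimal sketch of that conversion, assuming each weight is represented as a constant-valued expression sub-tree (as in the Node sketch above); the helper name is hypothetical.

    def collapse_weights(weight_subtrees):
        # Evaluate each constant weight sub-tree down to a single number, giving
        # the fixed weights of an ordinary feed-forward network.
        return [w.evaluate({}) for w in weight_subtrees]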

Figure 5

Optimal architecture from BPNN trial-and-error optimization. This figure shows the result of the BPNN trial-and-error procedure on one data set from each epistasis model. Shown is the NN architecture with the best classification error selected from Table 1.
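
For comparison, a minimal sketch of the kind of trial-and-error search used to pick a BPNN architecture: train a network for each candidate hidden-layer layout and keep the one with the lowest classification error. The train_and_score helper is a hypothetical stand-in for training a back propagation network, for example via cross-validation.

    def select_architecture(candidate_layouts, train_and_score):
        best_layout, best_error = None, float("inf")
        for hidden_layers in candidate_layouts:       # e.g. (5,), (10,), (5, 3), ...
            error = train_and_score(hidden_layers)    # classification error of a trained BPNN
            if error < best_error:
                best_layout, best_error = hidden_layers, error
        return best_layout, best_error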
