Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information - PubMed (original) (raw)
Comparative Study
. 2004 Mar 1;20(4):477-86.
doi: 10.1093/bioinformatics/btg432. Epub 2004 Jan 22.
Affiliations
- PMID: 14990443
- DOI: 10.1093/bioinformatics/btg432
Comparative Study
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information
Shandar Ahmad et al. Bioinformatics. 2004.
Abstract
Motivation: Though vitally important to cell function, the mechanism of protein-DNA binding has not yet been completely understood. We therefore analysed the relationship between DNA binding and protein sequence composition, solvent accessibility and secondary structure. Using non-redundant databases of transcription factors and protein-DNA complexes, neural network models were developed to utilize the information present in this relationship to predict DNA-binding proteins and their binding residues.
Results: Sequence composition was found to provide sufficient information to predict the probability of its binding to DNA with nearly 69% sensitivity at 64% accuracy for the considered proteins; sequence neighbourhood and solvent accessibility information were sufficient to make binding site predictions with 40% sensitivity at 79% accuracy. Detailed analysis of binding residues shows that some three- and five-residue segments frequently bind to DNA and that solvent accessibility plays a major role in binding. Although, binding behaviour was not associated with any particular secondary structure, there were interesting exceptions at the residue level. Over-representation of some residues in the binding sites was largely lost at the total sequence level, but a different kind of compositional preference was observed in DNA-binding proteins.
Similar articles
- Prediction of DNA-binding residues from sequence.
Ofran Y, Mysore V, Rost B. Ofran Y, et al. Bioinformatics. 2007 Jul 1;23(13):i347-53. doi: 10.1093/bioinformatics/btm174. Bioinformatics. 2007. PMID: 17646316 - PSSM-based prediction of DNA binding sites in proteins.
Ahmad S, Sarai A. Ahmad S, et al. BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33. BMC Bioinformatics. 2005. PMID: 15720719 Free PMC article. - A neural network method for prediction of beta-turn types in proteins using evolutionary information.
Kaur H, Raghava GP. Kaur H, et al. Bioinformatics. 2004 Nov 1;20(16):2751-8. doi: 10.1093/bioinformatics/bth322. Epub 2004 May 14. Bioinformatics. 2004. PMID: 15145798 - Correlated substitution analysis and the prediction of amino acid structural contacts.
Horner DS, Pirovano W, Pesole G. Horner DS, et al. Brief Bioinform. 2008 Jan;9(1):46-56. doi: 10.1093/bib/bbm052. Epub 2007 Nov 13. Brief Bioinform. 2008. PMID: 18000015 Review. - Interaction-site prediction for protein complexes: a critical assessment.
Zhou HX, Qin S. Zhou HX, et al. Bioinformatics. 2007 Sep 1;23(17):2203-9. doi: 10.1093/bioinformatics/btm323. Epub 2007 Jun 22. Bioinformatics. 2007. PMID: 17586545 Review.
Cited by
- A comprehensive review of protein-centric predictors for biomolecular interactions: from proteins to nucleic acids and beyond.
Jia P, Zhang F, Wu C, Li M. Jia P, et al. Brief Bioinform. 2024 Mar 27;25(3):bbae162. doi: 10.1093/bib/bbae162. Brief Bioinform. 2024. PMID: 38739759 Free PMC article. Review. - EPDRNA: A Model for Identifying DNA-RNA Binding Sites in Disease-Related Proteins.
Sun C, Feng Y. Sun C, et al. Protein J. 2024 Jun;43(3):513-521. doi: 10.1007/s10930-024-10183-3. Epub 2024 Mar 16. Protein J. 2024. PMID: 38491248 - ULDNA: integrating unsupervised multi-source language models with LSTM-attention network for high-accuracy protein-DNA binding site prediction.
Zhu YH, Liu Z, Liu Y, Ji Z, Yu DJ. Zhu YH, et al. Brief Bioinform. 2024 Jan 22;25(2):bbae040. doi: 10.1093/bib/bbae040. Brief Bioinform. 2024. PMID: 38349057 Free PMC article. - Deep-WET: a deep learning-based approach for predicting DNA-binding proteins using word embedding techniques with weighted features.
Mahmud SMH, Goh KOM, Hosen MF, Nandi D, Shoombuatong W. Mahmud SMH, et al. Sci Rep. 2024 Feb 5;14(1):2961. doi: 10.1038/s41598-024-52653-9. Sci Rep. 2024. PMID: 38316843 Free PMC article. - HybridDBRpred: improved sequence-based prediction of DNA-binding amino acids using annotations from structured complexes and disordered proteins.
Zhang J, Basu S, Kurgan L. Zhang J, et al. Nucleic Acids Res. 2024 Jan 25;52(2):e10. doi: 10.1093/nar/gkad1131. Nucleic Acids Res. 2024. PMID: 38048333 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources