I-TASSER server for protein 3D structure prediction - PubMed (original) (raw)
I-TASSER server for protein 3D structure prediction
Yang Zhang. BMC Bioinformatics. 2008.
Abstract
Background: Prediction of 3-dimensional protein structures from amino acid sequences represents one of the most important problems in computational structural biology. The community-wide Critical Assessment of Structure Prediction (CASP) experiments have been designed to obtain an objective assessment of the state-of-the-art of the field, where I-TASSER was ranked as the best method in the server section of the recent 7th CASP experiment. Our laboratory has since then received numerous requests about the public availability of the I-TASSER algorithm and the usage of the I-TASSER predictions.
Results: An on-line version of I-TASSER is developed at the KU Center for Bioinformatics which has generated protein structure predictions for thousands of modeling requests from more than 35 countries. A scoring function (C-score) based on the relative clustering structural density and the consensus significance score of multiple threading templates is introduced to estimate the accuracy of the I-TASSER predictions. A large-scale benchmark test demonstrates a strong correlation between the C-score and the TM-score (a structural similarity measurement with values in [0, 1]) of the first models with a correlation coefficient of 0.91. Using a C-score cutoff > -1.5 for the models of correct topology, both false positive and false negative rates are below 0.1. Combining C-score and protein length, the accuracy of the I-TASSER models can be predicted with an average error of 0.08 for TM-score and 2 A for RMSD.
Conclusion: The I-TASSER server has been developed to generate automated full-length 3D protein structural predictions where the benchmarked scoring system helps users to obtain quantitative assessments of the I-TASSER models. The output of the I-TASSER server for each query includes up to five full-length models, the confidence score, the estimated TM-score and RMSD, and the standard deviation of the estimations. The I-TASSER server is freely available to the academic community at http://zhang.bioinformatics.ku.edu/I-TASSER.
Figures
Figure 1
TM-score (a) and RMSD (b) versus C-score of the I-TASSER models for 500 testing proteins. The dashed curve in (a) is from Equation 3 which is fit from the 300 training proteins and used for estimating the TM-score of the I-TASSER models. The solid circles are the root mean squared deviation from the estimated TM-score values (RMSTD). The solid curve is from Equation 4 which is fit from the 300 training proteins. The dotted lines are the TM-score and C-score cutoffs for correct folds.
Figure 2
Two examples of the I-TASSER models from 1ca4A and 1cmaA. Both models have similar RMSD values but indicate significantly different modeling qualities. In the superposition, the thin backbones are the native structure and thick backbones the I-TASSER models. Blue to red runs from N- to C-terminal.
Figure 3
TM-score (a) and RMSD (b) of the I-TASSER models versus the length of target proteins. The numbers indicate the Pearson correlation coefficients.
Figure 4
RMSD versus C-score-ln(L) of the I-TASSER models for 500 test proteins (open circles). The dashed curve is from Equation 5 which is fit from the 300 training proteins and used for estimating RMSD of the I-TASSER models. The solid circles are the root mean squared RMSD deviation (RMSRD) from the estimated RMSD values. The solid curve is from Equation 6 which is fit from the 300 training proteins.
Similar articles
- Ab initio modeling of small proteins by iterative TASSER simulations.
Wu S, Skolnick J, Zhang Y. Wu S, et al. BMC Biol. 2007 May 8;5:17. doi: 10.1186/1741-7007-5-17. BMC Biol. 2007. PMID: 17488521 Free PMC article. - LOMETS: a local meta-threading-server for protein structure prediction.
Wu S, Zhang Y. Wu S, et al. Nucleic Acids Res. 2007;35(10):3375-82. doi: 10.1093/nar/gkm251. Epub 2007 May 3. Nucleic Acids Res. 2007. PMID: 17478507 Free PMC article. - Template-based protein structure prediction in CASP11 and retrospect of I-TASSER in the last decade.
Yang J, Zhang W, He B, Walker SE, Zhang H, Govindarajoo B, Virtanen J, Xue Z, Shen HB, Zhang Y. Yang J, et al. Proteins. 2016 Sep;84 Suppl 1(Suppl 1):233-46. doi: 10.1002/prot.24918. Epub 2015 Sep 18. Proteins. 2016. PMID: 26343917 Free PMC article. - AI-Driven Deep Learning Techniques in Protein Structure Prediction.
Chen L, Li Q, Nasif KFA, Xie Y, Deng B, Niu S, Pouriyeh S, Dai Z, Chen J, Xie CY. Chen L, et al. Int J Mol Sci. 2024 Aug 1;25(15):8426. doi: 10.3390/ijms25158426. Int J Mol Sci. 2024. PMID: 39125995 Free PMC article. Review. - Critical evaluation of in silico methods for prediction of coiled-coil domains in proteins.
Li C, Ching Han Chang C, Nagel J, Porebski BT, Hayashida M, Akutsu T, Song J, Buckle AM. Li C, et al. Brief Bioinform. 2016 Mar;17(2):270-82. doi: 10.1093/bib/bbv047. Epub 2015 Jul 15. Brief Bioinform. 2016. PMID: 26177815 Free PMC article. Review.
Cited by
- PLD3 in Alzheimer's Disease: a Modest Effect as Revealed by Updated Association and Expression Analyses.
Zhang DF, Fan Y, Wang D, Bi R, Zhang C, Fang Y, Yao YG. Zhang DF, et al. Mol Neurobiol. 2016 Aug;53(6):4034-4045. doi: 10.1007/s12035-015-9353-5. Epub 2015 Jul 21. Mol Neurobiol. 2016. PMID: 26189833 - Unusual shift in the visible absorption spectrum of an active ctenophore photoprotein elucidated by time-dependent density functional theory.
Tomilin FN, Rogova AV, Burakova LP, Tchaikovskaya ON, Avramov PV, Fedorov DG, Vysotski ES. Tomilin FN, et al. Photochem Photobiol Sci. 2021 Apr 8. doi: 10.1007/s43630-021-00039-5. Online ahead of print. Photochem Photobiol Sci. 2021. PMID: 33834429 - An immunogen containing four tandem 10E8 epitope repeats with exposed key residues induces antibodies that neutralize HIV-1 and activates an ADCC reporter gene.
Sun Z, Zhu Y, Wang Q, Ye L, Dai Y, Su S, Yu F, Ying T, Yang C, Jiang S, Lu L. Sun Z, et al. Emerg Microbes Infect. 2016 Jun 22;5(6):e65. doi: 10.1038/emi.2016.86. Emerg Microbes Infect. 2016. PMID: 27329850 Free PMC article. - Antagonizing canonical Wnt signaling pathway by recombinant human sFRP4 purified from E. coli and its implications in cancer therapy.
Ghoshal A, Ghosh SS. Ghoshal A, et al. Mol Cell Biochem. 2016 Jul;418(1-2):119-35. doi: 10.1007/s11010-016-2738-6. Epub 2016 Jun 23. Mol Cell Biochem. 2016. PMID: 27334754 - PBOV1 is a human de novo gene with tumor-specific expression that is associated with a positive clinical outcome of cancer.
Samusik N, Krukovskaya L, Meln I, Shilov E, Kozlov AP. Samusik N, et al. PLoS One. 2013;8(2):e56162. doi: 10.1371/journal.pone.0056162. Epub 2013 Feb 13. PLoS One. 2013. PMID: 23418531 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources