Protein flexibility and intrinsic disorder - PubMed (original) (raw)

Comparative Study

Protein flexibility and intrinsic disorder

Predrag Radivojac et al. Protein Sci. 2004 Jan.

Abstract

Comparisons were made among four categories of protein flexibility: (1) low-B-factor ordered regions, (2) high-B-factor ordered regions, (3) short disordered regions, and (4) long disordered regions. Amino acid compositions of the four categories were found to be significantly different from each other, with high-B-factor ordered and short disordered regions being the most similar pair. The high-B-factor (flexible) ordered regions are characterized by a higher average flexibility index, higher average hydrophilicity, higher average absolute net charge, and higher total charge than disordered regions. The low-B-factor regions are significantly enriched in hydrophobic residues and depleted in the total number of charged residues compared to the other three categories. We examined the predictability of the high-B-factor regions and developed a predictor that discriminates between regions of low and high B-factors. This predictor achieved an accuracy of 70% and a correlation of 0.43 with experimental data, outperforming the 64% accuracy and 0.32 correlation of predictors based solely on flexibility indices. To further clarify the differences between short disordered regions and ordered regions, a predictor of short disordered regions was developed. Its relatively high accuracy of 81% indicates considerable differences between ordered and disordered regions. The distinctive amino acid biases of high-B-factor ordered regions, short disordered regions, and long disordered regions indicate that the sequence determinants for these flexibility categories differ from one another, whereas the significantly-greater-than-chance predictability of these categories from sequence suggest that flexible ordered regions, short disorder, and long disorder are, to a significant degree, encoded at the primary structure level.

PubMed Disclaimer

Figures

Figure 1.

Figure 1.

Amino acid compositions of various data sets. The composition of each amino acid of a reference data set of ordered proteins, Globular-3D, is subtracted from the composition of the four sets described herein; thus, negative peaks indicate depletions compared to the ordered reference set, and positive peaks represent enrichments. The order of the amino acids along the x-axis is from the most buried (left) to the most exposed (right) in typical globular proteins. Error bars indicate one standard deviation. Methionine at the N terminus and His-tags were not included in calculations.

Similar articles

Cited by

References

    1. Altman, R.B., Hughes, C., and Jardetzky, O. 1994. Compositional characteristics of disordered regions in proteins. Prot. Pept. Lett. 2 120–127.
    1. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25 3389–3402. - PMC - PubMed
    1. Benner, S.A., Cohen, M.A., and Gerloff, D. 1992. Correct structure prediction? Nature 359 781. - PubMed
    1. Benson, D.A., Boguski, M.S., Lipman, D.J., Ostell, J., Ouellette, B.F., Rapp, B.A., and Wheeler, D.L. 1999. GenBank. Nucleic Acids Res. 27 12–17. - PMC - PubMed
    1. Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The protein data bank. Nucleic Acids Res. 28 235–242. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources