Stability of Designed Proteins against Mutations (original) (raw)

Neutrality and evolvability of designed protein sequences

Physical Review E, 2010

The effect of foldability on protein's evolvability is analyzed by a two-prong approach consisting of a self-consistent mean-field theory and Monte Carlo simulations. Theory and simulation models representing protein sequences with binary patterning of amino acid residues compatible with a particular foldability criteria are used. This generalized foldability criterion is derived using the high temperature cumulant expansion approximating the free energy of folding. The effect of cumulative point mutations on these designed proteins is studied under neutral condition. The robustness, protein's ability to tolerate random point mutations is determined with a selective pressure of stability ͑⌬⌬G͒ for the theory designed sequences, which are found to be more robust than that of Monte Carlo and mean-field-biased Monte Carlo generated sequences. The results show that this foldability criterion selects viable protein sequences more effectively compared to the Monte Carlo method, which has a marked effect on how the selective pressure shapes the evolutionary sequence space. These observations may impact de novo sequence design and its applications in protein engineering.

Designability of protein structures

Journal of Molecular Graphics and Modelling, 2000

It has been noted that natural proteins adapt only a limited number of folds. Several researchers have investigated why and how nature has selected this small number of folds. Using simple models of protein folding, we demonstrate systematically that there is a "designability principle" behind nature's selection of protein folds. The designability of a structure (fold) is measured by the number of sequences that can design the structure-that is, sequences that possess the structure as their unique ground state. Structures differ drastically in terms of their designability. A small number of highly designable structures emerge with a number of associated sequences much larger than the average. These highly designable structures possess proteinlike secondary structures, motifs, and even tertiary symmetries. In addition, they are thermodynamically more stable and fold faster than other structures. These results suggest that protein structures are selected in nature because they are readily designed and stable against mutations, and that such a selection simultaneously leads to thermodynamic stability.

The Stability Effects of Protein Mutations Appear to be Universally Distributed

Journal of Molecular Biology, 2007

How the thermodynamic stability effects of protein mutations (ΔΔG) are distributed is a fundamental property related to the architecture, tolerance to mutations (mutational robustness), and evolutionary history of proteins. The stability effects of mutations also dictate the rate and dynamics of protein evolution, with deleterious mutations being the main inhibitory factor. Using the FoldX algorithm that attempts to computationally predict ΔΔG effects of mutations, we deduced the overall distributions of stability effects for all possible mutations in 21 different globular, single domain proteins. We found that these distributions are strikingly similar despite a range of sizes and folds, and largely follow a bi-Gaussian function: The surface residues exhibit a narrow distribution with a mildly destabilizing mean ΔΔG (∼ 0.6 kcal/mol), whereas the core residues exhibit a wider distribution with a stronger destabilizing mean (∼1.4 kcal/mol). Since smaller proteins have a higher fraction of surface residues, the relative weight of these single distributions correlates with size. We also found that proteins evolved in the laboratory follow an essentially identical distribution, whereas de novo designed folds show markedly less destabilizing distributions (i.e. they seem more robust to the effects of mutations). This bi-Gaussian model provides an analytical description of the predicted distributions of mutational stability effects. It comprises a novel tool for analyzing proteins and protein models, for simulating the effect of mutations under evolutionary processes, and a quantitative description of mutational robustness.

Theory for Protein Mutability and Biogenesis

Proceedings of The National Academy of Sciences, 1990

Using an elementary physical model for protein folding, of self-avoiding short copolymer chains on twodimensional square lattices, we address two questions regarding the evolution and origins of globular proteins. (i) How will protein native structures and stabilities be affected by singleand double-site mutations? (ii) What is the probability that a randomly chosen sequence of amino acids will be compact and globular under folding conditions? For a large number of different sequences, we search the conformational space exhaustively to find unequivocally the "native" conformation(s), of global minimum free energy, for each sequence. We find that replacing nonpolar residues in the core by polar residues is generally destabilizing, that surface sites are less sensitive than core sites, that some mutations increase the degeneracy of native states, and that overall it is most probable that a mutation will be neutral, having no effect on the native structure. These results support a "Continuity Principle," that small changes in sequence seldom have large effects on structure or stability of the native state. The simulations also show that (i) the number of "convergent" sequences (different sequences coding for the same native structure) is extremely large and (ii) most sequences become quite dense under folding conditions. This implies that the probability of formation of a globular protein from a random sequence of amino acids by prebiotic or mutational methods is significantly greater than zero.

A Large Scale Test of Computational Protein Design Folding and Stability of Nine Completely Redesigned Globular Proteins

A previously developed computer program for protein design, RosettaDesign, was used to predict low free energy sequences for nine naturally occurring protein backbones. RosettaDesign had no knowledge of the naturally occurring sequences and on average 65% of the residues in the designed sequences differ from wild-type. Synthetic genes for ten completely redesigned proteins were generated, and the proteins were expressed, purified, and then characterized using circular dichroism, chemical and temperature denaturation and NMR experiments. Although high-resolution structures have not yet been determined, eight of these proteins appear to be folded and their circular dichroism spectra are similar to those of their wild-type counterparts. Six of the proteins have stabilities equal to or up to 7 kcal/mol greater than their wild-type counterparts, and four of the proteins have NMR spectra consistent with a well-packed, rigid structure. These encouraging results indicate that the computational protein design methods can, with significant reliability, identify amino acid sequences compatible with a target protein backbone.

Evolutionary information for specifying a protein fold

Nature, 2005

Classical studies show that for many proteins, the information required for specifying the tertiary structure is contained in the amino acid sequence. Here, we attempt to define the sequence rules for specifying a protein fold by computationally creating artificial protein sequences using only statistical information encoded in a multiple sequence alignment and no tertiary structure information. Experimental testing of libraries of artificial WW domain sequences shows that a simple statistical energy function capturing coevolution between amino acid residues is necessary and sufficient to specify sequences that fold into native structures. The artificial proteins show thermodynamic stabilities similar to natural WW domains, and structure determination of one artificial protein shows excellent agreement with the WW fold at atomic resolution. The relative simplicity of the information used for creating sequences suggests a marked reduction to the potential complexity of the protein-folding problem.

Two proteins with the same structure respond very differently to mutation: the role of plasticity in protein stability

Journal of Molecular Biology, 2000

As part of a systematic study of the folding of protein structural families we compare the effect of mutation in two closely related ®bronectin type III (fnIII) domains, the tenth fnIII domain of human ®bronectin (FNfn10) and the third fnIII domain of human tenascin (TNfn3). This comparison of the two related proteins allows us to distinguish any anomalous response to mutation. Although they have very similar structures, the effect of mutation is very different. TNfn3 behaves like a``typical'' protein, with changes in free energy correlated to the number of contacts lost on mutation. The loss of free energy upon mutation is signi®cantly lower for FNfn10, particularly mutations of residues in the A, B and G strands. Remarkably, some of the residues involved are completely buried and closely packed in the core. In FNfn10 the regions of the protein that can accommodate mutation have previously been shown to be mobile. We propose that there is a``plasticity'' in the peripheral regions of FNfn10 that allows it to rearrange to minimise the effect of mutations. This study emphasises the dif®culties that might arise when making generalisations from a single member of a protein family.