Solving the riddle of codon usage preferences: a test for translational selection - PubMed (original) (raw)

. 2004 Sep 24;32(17):5036-44.

doi: 10.1093/nar/gkh834. Print 2004.

Affiliations

Solving the riddle of codon usage preferences: a test for translational selection

Mario dos Reis et al. Nucleic Acids Res. 2004.

Abstract

Translational selection is responsible for the unequal usage of synonymous codons in protein coding genes in a wide variety of organisms. It is one of the most subtle and pervasive forces of molecular evolution, yet, establishing the underlying causes for its idiosyncratic behaviour across living kingdoms has proven elusive to researchers over the past 20 years. In this study, a statistical model for measuring translational selection in any given genome is developed, and the test is applied to 126 fully sequenced genomes, ranging from archaea to eukaryotes. It is shown that tRNA gene redundancy and genome size are interacting forces that ultimately determine the action of translational selection, and that an optimal genome size exists for which this kind of selection is maximal. Accordingly, genome size also presents upper and lower boundaries beyond which selection on codon usage is not possible. We propose a model where the coevolution of genome size and tRNA genes explains the observed patterns in translational selection in all living organisms. This model finally unifies our understanding of codon usage across prokaryotes and eukaryotes. Helicobacter pylori, Saccharomyces cerevisiae and Homo sapiens are codon usage paradigms that can be better understood under the proposed model.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Genetic code and general codon–anticodon recognition rules for tRNA genes. This table simply summarizes all the theoretically possible interactions between the coding codons and the extant tRNA sequences in the organisms analysed in this work. The interested reader is advised to refer to the literature (47,46) for a detailed description of codon–anticodon pairings.

Figure 2

Figure 2

Nc-plot for yeast and simulated E.coli K12 genes. Grey points, simulated E.coli K12 genes; red points, actual yeast genes; dashed line, Wright's proposed function (Equation 5); bold line, the function proposed herein (Equation 8) with optimized parameters.

Figure 3

Figure 3

tAI versus _f_1(x)–Nc for five organisms for which codon usage has been well studied.

Figure 4

Figure 4

Action of natural selection on codon usage in the genomic landscape. (a) Fitted regression surface of _S_-values to tRNA gene number and genome size. Pink dots, predicted _S_-values for every organism; red dots, organisms with _S_-values higher than predicted by the model; blue, organisms with S-values lower than predicted. Vertical lines join each observed data point to its predicted value. (b) Thermal image (contour plot) of the same regression surface. The hottest (highest) _S_-values are shown in white, while the cooler (lowest) values are in red. E, Eukaryota; B, Eubacteria; and A, Archaea. The contour lines reflect the estimated _S_-value for a particular region. (c) Estimated probability density function for the presence of a maximum in the regression analysis. (d) Hypothetical evolution of codon usage optimization. A small genome sized ancestor (0), suffered a series of genome expansions (1–4). During this evolutionary process, the phylogeny would move into, and then out of the selective hot-spot. The process can be reverted at any time if selection for genome size or tRNA set reduction is present, such as in non-free living organisms e.g. H.pylori (5) or P.falciparum (6).

Similar articles

Cited by

References

    1. Kimura M. (1968) Evolutionary rate at the molecular level. Nature, 217, 624–626. - PubMed
    1. King J.L. and Jukes,T.H. (1969) Non-Darwinian evolution. Science, 164, 788–798. - PubMed
    1. Clarke B. (1970) Darwinian evolution of proteins. Science, 168, 1009–1011. - PubMed
    1. Post L.E. and Nomura,M. (1980) DNA sequences from the str operon of Escherichia coli. J. Biol. Chem., 255, 4660–4665. - PubMed
    1. Grantham R., Gautier,C., Guoy,M., Mercier,R. and Pave,A. (1980) Codon catalog usage and the genome hypothesis. Nucleic Acids Res., 8, r49–r62. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources