Genome-wide mutational diversity in an evolving population of Escherichia coli - PubMed (original) (raw)
Genome-wide mutational diversity in an evolving population of Escherichia coli
J E Barrick et al. Cold Spring Harb Symp Quant Biol. 2009.
Abstract
The level of genetic variation in a population is the result of a dynamic tension between evolutionary forces. Mutations create variation, certain frequency-dependent interactions may preserve diversity, and natural selection purges variation. New sequencing technologies offer unprecedented opportunities to discover and characterize the diversity present in evolving microbial populations on a whole-genome scale. By sequencing mixed-population samples, we have identified single-nucleotide polymorphisms (SNPs) present at various points in the history of an Escherichia coli population that has evolved for almost 20 years from a founding clone. With 50-fold genome coverage, we were able to catch beneficial mutations as they swept to fixation, discover contending beneficial alleles that were eliminated by clonal interference, and detect other minor variants possibly adapted to a new ecological niche. Additionally, there was a dramatic increase in genetic diversity late in the experiment after a mutator phenotype evolved. Still finer-resolution details of the structure of genetic variation and how it changes over time in microbial evolution experiments will enable new applications and quantitative tests of population genetic theory.
Figures
Figure 1. Expected dynamics in an evolving bacterial population
Lineages with new beneficial mutations are depicted as shaded wedges that originate in a previous genetic background and rise in frequency as they outcompete their ancestor and other lineages (Muller 1932). The same shading indicates lineages have equivalent fitnesses, and the path to the final dominant genotype containing five mutations is highlighted by the light gray curve. This figure was produced using a simulation with population size and mutation parameters meant to model the first 600 generations of the E. coli long-term evolution experiment (Woods 2005). Notice how the level of genetic diversity changes over time. Early on, a new beneficial mutation sweeps to fixation and the population has little diversity (a). Later, four lineages with different mutations coexist at appreciable frequencies for a time (b) before the descendants of one lineage become a majority (c).
Figure 2. Example coverage distribution and base error rates
The 2K mixed-population sample is displayed as representative of the sequence datasets. (a) The distribution of the number of ancestral genomic positions with a given read coverage depth (open circles) is over-dispersed relative to a Poisson model (dashed line) but is fit reasonably well by a negative binomial model (solid line). Repeat regions were excluded from this analysis. (b) The probability of a base error at a given quality score estimated from the number of observed mismatches in reads aligned to the reference genome usually decreases as a higher quality score is assigned to a base. Bases assigned a quality score of 10 had an anomalously high error rate in this dataset. The accompanying histogram shows that most bases in the dataset had high quality scores. Bases assigned a quality score of 40 do not appear on the log scale because they had zero errors.
Figure 3. Sensitivity of SNP prediction procedure
(a) Estimates of the probability that our statistical procedure would detect SNPs present at various frequencies in a mixed-population sample at different E-value cutoffs. For these calculations, the coverage and quality score distributions were those of the mixed-population 2K sample. (b) Estimates of sensitivity improvements possible by increasing sequencing coverage and by reducing the rate of base errors. For these calculations all sites had uniform coverage and the same error rate for all bases.
Figure 4. SNP predictions
The cumulative distributions of predictions below a given E-value threshold that also passed the bias filtering step are plotted for each dataset. Each panel contains a generation-paired mixed-population sample (squares and solid lines) and clone (circles and dashed lines), except there is only a clone at 0K and only a mixed population at 30K.
Figure 5. Mutational diversity in an evolving E. coli population
(a) Origin and eventual fate of point mutations in the 2K to 40K mixed-population samples. New mutations that first appear as SNPs or fixed alleles are shown as asterisks along the bottom or top, respectively, with arrows leading to the corresponding pools of SNPs and fixed mutations. Transient SNPs that were lost from the population are shown by descending lines ending in closed circles. Note that we only detect SNPs when they are between roughly 4% and 96% frequency in the population, and that we only recover approximately 50% of the SNPs at 5% frequency. Only the 49 SNP predictions in Table 2 were included for the 2K to 20K samples. (b) Stylized summary of the mixed-population SNP analysis. Shaded wedges represent subpopulations containing new mutations relative to the previous genetic background. Mutations are grouped to highlight their eventual fates, but we do not always have linkage information to resolve which SNPs occurred together. Labeled features are explained in the text.
Similar articles
- Tempo and mode of genome evolution in a 50,000-generation experiment.
Tenaillon O, Barrick JE, Ribeck N, Deatherage DE, Blanchard JL, Dasgupta A, Wu GC, Wielgoss S, Cruveiller S, Médigue C, Schneider D, Lenski RE. Tenaillon O, et al. Nature. 2016 Aug 11;536(7615):165-70. doi: 10.1038/nature18959. Epub 2016 Aug 1. Nature. 2016. PMID: 27479321 Free PMC article. - Evolutionary dynamics and structural consequences of de novo beneficial mutations and mutant lineages arising in a constant environment.
Kinnersley M, Schwartz K, Yang DD, Sherlock G, Rosenzweig F. Kinnersley M, et al. BMC Biol. 2021 Feb 4;19(1):20. doi: 10.1186/s12915-021-00954-0. BMC Biol. 2021. PMID: 33541358 Free PMC article. - Adaptation, Clonal Interference, and Frequency-Dependent Interactions in a Long-Term Evolution Experiment with Escherichia coli.
Maddamsetti R, Lenski RE, Barrick JE. Maddamsetti R, et al. Genetics. 2015 Jun;200(2):619-31. doi: 10.1534/genetics.115.176677. Epub 2015 Apr 24. Genetics. 2015. PMID: 25911659 Free PMC article. - Escherichia coli evolution during stationary phase.
Zinser ER, Kolter R. Zinser ER, et al. Res Microbiol. 2004 Jun;155(5):328-36. doi: 10.1016/j.resmic.2004.01.014. Res Microbiol. 2004. PMID: 15207864 Review. - Are Escherichia coli Pathotypes Still Relevant in the Era of Whole-Genome Sequencing?
Robins-Browne RM, Holt KE, Ingle DJ, Hocking DM, Yang J, Tauschek M. Robins-Browne RM, et al. Front Cell Infect Microbiol. 2016 Nov 18;6:141. doi: 10.3389/fcimb.2016.00141. eCollection 2016. Front Cell Infect Microbiol. 2016. PMID: 27917373 Free PMC article. Review.
Cited by
- Genomic analysis of a key innovation in an experimental Escherichia coli population.
Blount ZD, Barrick JE, Davidson CJ, Lenski RE. Blount ZD, et al. Nature. 2012 Sep 27;489(7417):513-8. doi: 10.1038/nature11514. Epub 2012 Sep 19. Nature. 2012. PMID: 22992527 Free PMC article. - Error correction of next-generation sequencing data and reliable estimation of HIV quasispecies.
Zagordi O, Klein R, Däumer M, Beerenwinkel N. Zagordi O, et al. Nucleic Acids Res. 2010 Nov;38(21):7400-9. doi: 10.1093/nar/gkq655. Epub 2010 Jul 29. Nucleic Acids Res. 2010. PMID: 20671025 Free PMC article. - Identification and dynamics of a beneficial mutation in a long-term evolution experiment with Escherichia coli.
Stanek MT, Cooper TF, Lenski RE. Stanek MT, et al. BMC Evol Biol. 2009 Dec 29;9:302. doi: 10.1186/1471-2148-9-302. BMC Evol Biol. 2009. PMID: 20040094 Free PMC article. - Improvement of thermotolerance in Lachancea thermotolerans using a bacterial selection pressure.
Zhou N, Ishchuk OP, Knecht W, Compagno C, Piškur J. Zhou N, et al. J Ind Microbiol Biotechnol. 2019 Feb;46(2):133-145. doi: 10.1007/s10295-018-2107-4. Epub 2018 Nov 28. J Ind Microbiol Biotechnol. 2019. PMID: 30488364 Free PMC article. - Innovation in an E. coli evolution experiment is contingent on maintaining adaptive potential until competition subsides.
Leon D, D'Alton S, Quandt EM, Barrick JE. Leon D, et al. PLoS Genet. 2018 Apr 12;14(4):e1007348. doi: 10.1371/journal.pgen.1007348. eCollection 2018 Apr. PLoS Genet. 2018. PMID: 29649242 Free PMC article.
References
- Atwood KC, Schneider LK, Ryan FJ. Selective mechanisms in bacteria. Cold Spring Harb. Symp. Quant. Biol. 1951;16:345–355. - PubMed
- Barrick JE, Yu D-S, Yoon SH, Jeong H, Oh TK, Schneider D, Lenski RE, Kim JF. Genome dynamics in a long-term experiment with Escherichia coli. submitted. 2009 - PubMed
- Buckling A, Maclean CR, Brockhurst MA, Colegrave N. The Beagle in a bottle. Nature. 2009;457:824–829. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- K99 GM087550/GM/NIGMS NIH HHS/United States
- K99 GM087550-01/GM/NIGMS NIH HHS/United States
- R00 GM087550/GM/NIGMS NIH HHS/United States
- GM-087550/GM/NIGMS NIH HHS/United States
LinkOut - more resources
Full Text Sources