Genome-wide detection and analysis of homologous recombination among sequenced strains of Escherichia coli - PubMed (original) (raw)
Genome-wide detection and analysis of homologous recombination among sequenced strains of Escherichia coli
Bob Mau et al. Genome Biol. 2006.
Abstract
Background: Comparisons of complete bacterial genomes reveal evidence of lateral transfer of DNA across otherwise clonally diverging lineages. Some lateral transfer events result in acquisition of novel genomic segments and are easily detected through genome comparison. Other more subtle lateral transfers involve homologous recombination events that result in substitution of alleles within conserved genomic regions. This type of event is observed infrequently among distantly related organisms. It is reported to be more common within species, but the frequency has been difficult to quantify since the sequences under comparison tend to have relatively few polymorphic sites.
Results: Here we report a genome-wide assessment of homologous recombination among a collection of six complete Escherichia coli and Shigella flexneri genome sequences. We construct a whole-genome multiple alignment and identify clusters of polymorphic sites that exhibit atypical patterns of nucleotide substitution using a random walk-based method. The analysis reveals one large segment (approximately 100 kb) and 186 smaller clusters of single base pair differences that suggest lateral exchange between lineages. These clusters include portions of 10% of the 3,100 genes conserved in six genomes. Statistical analysis of the functional roles of these genes reveals that several classes of genes are over-represented, including those involved in recombination, transport and motility.
Conclusion: We demonstrate that intraspecific recombination in E. coli is much more common than previously appreciated and may show a bias for certain types of genes. The described method provides high-specificity, conservative inference of past recombination events.
Figures
Figure 1
A multiple whole-genome alignment of six strains consists of 34 rearranged pieces larger than 1 kb. Each genome is laid out horizontally with homologous segments (LCBs) outlined as colored rectangles. Regions inverted relative to E. coli K-12 are set below those that match in the forward orientation. Lines collate aligned segments between genomes. Average sequence similarities within an LCB, measured in sliding windows, are proportional to the heights of interior colored bars. Large sections of white within blocks and gaps between blocks indicate lineage specific sequence.
Figure 2
Small sample segment of the alignment spanning the start of the mutS gene (denoted in blue). Location of a mismatch is indicated by the integer '1' along the bottom row. Five columns contain SNDs: TTTCTT, AAAGAA, AAATAA, GGGAGG, and GAAAAA. The first four share the same bipartition pattern (111211) and are deemed equivalent, even though one of them results from a transversion. The other SND is considered distinct despite having the same mutation (A to G) found in the second SND.
Figure 3
Three excursions (KS, KO, and KC) spanning the alignment with K-12 MG1655 as reference genome. The KS random walk plot, representing the dominant clonal topology, decreases more gradually than do the two other plots. Excursions for the discordant topologies (patterns KO and KC) run parallel to one another, except in a 100 kb region at 2 Mb where KO abruptly increases. Parallel flat gaps common to all three plots reflect K-12 lineage specific sequence.
Figure 4
The KS local random walk plot showing homologous recombination in the tryptophan (trp) operon. Genes are rectangular boxes positioned above or below the axis based on transcribed strand. KS SNDs form two non-overlapping MSCs with significant local scores exceeding 170. Both MSCs, with a combined length under 2 kb, are contained in a single 6.5 kb HSS covering most the trp operon. The positions of each KO, KC, and KS SND in E. coli K-12 are shown above the KS excursion. Random walk values below 50 are not plotted, resulting in the absence of visible KC or KO excursions.
Figure 5
Mosaic operons and genes. Three of six rha genes (rhaB, rhaA, and rhaD) belong to an operon on the reverse strand. This operon is unusual because well-defined recombination events clearly fall within gene boundaries; rhaD contains two dense KC clusters, whereas rhaA and rhaB contain predominantly KS and KO SNDs, respectively. In a nearby operon consisting of fdoG, fdoH, fdoI, and fdhE, there has been a KC intragenic recombination event with fdoG a mosaic, resulting from two recombination events, one of which is shared with fdoH.
Figure 6
Random walk plots for positive local scores in the vicinity of the speF gene. SpeF is a mosaic gene by virtue of its KS and KO clusters. Note the small cluster of KC SNDs appears to divide a large KS segment near coordinate 718,600. This short KC spike, though not statistically significant on a whole genome scale, would undoubtedly pass a single gene substitution distribution type test.
Figure 7
Percentage of SNDs supporting each of three topologies in a phylogenetic network for six E. coli genomes (four OTUs). Black lines describe the 'species' topology. Green, blue, and orange lines indicate the alternative pairings of sister taxa that result from KS, KO, and KC recombinations, respectively. Also shown is the percentage of SNDs supporting each bipartition in Table 1.
Figure 8
The location of all SNDs in a 5 kb region. In clusters demarcated by colored lines, note the corresponding absence of two more common types of SNDs. Three diamonds in lighter shades of blue, green, and red are compatible tri-partitions (see Additional data file 1). Colored lines demarcate regions where the absence of lineage-specific SNDs is offset by an increase in the corresponding recombinant pattern (for example, in yiaA, no K-12 or S. flexneri only SNDs).
Figure 9
Statistical justification of threshold values - 100, 100, and 170 for topologies KO, KC, and KS, respectively - used to identify recombination events. Values on the x-axis are maximal local scores. EVD probability densities for the maximum maximal local score attained by random walks of length M' appear as bell-shaped curves with a pronounced skew to the right. Threshold values, demarcated by vertical lines, correspond to conservative significance levels (α = 0.05) for these distributions.
Similar articles
- Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths.
Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, Bidet P, Bingen E, Bonacorsi S, Bouchier C, Bouvet O, Calteau A, Chiapello H, Clermont O, Cruveiller S, Danchin A, Diard M, Dossat C, Karoui ME, Frapy E, Garry L, Ghigo JM, Gilles AM, Johnson J, Le Bouguénec C, Lescat M, Mangenot S, Martinez-Jéhanne V, Matic I, Nassif X, Oztas S, Petit MA, Pichon C, Rouy Z, Ruf CS, Schneider D, Tourret J, Vacherie B, Vallenet D, Médigue C, Rocha EP, Denamur E. Touchon M, et al. PLoS Genet. 2009 Jan;5(1):e1000344. doi: 10.1371/journal.pgen.1000344. Epub 2009 Jan 23. PLoS Genet. 2009. PMID: 19165319 Free PMC article. - Understanding the differences between genome sequences of Escherichia coli B strains REL606 and BL21(DE3) and comparison of the E. coli B and K-12 genomes.
Studier FW, Daegelen P, Lenski RE, Maslov S, Kim JF. Studier FW, et al. J Mol Biol. 2009 Dec 11;394(4):653-80. doi: 10.1016/j.jmb.2009.09.021. Epub 2009 Sep 15. J Mol Biol. 2009. PMID: 19765592 - Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T.
Wei J, Goldberg MB, Burland V, Venkatesan MM, Deng W, Fournier G, Mayhew GF, Plunkett G 3rd, Rose DJ, Darling A, Mau B, Perna NT, Payne SM, Runyen-Janecky LJ, Zhou S, Schwartz DC, Blattner FR. Wei J, et al. Infect Immun. 2003 May;71(5):2775-86. doi: 10.1128/IAI.71.5.2775-2786.2003. Infect Immun. 2003. PMID: 12704152 Free PMC article. - A phylogenomic analysis of Escherichia coli / Shigella group: implications of genomic features associated with pathogenicity and ecological adaptation.
Zhang Y, Lin K. Zhang Y, et al. BMC Evol Biol. 2012 Sep 7;12:174. doi: 10.1186/1471-2148-12-174. BMC Evol Biol. 2012. PMID: 22958895 Free PMC article. - Comparison of 61 sequenced Escherichia coli genomes.
Lukjancenko O, Wassenaar TM, Ussery DW. Lukjancenko O, et al. Microb Ecol. 2010 Nov;60(4):708-20. doi: 10.1007/s00248-010-9717-3. Epub 2010 Jul 11. Microb Ecol. 2010. PMID: 20623278 Free PMC article. Review.
Cited by
- Dynamics of genome rearrangement in bacterial populations.
Darling AE, Miklós I, Ragan MA. Darling AE, et al. PLoS Genet. 2008 Jul 18;4(7):e1000128. doi: 10.1371/journal.pgen.1000128. PLoS Genet. 2008. PMID: 18650965 Free PMC article. - Population genomics and the bacterial species concept.
Riley MA, Lizotte-Waniewski M. Riley MA, et al. Methods Mol Biol. 2009;532:367-77. doi: 10.1007/978-1-60327-853-9_21. Methods Mol Biol. 2009. PMID: 19271196 Free PMC article. Review. - Genome-wide survey of mutual homologous recombination in a highly sexual bacterial species.
Yahara K, Kawai M, Furuta Y, Takahashi N, Handa N, Tsuru T, Oshima K, Yoshida M, Azuma T, Hattori M, Uchiyama I, Kobayashi I. Yahara K, et al. Genome Biol Evol. 2012;4(5):628-40. doi: 10.1093/gbe/evs043. Epub 2012 Apr 25. Genome Biol Evol. 2012. PMID: 22534164 Free PMC article. - The population genetics of pathogenic Escherichia coli.
Denamur E, Clermont O, Bonacorsi S, Gordon D. Denamur E, et al. Nat Rev Microbiol. 2021 Jan;19(1):37-54. doi: 10.1038/s41579-020-0416-x. Epub 2020 Aug 21. Nat Rev Microbiol. 2021. PMID: 32826992 Review. - Directed networks reveal genomic barriers and DNA repair bypasses to lateral gene transfer among prokaryotes.
Popa O, Hazkani-Covo E, Landan G, Martin W, Dagan T. Popa O, et al. Genome Res. 2011 Apr;21(4):599-609. doi: 10.1101/gr.115592.110. Epub 2011 Jan 26. Genome Res. 2011. PMID: 21270172 Free PMC article.
References
- Feil EJ, Maiden MC, Achtman M, Spratt BG. The relative contributions of recombination and mutation to the divergence of clones of Neisseria meningitidis. Mol Biol Evol. 1999;16:1496–1502. - PubMed
- Gogarten JP, Doolittle WF, Lawrence JG. Prokaryotic evolution in light of gene transfer. Mol Biol Evol. 2002;19:2226–2238. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous