Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T - PubMed (original) (raw)

. 2003 May;71(5):2775-86.

doi: 10.1128/IAI.71.5.2775-2786.2003.

M B Goldberg, V Burland, M M Venkatesan, W Deng, G Fournier, G F Mayhew, G Plunkett 3rd, D J Rose, A Darling, B Mau, N T Perna, S M Payne, L J Runyen-Janecky, S Zhou, D C Schwartz, F R Blattner

Affiliations

Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T

J Wei et al. Infect Immun. 2003 May.

Erratum in

Abstract

We determined the complete genome sequence of Shigella flexneri serotype 2a strain 2457T (4,599,354 bp). Shigella species cause >1 million deaths per year from dysentery and diarrhea and have a lifestyle that is markedly different from those of closely related bacteria, including Escherichia coli. The genome exhibits the backbone and island mosaic structure of E. coli pathogens, albeit with much less horizontally transferred DNA and lacking 357 genes present in E. coli. The strain is distinctive in its large complement of insertion sequences, with several genomic rearrangements mediated by insertion sequences, 12 cryptic prophages, 372 pseudogenes, and 195 S. flexneri-specific genes. The 2457T genome was also compared with that of a recently sequenced S. flexneri 2a strain, 301. Our data are consistent with Shigella being phylogenetically indistinguishable from E. coli. The S. flexneri-specific regions contain many genes that could encode proteins with roles in virulence. Analysis of these will reveal the genetic basis for aspects of this pathogenic organism's distinctive lifestyle that have yet to be explained.

PubMed Disclaimer

Figures

FIG. 1.

FIG. 1.

Circular representation of the S. flexneri 2a 2457T genome and comparison with the E. coli K-12 genome. The outer circle shows the distribution of all ORFs. Blue represents ORFs in backbone regions with sequence identity to K-12, brown represents ORFs in S. flexneri_-specific islands, and pink represents IS ORFs. The location outside or inside the axis denotes the direction of transcription. The second circle shows the IS elements; the predominant IS, IS_1, is blue. Arrows in the third and fourth circles indicate rRNA (red) and tRNA (green). The fifth circle gives the genome scale in base pairs. The sixth circle shows the C/G skew calculated for each sliding window of 10 kb. In the comparison between S. flexneri 2a 2457T (seventh circle) and E. coli K-12 (the innermost circle), the segments in 2457T that are above (outside) the axis are colinear with K-12, and the segments below (inside) the axis are inverted relative to K-12. Since the difference in K-12 and 2457T genome lengths is small (0.8%) and large segments are homologous, the alignment between circles 7 and 8 is accurate within the limits of resolution for representation of the inversions. Blue represents colinear backbone, various shades of red represent backbone inverted in 2457T relative to K-12, various shades of green represent backbone translocated in 2457T relative to K-12, and yellow represents K-12- or 2457T-specific islands. Segments of the same color and length are homologous. The map was created by GenVision (DNASTAR).

FIG. 2.

FIG. 2.

Venn diagram showing the distribution of common and unique ORFs among S. flexneri 2a, E. coli K-12, and E. coli O157: H7. Only complete protein-coding ORFs, including hypothetical unknowns, are included. The IS element and phage ORFs, as well as pseudogenes, are excluded.

FIG. 3.

FIG. 3.

Comparison of backbone proteins. E. coli K-12 proteins with orthologs in all four pathogens (S. flexneri, E. coli O157: H7, and S. enterica serovars Typhi and Typhimurium), were aligned with each pathogen ortholog, and the percentage of identity was calculated. The results are plotted as a histogram. White bars, S. flexneri; black bars, O157: H7; dark gray bars, S. enterica serovar Typhimurium; light gray bars, S. enterica serovar Typhi.

FIG. 4.

FIG. 4.

Diagram of differences in genome organization between strains 2457T and 301. Diagonal lines join homologous regions that are not colinear in the two genomes. ter, terminus; ori, origin.

FIG. 5.

FIG. 5.

Diagram of the organization of ipaH genes in the 2457T and 301 genomes. Lines connect genes in the same positions relative to backbone. X marks genes inactivated by insertion sequences or frameshift mutations. Similar colors denote homology of the N-terminal portions of the encoded proteins. Asterisks show the relative positions of the terminus of chromosomal replication; a chromosomal rearrangement in 2457T spanning the terminus accounts for S1947 having the same flanking backbone regions as SF1383. The figure is not drawn to scale.

References

    1. Aderem, A., and R. J. Ulevitch. 2000. Toll-like receptors in the induction of the innate immune response. Nature 406:782-787. - PubMed
    1. Al-Hasani, K., I. R. Henderson, H. Sakellaris, K. Rajakumar, T. Grant, J. P. Nataro, R. Robins-Browne, and B. Adler. 2000. The sigA gene which is borne on the she pathogenicity island of Shigella flexneri 2a encodes an exported cytopathic protease involved in intestinal fluid accumulation. Infect. Immun. 68:2457-2463. - PMC - PubMed
    1. Anderson, R. J., M. F. Pasetti, M. B. Sztein, M. M. Levine, and F. R. Noriega. 2000. ΔguaBA attenuated Shigella flexneri2a strain CVD 1204 as a Shigella vaccine and as a live mucosal delivery system for fragment C of tetanus toxin. Vaccine 18:2193-2202. - PubMed
    1. Blattner, F. R., G. Plunkett III, C. A. Bloch, N. T. Perna, V. Burland, M. Riley, J. Collado-Vides, J. D. Glasner, C. K. Rode, G. F. Mayhew, J. Gregor, N. W. Davis, H. A. Kirkpatrick, M. A. Goeden, D. J. Rose, B. Mau, and Y. Shao. 1997. The complete genome sequence of Escherichia coli K-12. Science 277:1453-1474. - PubMed
    1. Brenner, D. J., G. R. Fanning, A. G. Steigerwalt, I. Ørskov, and F. Ørskov. 1972. Polynucleotide sequence relatedness among three groups of pathogenic Escherichia coli strains. Infect. Immun. 6:308-315. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources