De novo assembly of the Pseudomonas syringae pv. syringae B728a genome using Illumina/Solexa short sequence reads
Rhys A Farrer et al. FEMS Microbiol Lett. 2009 Feb.
Abstract
Illumina's Genome Analyzer generates ultra-short sequence reads, typically 36 nucleotides in length, and is primarily intended for resequencing. We tested the potential of this technology for de novo sequence assembly on the 6 Mbp genome of Pseudomonas syringae pv. syringae B728a with several freely available assembly software packages. Using an unpaired data set, Velvet assembled >96% of the genome into contigs with an N50 length of 8289 nucleotides and an error rate of 0.33%. EDENA generated smaller contigs (N50 of 4192 nucleotides) with comparable error rates. SSAKE and VCAKE yielded shorter contigs with very high error rates. Assembly of paired-end sequence data carrying 400 bp inserts produced longer contigs (N50 up to 15 628 nucleotides), but with an increased error rate (0.5%). Contig length and error rate were very sensitive to the choice of parameter values. Noncoding RNA genes were poorly resolved in de novo assemblies, while >90% of the protein-coding genes were assembled with 100% accuracy over their full length. This study demonstrates that, in practice, de novo assembly of 36-nucleotide reads can generate reasonably accurate assemblies from data sets sequenced to a depth of about 40×. These draft assemblies are useful for exploring an organism's proteomic potential at very low cost.
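The two quantities the abstract leans on, N50 and sequencing depth, can be illustrated with a minimal sketch. It assumes the commonly used N50 definition (the contig length at which the running total of contigs, sorted longest first, reaches half of the assembled bases) and the usual depth relation, depth = reads × read length / genome size. The abstract does not say exactly how the authors computed either value, and the function names `n50` and `reads_for_depth` are hypothetical, chosen only for illustration.

```python
def n50(contig_lengths):
    """Return the N50 of a set of contigs.

    Assumed definition: sort contigs from longest to shortest and return the
    length of the contig at which the running total first reaches half of
    the total assembled bases.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("no contigs given")
    total = sum(lengths)
    running = 0
    for length in lengths:
        running += length
        # Compare 2 * running with total to stay in integer arithmetic.
        if 2 * running >= total:
            return length


def reads_for_depth(genome_size, read_length, depth):
    """Number of reads needed for a given mean depth.

    Uses depth = reads * read_length / genome_size, rounded up to a whole read.
    """
    bases_needed = genome_size * depth
    return -(-bases_needed // read_length)  # integer ceiling division


if __name__ == "__main__":
    # Toy contig set, not data from the study.
    print(n50([2, 3, 4, 5, 6]))
    # Approximate figures from the abstract: 6 Mbp genome, 36-nt reads, ~40x depth.
    print(reads_for_depth(6_000_000, 36, 40))
```

With the rounded figures from the abstract, a depth of about 40× over a 6 Mbp genome corresponds to roughly 6.7 million 36-nucleotide reads. This is an estimate derived from the stated numbers, not a read count reported by the authors.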