Organellar Genomes of White Spruce (Picea glauca): Assembly and Annotation - PubMed (original) (raw)

René L Warren 1, Ewan A Gibb 1, Benjamin P Vandervalk 1, Hamid Mohamadi 1, Justin Chu 1, Anthony Raymond 1, Stephen Pleasance 1, Robin Coope 1, Mark R Wildung 2, Carol E Ritland 3, Jean Bousquet 4, Steven J M Jones 5, Joerg Bohlmann 6, Inanç Birol 7

Affiliations

Organellar Genomes of White Spruce (Picea glauca): Assembly and Annotation

Shaun D Jackman et al. Genome Biol Evol. 2015.

Abstract

The genome sequences of the plastid and mitochondrion of white spruce (Picea glauca) were assembled from whole-genome shotgun sequencing data using ABySS. The sequencing data contained reads from both the nuclear and organellar genomes, and reads of the organellar genomes were abundant in the data as each cell harbors hundreds of mitochondria and plastids. Hence, assembly of the 123-kb plastid and 5.9-Mb mitochondrial genomes were accomplished by analyzing data sets primarily representing low coverage of the nuclear genome. The assembled organellar genomes were annotated for their coding genes, ribosomal RNA, and transfer RNA. Transcript abundances of the mitochondrial genes were quantified in three developmental tissues and five mature tissues using data from RNA-seq experiments. C-to-U RNA editing was observed in the majority of mitochondrial genes, and in four genes, editing events were noted to modify ACG codons to create cryptic AUG start codons. The informatics methodology presented in this study should prove useful to assemble organellar genomes of other plant species using whole-genome shotgun sequencing data.

Keywords: ABySS; genome assembly; gymnosperms; organelle; sequencing; white spruce.

© The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

PubMed Disclaimer

Figures

F<sc>ig</sc>. 1.—

Fig. 1.—

The complete plastid genome of white spruce. The PG29 white spruce chloroplast genome was annotated using MAKER and plotted using OrganellarGenomeDRAW (Lohse et al. 2007). The inner gray track depicts the G+C content of the genome.

F<sc>ig</sc>. 2.—

Fig. 2.—

Relative order and size of genes on the scaffolds of the white spruce mitochondrial genome. Each box is proportional to the size of the gene including introns, except that genes smaller than 200 bp are shown as 200 bp. The space between genes is not to scale. An asterisk indicates that the gene name is truncated. Only scaffolds that harbor annotated genes are shown.

F<sc>ig</sc>. 3.—

Fig. 3.—

Gene content of the white spruce mitochondrial genome, grouped by gene family. Each box is proportional to the size of the gene including introns. The color of each gene is unique within its gene family.

F<sc>ig</sc>. 4.—

Fig. 4.—

Repetitive sequence content of the white spruce mitochondrial genome, annotated using RepeatMasker and RepeatModeler.

F<sc>ig</sc>. 5.—

Fig. 5.—

Heatmap of the transcript abundance of mitochondrial protein-coding genes of white spruce. Each column is a tissue sample. Each row is a gene. Each cell represents the transcript abundance of one gene in one sample. The color scale is log10(TPM+1), where TPM is transcripts per million as measured by Salmon (Patro et al. 2014).

F<sc>ig</sc>. 6.—

Fig. 6.—

Heatmap of the transcript abundance of mitochondrial protein-coding genes of white spruce, including ORFs. Each column is a tissue sample. Each row is a gene. Each cell represents the transcript abundance of one gene in one sample. The color scale is log10(TPM+1), where TPM is transcripts per million as measured by Salmon (Patro et al. 2014).

References

    1. Aizawa M, Kim ZS, Yoshimaru H. 2012. Phylogeography of the Korean pine (Pinus koraiensis) in northeast Asia: inferences from organelle gene sequences. J Plant Res. 125:713–723. -PubMed
    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol. 215:403–410. -PubMed
    1. Alverson AJ, et al. 2010. Insights into the evolution of mitochondrial genome size from complete sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol Biol Evol. 27:1436–1448. -PMC -PubMed
    1. Barkan A. 1988. Proteins encoded by a complex chloroplast transcription unit are each translated from both monocistronic and polycistronic mRNAs. Embo J. 7:2637–2644. -PMC -PubMed
    1. Benson DA, et al. 2014. GenBank. Nucleic Acids Res. 42:D32–D37. -PMC -PubMed

Publication types

MeSH terms

LinkOut - more resources