Recharacterization of ancient DNA miscoding lesions: insights in the era of sequencing-by-synthesis - PubMed (original) (raw)

Recharacterization of ancient DNA miscoding lesions: insights in the era of sequencing-by-synthesis

M Thomas P Gilbert et al. Nucleic Acids Res. 2007.

Abstract

Although ancient DNA (aDNA) miscoding lesions have been studied since the earliest days of the field, their nature remains a source of debate. A variety of conflicting hypotheses exist about which miscoding lesions constitute true aDNA damage as opposed to PCR polymerase amplification error. Furthermore, considerable disagreement and speculation exists on which specific damage events underlie observed miscoding lesions. The root of the problem is that it has previously been difficult to assemble sufficient data to test the hypotheses, and near-impossible to accurately determine the specific strand of origin of observed damage events. With the advent of emulsion-based clonal amplification (emPCR) and the sequencing-by-synthesis technology this has changed. In this paper we demonstrate how data produced on the Roche GS20 genome sequencer can determine miscoding lesion strands of origin, and subsequently be interpreted to enable characterization of the aDNA damage behind the observed phenotypes. Through comparative analyses on 390,965 bp of modern chloroplast and 131,474 bp of ancient woolly mammoth GS20 sequence data we conclusively demonstrate that in this sample at least, a permafrost preserved specimen, Type 2 (cytosine-->thymine/guanine-->adenine) miscoding lesions represent the overwhelming majority of damage-derived miscoding lesions. Additionally, we show that an as yet unidentified guanine-->adenine analogue modification, not the conventionally argued cytosine-->uracil deamination, underpins a significant proportion of Type 2 damage. How widespread these implications are for aDNA will become apparent as future studies analyse data recovered from a wider range of substrates.

PubMed Disclaimer

Figures

Figure 1

Figure 1

Orientation of the DNA molecule at different steps of the data production process, and the algorithm used to subsequently segregate and analyse the sequence data. The GS20 emPCR and pyrosequencing process occurs in isolated, parallel reactions (up to 0.8 million per GS20 run). This figure illustrates key stages of the data-generation and analytical process for each individual reaction within a single GS20 run. (1a) Before emPCR original single-stranded DNA molecules are isolated. (2a) Post-emPCR, only descendents of the original molecule that are in the complementary orientation are retained. (1c) These molecules are pyrosequenced, generating sequence in the identical orientation to the originally isolated single-stranded DNA molecule. (1d) A simple analytical algorithm is then applied to the sequence to identify the orientation of, and any miscoding lesions in, the original single-stranded DNA molecule.

Similar articles

Cited by

References

    1. Shapiro B., Drummond A.J., Rambaut A., Wilson M., Sher A., Pybus O.G., Gilbert M.T.P., Barnes I., Binladen J., Willerslev E., et al. Rise and fall of the Beringian steppe bison. Science. 2004;306:1561–1565. - PubMed
    1. Poinar H.N., Schwarz C., Qi J., Shapiro B., MacPhee R.D.E., Buigues B., Tikhonov A., Huson D.H., Tomsho L.P., Auch A., et al. Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. Science. 2006;311:392–394. - PubMed
    1. Willerslev E., Hansen A.J., Binladen J., Brandt T.B., Gilbert M.T.P., Shapiro B., Bunce M., Wiuf C., Gilichinsky D.A., Cooper A. Diverse plant and animal genetic records from Holocene and Pleistocene sediments. Science. 2003;300:791–795. - PubMed
    1. Pääbo S. Ancient DNA: extraction, characterization, molecular cloning and enzymatic amplification. Proc. Natl Acad. Sci. USA. 1989;86:1939–1943. - PMC - PubMed
    1. Anderung C., Bouwman A., Persson P., Carretero J.M., Ortega A.I., Elburg R., Smith C., Arsuaga J.L., Ellegren H., Götherström A. Prehistoric contacts over the Straits of Gibraltar indicated by genetic analysis of Iberian Bronze Age cattle. Proc. Natl Acad. Sci. USA. 2005;102:8431–8435. - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources