A pyrosequencing-tailored nucleotide barcode design unveils opportunities for large-scale sample multiplexing - PubMed (original) (raw)
A pyrosequencing-tailored nucleotide barcode design unveils opportunities for large-scale sample multiplexing
Poornima Parameswaran et al. Nucleic Acids Res. 2007.
Abstract
Multiplexed high-throughput pyrosequencing is currently limited in complexity (number of samples sequenced in parallel), and in capacity (number of sequences obtained per sample). Physical-space segregation of the sequencing platform into a fixed number of channels allows limited multiplexing, but obscures available sequencing space. To overcome these limitations, we have devised a novel barcoding approach to allow for pooling and sequencing of DNA from independent samples, and to facilitate subsequent segregation of sequencing capacity. Forty-eight forward-reverse barcode pairs are described: each forward and each reverse barcode unique with respect to at least 4 nt positions. With improved read lengths of pyrosequencers, combinations of forward and reverse barcodes may be used to sequence from as many as n(2) independent libraries for each set of 'n' forward and 'n' reverse barcodes, for each defined set of cloning-linkers. In two pilot series of barcoded sequencing using the GS20 Sequencer (454/Roche), we found that over 99.8% of obtained sequences could be assigned to 25 independent, uniquely barcoded libraries based on the presence of either a perfect forward or a perfect reverse barcode. The false-discovery rate, as measured by the percentage of sequences with unexpected perfect pairings of unmatched forward and reverse barcodes, was estimated to be <0.005%.
Figures
Figure 1.
Design of forward and reverse primers. The synthesized primers are 45–46 nt long. (A and B) The template for the primers is: 454-Adapter:: Barcode:: Linker-Primer. Individual specifications for forward and reverse barcodes are also indicated. (C) Diagrammatic representation of the GS20 forward and reverse sequencing reads. ‘RC’ stands for Reverse Complement. Teeth denote base pairing.
Figure 2.
A diagrammatic representation of heteroduplexes that may be formed in an amplified pool. (A) Heteroduplexes formed between molecules from the same sample that have the same barcode, but different RNA inserts (X and Y). (B) Heteroduplexes formed between molecules with RNA inserts and molecules with no RNA inserts (or with fragments of linkers as inserts). Formation of these unusual duplexes may be facilitated by the 45–46 nt complementarity at either end of the insert. Thus, three types of molecules may be present during later stages of PCR: single-stranded, perfectly double stranded (Figure 1C) and heteroduplexes (shown here). The ratio of the three species is determined by the number of PCR cycles. As in Figure 1, ‘RC’ stands for Reverse Complement, and teeth denote base pairing.
Figure 3.
Visualizing the nature of amplified products from various cycles of PCR. With an increase in cycle number (twenty cycles of an initial round of PCR followed by six, eight or ten cycles of a second round of PCR), there is an evident shift in mobility of PCR products that contain a small RNA insert. Effect of PCR cycle number on three different samples is shown, stressing the importance of titrating the total number of DNA amplification cycles, to avoid saturation of the PCR amplification. Red arrows represent the two sizes of the insert-containing PCR products. PCR products without small RNA inserts migrate as faint bands between 75 and 100 bp (black arrow).
Similar articles
- Pair-barcode high-throughput sequencing for large-scale multiplexed sample analysis.
Tu J, Ge Q, Wang S, Wang L, Sun B, Yang Q, Bai Y, Lu Z. Tu J, et al. BMC Genomics. 2012 Jan 25;13:43. doi: 10.1186/1471-2164-13-43. BMC Genomics. 2012. PMID: 22276739 Free PMC article. - Impact of next-generation sequencing error on analysis of barcoded plasmid libraries of known complexity and sequence.
Deakin CT, Deakin JJ, Ginn SL, Young P, Humphreys D, Suter CM, Alexander IE, Hallwirth CV. Deakin CT, et al. Nucleic Acids Res. 2014;42(16):e129. doi: 10.1093/nar/gku607. Epub 2014 Jul 10. Nucleic Acids Res. 2014. PMID: 25013183 Free PMC article. - BARCOSEL: a tool for selecting an optimal barcode set for high-throughput sequencing.
Somervuo P, Koskinen P, Mei P, Holm L, Auvinen P, Paulin L. Somervuo P, et al. BMC Bioinformatics. 2018 Jul 5;19(1):257. doi: 10.1186/s12859-018-2262-7. BMC Bioinformatics. 2018. PMID: 29976145 Free PMC article. - Enhancing the detection of barcoded reads in high throughput DNA sequencing data by controlling the false discovery rate.
Buschmann T, Zhang R, Brash DE, Bystrykh LV. Buschmann T, et al. BMC Bioinformatics. 2014 Aug 7;15(1):264. doi: 10.1186/1471-2105-15-264. BMC Bioinformatics. 2014. PMID: 25099007 Free PMC article. - Dissecting intercellular and intracellular signaling networks with barcoded genetic tools.
Herholt A, Sahoo VK, Popovic L, Wehr MC, Rossner MJ. Herholt A, et al. Curr Opin Chem Biol. 2022 Feb;66:102091. doi: 10.1016/j.cbpa.2021.09.002. Epub 2021 Oct 10. Curr Opin Chem Biol. 2022. PMID: 34644670 Review.
Cited by
- Amplicon sequencing for the quantification of spoilage microbiota in complex foods including bacterial spores.
de Boer P, Caspers M, Sanders JW, Kemperman R, Wijman J, Lommerse G, Roeselers G, Montijn R, Abee T, Kort R. de Boer P, et al. Microbiome. 2015 Jul 27;3:30. doi: 10.1186/s40168-015-0096-3. eCollection 2015. Microbiome. 2015. PMID: 26217487 Free PMC article. - Partitioning the C. elegans genome by nucleosome modification, occupancy, and positioning.
Gu SG, Fire A. Gu SG, et al. Chromosoma. 2010 Feb;119(1):73-87. doi: 10.1007/s00412-009-0235-3. Epub 2009 Aug 25. Chromosoma. 2010. PMID: 19705140 Free PMC article. - PANGEA: pipeline for analysis of next generation amplicons.
Giongo A, Crabb DB, Davis-Richardson AG, Chauliac D, Mobberley JM, Gano KA, Mukherjee N, Casella G, Roesch LF, Walts B, Riva A, King G, Triplett EW. Giongo A, et al. ISME J. 2010 Jul;4(7):852-61. doi: 10.1038/ismej.2010.16. Epub 2010 Feb 25. ISME J. 2010. PMID: 20182525 Free PMC article. - High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species.
Grattapaglia D, Silva-Junior OB, Kirst M, de Lima BM, Faria DA, Pappas GJ Jr. Grattapaglia D, et al. BMC Plant Biol. 2011 Apr 14;11:65. doi: 10.1186/1471-2229-11-65. BMC Plant Biol. 2011. PMID: 21492434 Free PMC article. - Quantitative bias in Illumina TruSeq and a novel post amplification barcoding strategy for multiplexed DNA and small RNA deep sequencing.
Van Nieuwerburgh F, Soetaert S, Podshivalova K, Ay-Lin Wang E, Schaffer L, Deforce D, Salomon DR, Head SR, Ordoukhanian P. Van Nieuwerburgh F, et al. PLoS One. 2011;6(10):e26969. doi: 10.1371/journal.pone.0026969. Epub 2011 Oct 28. PLoS One. 2011. PMID: 22046424 Free PMC article.
References
- Bennett ST, Barnes C, Cox A, Davies L, Brown C. Toward the 1,000 dollars human genome. Pharmacogenomics. 2005;6:373–382. - PubMed
- Ahmadian A, Ehn M, Hober S. Pyrosequencing: history, biochemistry and future. Clin. Chim. Acta. 2006;363:83–94. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- R01 HG003571/HG/NHGRI NIH HHS/United States
- 1S10RR022982/RR/NCRR NIH HHS/United States
- R01HG003571/HG/NHGRI NIH HHS/United States
- R01 GM37706/GM/NIGMS NIH HHS/United States
- R01 GM037706/GM/NIGMS NIH HHS/United States
- S10 RR022982/RR/NCRR NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials