Genome-wide detection of single-nucleotide and copy-number variations of a single human cell - PubMed (original) (raw)
Genome-wide detection of single-nucleotide and copy-number variations of a single human cell
Chenghang Zong et al. Science. 2012.
Abstract
Kindred cells can have different genomes because of dynamic changes in DNA. Single-cell sequencing is needed to characterize these genomic differences but has been hindered by whole-genome amplification bias, resulting in low genome coverage. Here, we report on a new amplification method-multiple annealing and looping-based amplification cycles (MALBAC)-that offers high uniformity across the genome. Sequencing MALBAC-amplified DNA achieves 93% genome coverage ≥1x for a single human cell at 25x mean sequencing depth. We detected digitized copy-number variations (CNVs) of a single cancer cell. By sequencing three kindred cells, we were able to identify individual single-nucleotide variations (SNVs), with no false positives detected. We directly measured the genome-wide mutation rate of a cancer cell line and found that purine-pyrimidine exchanges occurred unusually frequently among the newly acquired SNVs.
Conflict of interest statement
Competing Financial Interests
CZ, SL and XSX are authors on a patent applied for by Harvard University that covers the MALBAC technology
Figures
Figure 1
MALBAC single cell whole genome amplification. A single cell is picked and lysed. First, genomic DNA of the single cell is melted into single-stranded DNA molecules at 94°C. MALBAC primers then anneal randomly to single-stranded DNA molecules at 0°C and are extended by a polymerase with displacement activity at elevated temperatures, creating semi-amplicons. In the following five temperature cycles, after the step of looping the full amplicons, single stranded amplicons and the genomic DNA are used as template to produce full amplicons and additional semi-amplicons, respectively. For full amplicons, the 3′ end is complementary to the sequence on the 5′ end. The two ends hybridize will form the looped DNA, which can efficiently prevents the full amplicon from being used as template, therefore warrant a close-to-linear amplification. After the five cycles of linear preamplification, only the full amplicons can be exponentially amplified in the following PCR using the common 27-nucleotide sequence as the primer. PCR reaction will generate microgram level of DNA material for sequencing experiments.
Figure 2
Characterization of amplification uniformity. (A) Histograms of reads over the entirety of Chromosome 1 of a single cell from the SW480 cancer cell line and the zoom-in of a ~8 million base region (chr1: 62,023,147–70,084,845). (B) Lorenz curves of MALBAC, MDA and bulk sample. A Lorenz curve gives the cumulated fraction of reads as a function of the cumulated fraction of genome. Perfectly uniform coverage would result in a diagonal line and a large deviation from the diagonal is indicative of a biased coverage. The green and blue arrows indicate the uncovered fractions of the genome for MALBAC and MDA respectively. All samples are sequenced at 25x depth. (C) Power spectrum of read density throughout the genome (as a function of spatial frequency). MALBAC performs similarly to bulk, while the MDA spectrum shows high amplitude at low frequency, demonstrating that regions of several megabases suffer from under- and over- amplification. This observation is consistent with the variations of read depth in Fig. S3 (SOM).
Figure 3
CNVs of single cancer cells. Digitized copy numbers across the genome are plotted for three single cells (Panel A to C) as well as the bulk sample (Panel D) from the SW480 cancer cell line. The bottom panel shows the result based on MDA amplification (Panel E). Green lines are fitted CNV numbers obtained from the hidden Markov model (SOM). The single cells are sequenced at only 0.8x depth, while the bulk and MDA are done at 25x. More single cells’ CNV analyses are included in the SOM (Fig. S4). The regions within the dashed box exhibit the CNV differences among single cells and the bulk, which cannot be resolved by MDA. The binning window is 200kb.
Figure 4
Calling newly acquired SNVs and estimation of mutation rate of a cancer cell line (SW480). (A) Experiment design. A single ancestor cell is chosen and cultured for ~20 generations. The vast majority of cells are used to extract DNA for bulk sequencing to represent the ancestor cell’s genome. A single cell from this culture is chosen for another expansion of four generations. The kindred cells are isolated for single cell whole genome amplification. Single cell sample C1, C2, and C3 are used for high-throughput sequencing. Sample C4, C5, and C6 are used for varying SNVs with Sanger sequencing. (B) 3D p-value plot of a one-sided binomial test for SNV candidates from the three kindred cells. The black dots are the false positives due to uncorrelated amplification errors; all of them are on the x-y-z axis and x-y, y-z, x-z planes. Outside of the three planes, the 166 green dots are the residual false positives due to correlated errors from homopolymers, tandem repeats, high-GC content and high density SNV regions, and the 35 red dots are the newly acquired SNVs during the 20 generations of clonal expansion (SOM). We note that the homozygous SNVs are located at the (1,1,1) position. (C) Locations of the 35 newly acquired SNVs on the chromosomes of a single cell (SOM). (D) Next-generation sequencing data of a newly acquired SNV. The SNV (C→G) exists in the high throughput data of all three kindred cells but not in the bulk data. (E) Sanger sequencing data of single cells C4, C5, and C6 confirms that this SNV is not a false positive, while the Sanger sequencing of the bulk confirms that this SNV is not a false negative of next generation sequencing of the bulk (i.e. this SNV is indeed absent in the bulk).
Comment in
- Single cells go fully genomic.
Rusk N. Rusk N. Nat Methods. 2013 Mar;10(3):190-1. doi: 10.1038/nmeth.2391. Nat Methods. 2013. PMID: 23570041 No abstract available.
Similar articles
- Comparison of multiple displacement amplification (MDA) and multiple annealing and looping-based amplification cycles (MALBAC) in single-cell sequencing.
Chen M, Song P, Zou D, Hu X, Zhao S, Gao S, Ling F. Chen M, et al. PLoS One. 2014 Dec 8;9(12):e114520. doi: 10.1371/journal.pone.0114520. eCollection 2014. PLoS One. 2014. PMID: 25485707 Free PMC article. - Microfluidic whole genome amplification device for single cell sequencing.
Yu Z, Lu S, Huang Y. Yu Z, et al. Anal Chem. 2014 Oct 7;86(19):9386-90. doi: 10.1021/ac5032176. Epub 2014 Sep 22. Anal Chem. 2014. PMID: 25233049 - Systematic assessment of the performance of whole-genome amplification for SNP/CNV detection and β-thalassemia genotyping.
He F, Zhou W, Cai R, Yan T, Xu X. He F, et al. J Hum Genet. 2018 Apr;63(4):407-416. doi: 10.1038/s10038-018-0411-5. Epub 2018 Feb 13. J Hum Genet. 2018. PMID: 29440707 - [Comparison of different single cell whole genome amplification methods and MALBAC applications in assisted reproduction].
Yao YX, La YF, Di R, Liu QY, Hu WP, Wang XY, Chu MX. Yao YX, et al. Yi Chuan. 2018 Aug 16;40(8):620-631. doi: 10.16288/j.yczz.18-091. Yi Chuan. 2018. PMID: 30117418 Review. Chinese. - Single-Cell Whole-Genome Amplification and Sequencing: Methodology and Applications.
Huang L, Ma F, Chapman A, Lu S, Xie XS. Huang L, et al. Annu Rev Genomics Hum Genet. 2015;16:79-102. doi: 10.1146/annurev-genom-090413-025352. Epub 2015 Jun 11. Annu Rev Genomics Hum Genet. 2015. PMID: 26077818 Review.
Cited by
- Deciphering breast cancer dynamics: insights from single-cell and spatial profiling in the multi-omics era.
Xiong X, Wang X, Liu CC, Shao ZM, Yu KD. Xiong X, et al. Biomark Res. 2024 Sep 18;12(1):107. doi: 10.1186/s40364-024-00654-1. Biomark Res. 2024. PMID: 39294728 Free PMC article. Review. - Single-Cell RNA-Sequencing: Opening New Horizons for Breast Cancer Research.
Xiang L, Rao J, Yuan J, Xie T, Yan H. Xiang L, et al. Int J Mol Sci. 2024 Aug 31;25(17):9482. doi: 10.3390/ijms25179482. Int J Mol Sci. 2024. PMID: 39273429 Free PMC article. Review. - Application of single cell sequencing technology in ovarian cancer research (review).
Yuan Q, Lv N, Chen Q, Shen S, Wang Y, Tong J. Yuan Q, et al. Funct Integr Genomics. 2024 Aug 28;24(5):144. doi: 10.1007/s10142-024-01432-w. Funct Integr Genomics. 2024. PMID: 39196391 Free PMC article. Review. - Single-Cell Sequencing: High-Resolution Analysis of Cellular Heterogeneity in Autoimmune Diseases.
Tang X, Zhang Y, Zhang H, Zhang N, Dai Z, Cheng Q, Li Y. Tang X, et al. Clin Rev Allergy Immunol. 2024 Aug 26. doi: 10.1007/s12016-024-09001-6. Online ahead of print. Clin Rev Allergy Immunol. 2024. PMID: 39186216 Review. - Applications of single-cell technologies in drug discovery for tumor treatment.
Liu B, Hu S, Wang X. Liu B, et al. iScience. 2024 Jul 10;27(8):110486. doi: 10.1016/j.isci.2024.110486. eCollection 2024 Aug 16. iScience. 2024. PMID: 39171294 Free PMC article. Review.
References
- Elowitz MB, Levine AJ, Siggia ED, Swain PS. Stochastic gene expression in a single cell. Science. 2002 Aug 16;297:1183. - PubMed
- Negrini S, Gorgoulis VG, Halazonetis TD. Genomic instability--an evolving hallmark of cancer. Nature reviews Molecular cell biology. 2010 Mar;11:220. - PubMed
- Lengauer C, Kinzler KW, Vogelstein B. Genetic instabilities in human cancers. Nature. 1998 Dec 17;396:643. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- RC2 HG005613/HG/NHGRI NIH HHS/United States
- HG005613-01/HG/NHGRI NIH HHS/United States
- HG005097-1/HG/NHGRI NIH HHS/United States
- T32 GM008313/GM/NIGMS NIH HHS/United States
- R01 HG005097/HG/NHGRI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical