Interactive analysis and assessment of single-cell copy-number variations - PubMed (original) (raw)
Interactive analysis and assessment of single-cell copy-number variations
Tyler Garvin et al. Nat Methods. 2015 Nov.
Abstract
We present Ginkgo (http://qb.cshl.edu/ginkgo), a user-friendly, open-source web platform for the analysis of single-cell copy-number variations (CNVs). Ginkgo automatically constructs copy-number profiles of cells from mapped reads and constructs phylogenetic trees of related cells. We validated Ginkgo by reproducing the results of five major studies. After comparing three commonly used single-cell amplification techniques, we concluded that degenerate oligonucleotide-primed PCR is the most consistent for CNV analysis.
Figures
Figure 1
The Ginkgo flowchart for performing single-cell copy-number analysis. Usage and parameters are described in the online methods and on the website.
Figure 2
(a) Lowess fit of GC content with respect to log normalized bin counts for all samples in each of the 9 datasets analyzed: 3 for MDA (top left – green), 3 for MALBAC (center left – orange), and 3 for DOP-PCR (bottom left – blue). Each colored line within a plot corresponds to the lowess fit of a single sample. The dashed lines show a two fold increase or decrease in average observed coverage. Note that the three MDA datasets (top left) have a different y-axis scale due to the large GC biases present. (b) The median absolute deviation (MAD) of neighboring bins: A single pair-wise MAD value is generated for each sample in a given dataset and represented by a box and whisker plot. The bold center line represents the mean, the box boundaries represent the quartiles, and the whiskers represent the remaining data points. The high biases present in the MDA datasets make comparing DOP-PCR and MABLAC samples difficult. Figure 3 of the Online Methods shows this comparison more clearly.
Figure 3
A comparison of MAD between the Navin et al. (T10) shown in blue and Ni et al. (CTC) shown in orange. As the bin offset increases the separation between the mean T10 MAD and mean CTC MAD grows.
Figure 4
Histograms of normalized bin counts across the CTC and T10 datasets, for a high-, typical-, and poor-quality cell. The rightmost column contains histograms of high quality cell population averages. Distinct peaks are representative of clean data from which accurate copy number calls can be made.
Figure 5
(a) Model representation of the 9 distinct subclones generated by simulation of 100 copy number events with respect to the reference. (b) Hierarchical clustering of the 90 samples by Ginkgo. Ginkgo perfectly recovers the underlying subclonal population structure.
Similar articles
- Family-Based Benchmarking of Copy Number Variation Detection Software.
Nutsua ME, Fischer A, Nebel A, Hofmann S, Schreiber S, Krawczak M, Nothnagel M. Nutsua ME, et al. PLoS One. 2015 Jul 21;10(7):e0133465. doi: 10.1371/journal.pone.0133465. eCollection 2015. PLoS One. 2015. PMID: 26197066 Free PMC article. - COKGEN: a software for the identification of rare copy number variation from SNP microarrays.
Yavaş G, Koyutürk M, Ozsoyoğlu M, Gould MP, Laframboise T. Yavaş G, et al. Pac Symp Biocomput. 2010:371-82. Pac Symp Biocomput. 2010. PMID: 19908389 - CNVinspector: a web-based tool for the interactive evaluation of copy number variations in single patients and in cohorts.
Knierim E, Schwarz JM, Schuelke M, Seelow D. Knierim E, et al. J Med Genet. 2013 Aug;50(8):529-33. doi: 10.1136/jmedgenet-2012-101497. Epub 2013 May 31. J Med Genet. 2013. PMID: 23729504 - Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays.
Komura D, Shen F, Ishikawa S, Fitch KR, Chen W, Zhang J, Liu G, Ihara S, Nakamura H, Hurles ME, Lee C, Scherer SW, Jones KW, Shapero MH, Huang J, Aburatani H. Komura D, et al. Genome Res. 2006 Dec;16(12):1575-84. doi: 10.1101/gr.5629106. Epub 2006 Nov 22. Genome Res. 2006. PMID: 17122084 Free PMC article. - Targeted screening and validation of copy number variations.
Ceulemans S, van der Ven K, Del-Favero J. Ceulemans S, et al. Methods Mol Biol. 2012;838:311-28. doi: 10.1007/978-1-61779-507-7_15. Methods Mol Biol. 2012. PMID: 22228019 Review.
Cited by
- Single-cell somatic copy number variants in brain using different amplification methods and reference genomes.
Kalef-Ezra E, Turan ZG, Perez-Rodriguez D, Bomann I, Behera S, Morley C, Scholz SW, Jaunmuktane Z, Demeulemeester J, Sedlazeck FJ, Proukakis C. Kalef-Ezra E, et al. Commun Biol. 2024 Oct 9;7(1):1288. doi: 10.1038/s42003-024-06940-w. Commun Biol. 2024. PMID: 39384904 Free PMC article. - Improved allele-specific single-cell copy number estimation in low-coverage DNA-sequencing.
Weiner S, Li B, Nabavi S. Weiner S, et al. Bioinformatics. 2024 Aug 2;40(8):btae506. doi: 10.1093/bioinformatics/btae506. Bioinformatics. 2024. PMID: 39133157 Free PMC article. - SCCNAInfer: a robust and accurate tool to infer the absolute copy number on scDNA-seq data.
Zhang L, Zhou XM, Mallory X. Zhang L, et al. Bioinformatics. 2024 Jul 27;40(7):btae454. doi: 10.1093/bioinformatics/btae454. Online ahead of print. Bioinformatics. 2024. PMID: 39067018 Free PMC article. - HATCHet2: clone- and haplotype-specific copy number inference from bulk tumor sequencing data.
Myers MA, Arnold BJ, Bansal V, Balaban M, Mullen KM, Zaccaria S, Raphael BJ. Myers MA, et al. Genome Biol. 2024 May 21;25(1):130. doi: 10.1186/s13059-024-03267-x. Genome Biol. 2024. PMID: 38773520 Free PMC article. - Mapping recurrent mosaic copy number variation in human neurons.
Sun C, Kathuria K, Emery SB, Kim B, Burbulis IE, Shin JH; Brain Somatic Mosaicism Network; Weinberger DR, Moran JV, Kidd JM, Mills RE, McConnell MJ. Sun C, et al. Nat Commun. 2024 May 17;15(1):4220. doi: 10.1038/s41467-024-48392-0. Nat Commun. 2024. PMID: 38760338 Free PMC article.
References
- Shapiro E, Biezuner T, Linnarsson S. Single-cell sequencing-based technologies will revolutionize whole-organism science. Nature reviews. Genetics. 2013;14:618–630. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- R01 HG006677/HG/NHGRI NIH HHS/United States
- 5T32GM065094/GM/NIGMS NIH HHS/United States
- R01-HG006677/HG/NHGRI NIH HHS/United States
- T32 GM065094/GM/NIGMS NIH HHS/United States
- P30 CA045508/CA/NCI NIH HHS/United States
- 5P30CA045508/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources