VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing - PubMed (original) (raw)
VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing
Daniel C Koboldt et al. Genome Res. 2012 Mar.
Abstract
Cancer is a disease driven by genetic variation and mutation. Exome sequencing can be utilized for discovering these variants and mutations across hundreds of tumors. Here we present an analysis tool, VarScan 2, for the detection of somatic mutations and copy number alterations (CNAs) in exome data from tumor-normal pairs. Unlike most current approaches, our algorithm reads data from both samples simultaneously; a heuristic and statistical algorithm detects sequence variants and classifies them by somatic status (germline, somatic, or LOH); while a comparison of normalized read depth delineates relative copy number changes. We apply these methods to the analysis of exome sequence data from 151 high-grade ovarian tumors characterized as part of the Cancer Genome Atlas (TCGA). We validated some 7790 somatic coding mutations, achieving 93% sensitivity and 85% precision for single nucleotide variant (SNV) detection. Exome-based CNA analysis identified 29 large-scale alterations and 619 focal events per tumor on average. As in our previous analysis of these data, we observed frequent amplification of oncogenes (e.g., CCNE1, MYC) and deletion of tumor suppressors (NF1, PTEN, and CDKN2A). We searched for additional recurrent focal CNAs using the correlation matrix diagonal segmentation (CMDS) algorithm, which identified 424 significant events affecting 582 genes. Taken together, our results demonstrate the robust performance of VarScan 2 for somatic mutation and CNA detection and shed new light on the landscape of genetic alterations in ovarian cancer.
Figures
Figure 1.
The VarScan 2 mutation and copy number alteration detection algorithms. Alignments in BAM format for a tumor–normal pair are read simultaneously to identify inherited (germline), loss-of-heterozygosity (LOH), and somatic mutation events. Variants in each category are further classified as high confidence (HC) or low confidence (LC). HC variants are filtered to remove false positives from common sequencing- and alignment-related artifacts (see Table 1). The resulting variants are annotated and organized by tier; the average number of “tier 1” coding variants per tumor is shown for each category. At positions with at least 20× coverage (default), copy number alterations are detected by comparison of Q20 read depths from matched tumor–normal pairs, normalized based on the amount of input data for each sample. Raw contiguous regions from VarScan 2 are processed by circular binary segmentation (CBS) and a subsequent merging procedure that joins adjacent segments yields a set of somatic copy number alterations, which are further classified as large-scale (>25% of chromosome arm) or focal (<25%) events. Shown are the average numbers of events detected in 142 ovarian exomes.
Figure 2.
Detection of large-scale and focal copy number alterations by sequencing- and array-based approaches. (A) Deletions and focal amplifications of chromosome 4 in sample TCGA-24-1103. Copy number estimates from array (gray), WGS (light blue), and exome (dark blue) indicate two regions of deletion as well as a focal amplification (window). Red lines indicate segmented exome CBS calls. (Below) Variant allele frequencies in the normal (blue) and tumor (green) indicate regions of loss of heterozygosity (LOH) in deleted segments. (B) Intersection of large-scale copy number alterations detected by SNP array, whole-genome sequencing, and exome sequencing approaches for five HGS-OVCa cases. For details, see Supplemental Table 3. (C) Intersection of gene-level (focal) copy number alterations detected by SNP array, whole-genome sequencing, and exome sequencing approaches for five HGS-OVCa cases.
Figure 3.
Recurrent chromosome-arm gains and losses in ovarian cancer. Eight significant gains and 22 significant losses of chromosome arms identified by TCGA in SNP array data for 489 cases were recapitulated using exome data for 142 cases. Observed frequencies were highly correlated between data sets for both gains (r2 = 0.84) and losses (r2 = 0.86).
Figure 4.
Global copy number alteration profile of ovarian cancer. Average log2 of copy number difference is plotted for chromosomes 1–22 and X. Amplifications are shown in red, deletions in blue, and neutral regions in gray. Significant peaks associated with known oncogenes or tumor suppressor genes are indicated.
Figure 5.
Frequent copy number alteration of ovarian cancer genes. Exome-based copy number estimates were used to compute the proportion of ovarian cancer tumors (n = 142) exhibiting amplification or deletion of key ovarian cancer genes. Asterisks (*) indicate significantly altered genes identified from SNP array data in our previous study.
Similar articles
- Inferring copy number and genotype in tumour exome data.
Amarasinghe KC, Li J, Hunter SM, Ryland GL, Cowin PA, Campbell IG, Halgamuge SK. Amarasinghe KC, et al. BMC Genomics. 2014 Aug 28;15(1):732. doi: 10.1186/1471-2164-15-732. BMC Genomics. 2014. PMID: 25167919 Free PMC article. - A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal.
Sun JX, He Y, Sanford E, Montesion M, Frampton GM, Vignot S, Soria JC, Ross JS, Miller VA, Stephens PJ, Lipson D, Yelensky R. Sun JX, et al. PLoS Comput Biol. 2018 Feb 7;14(2):e1005965. doi: 10.1371/journal.pcbi.1005965. eCollection 2018 Feb. PLoS Comput Biol. 2018. PMID: 29415044 Free PMC article. - CNApp, a tool for the quantification of copy number alterations and integrative analysis revealing clinical implications.
Franch-Expósito S, Bassaganyas L, Vila-Casadesús M, Hernández-Illán E, Esteban-Fabró R, Díaz-Gay M, Lozano JJ, Castells A, Llovet JM, Castellví-Bel S, Camps J. Franch-Expósito S, et al. Elife. 2020 Jan 15;9:e50267. doi: 10.7554/eLife.50267. Elife. 2020. PMID: 31939734 Free PMC article. - Exome sequence read depth methods for identifying copy number changes.
Kadalayil L, Rafiq S, Rose-Zerilli MJ, Pengelly RJ, Parker H, Oscier D, Strefford JC, Tapper WJ, Gibson J, Ennis S, Collins A. Kadalayil L, et al. Brief Bioinform. 2015 May;16(3):380-92. doi: 10.1093/bib/bbu027. Epub 2014 Aug 28. Brief Bioinform. 2015. PMID: 25169955 Review. - Whole-exome sequencing identifies unique mutations and copy number losses in calcifying fibrous tumor of the pleura: report of 3 cases and review of the literature.
Mehrad M, LaFramboise WA, Lyons MA, Trejo Bittar HE, Yousem SA. Mehrad M, et al. Hum Pathol. 2018 Aug;78:36-43. doi: 10.1016/j.humpath.2018.04.005. Epub 2018 Apr 22. Hum Pathol. 2018. PMID: 29689243 Review.
Cited by
- miR-4477b gene as a novel pathogenic mutation occurring during the transformation of colorectal adenoma into colorectal cancer.
He X, Cheng G, Xiao F, Zhang L, Jin G, Zhao X, Liu Y, Liang J, Li Y, Liu Z, Yuan Q, Ren H, Wu Q, Wu J, Xue L, Feng J, Wang Z, Xing Y, Wu W, Li Z, Wei D, Song X. He X, et al. J Gastrointest Oncol. 2021 Feb;12(1):69-78. doi: 10.21037/jgo-20-600. J Gastrointest Oncol. 2021. PMID: 33708425 Free PMC article. - A comprehensive DNA panel next generation sequencing approach supporting diagnostics and therapy prediction in neurooncology.
Lorenz J, Rothhammer-Hampl T, Zoubaa S, Bumes E, Pukrop T, Kölbl O, Corbacioglu S, Schmidt NO, Proescholdt M, Hau P, Riemenschneider MJ. Lorenz J, et al. Acta Neuropathol Commun. 2020 Aug 5;8(1):124. doi: 10.1186/s40478-020-01000-w. Acta Neuropathol Commun. 2020. PMID: 32758285 Free PMC article. - Landscape of potentially targetable receptor tyrosine kinase fusions in diverse cancers by DNA-based profiling.
Wang T, Wei L, Lu Q, Shao Y, You S, Yin JC, Wang S, Shao Y, Chen Z, Wang Z. Wang T, et al. NPJ Precis Oncol. 2022 Nov 11;6(1):84. doi: 10.1038/s41698-022-00325-0. NPJ Precis Oncol. 2022. PMID: 36369474 Free PMC article. - Aedes aegypti miRNA-33 modulates permethrin induced toxicity by regulating VGSC transcripts.
Kubik TD, Snell TK, Saavedra-Rodriguez K, Wilusz J, Anderson JR, Lozano-Fuentes S, Black WC 4th, Campbell CL. Kubik TD, et al. Sci Rep. 2021 Mar 31;11(1):7301. doi: 10.1038/s41598-021-86665-6. Sci Rep. 2021. PMID: 33790374 Free PMC article. - KRAS and CREBBP mutations: a relapse-linked malicious liaison in childhood high hyperdiploid acute lymphoblastic leukemia.
Malinowska-Ozdowy K, Frech C, Schönegger A, Eckert C, Cazzaniga G, Stanulla M, zur Stadt U, Mecklenbräuker A, Schuster M, Kneidinger D, von Stackelberg A, Locatelli F, Schrappe M, Horstmann MA, Attarbaschi A, Bock C, Mann G, Haas OA, Panzer-Grümayer R. Malinowska-Ozdowy K, et al. Leukemia. 2015 Aug;29(8):1656-67. doi: 10.1038/leu.2015.107. Epub 2015 Apr 28. Leukemia. 2015. PMID: 25917266 Free PMC article.
References
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous