MuSiC: identifying mutational significance in cancer genomes - PubMed (original) (raw)

. 2012 Aug;22(8):1589-98.

doi: 10.1101/gr.134635.111. Epub 2012 Jul 3.

Qunyuan Zhang, Cyriac Kandoth, Michael C Wendl, William Schierding, Daniel C Koboldt, Thomas B Mooney, Matthew B Callaway, David Dooling, Elaine R Mardis, Richard K Wilson, Li Ding

Affiliations

MuSiC: identifying mutational significance in cancer genomes

Nathan D Dees et al. Genome Res. 2012 Aug.

Abstract

Massively parallel sequencing technology and the associated rapidly decreasing sequencing costs have enabled systemic analyses of somatic mutations in large cohorts of cancer cases. Here we introduce a comprehensive mutational analysis pipeline that uses standardized sequence-based inputs along with multiple types of clinical data to establish correlations among mutation sites, affected genes and pathways, and to ultimately separate the commonly abundant passenger mutations from the truly significant events. In other words, we aim to determine the Mutational Significance in Cancer (MuSiC) for these large data sets. The integration of analytical operations in the MuSiC framework is widely applicable to a broad set of tumor types and offers the benefits of automation as well as standardization. Herein, we describe the computational structure and statistical underpinnings of the MuSiC pipeline and demonstrate its performance using 316 ovarian cancer samples from the TCGA ovarian cancer project. MuSiC correctly confirms many expected results, and identifies several potentially novel avenues for discovery.

PubMed Disclaimer

Figures

Figure 1.

Figure 1.

MuSiC flow diagram. MuSiC modules can either be implemented individually with various required input files or may be implemented in serial via one command where four inputs are used to execute the entire package of tools.

Figure 2.

Figure 2.

Mutation rates and SMGs in the OV data set. (A) The cohort-wide background mutation rates for all seven mutational mechanism categories are plotted for the OV data set. The overall BMR is also plotted, combining all types of mutations. (B) −Log10(P) for the top 12 OV SMGs are plotted for all three SMG tests in order of decreasing convolution test _P_-value.

Figure 3.

Figure 3.

Mutation relation analysis. A heat map showing mutations in highly mutated genes for all 316 OV samples. Dotted-line boxes highlight concurrent nonsynonymous EMR3 and FAT3 mutations (two concurrent mutations out of five nonsynonymous mutations from EMR3 and 18 from FAT3, P = 0.0333) and mutually exclusive nonsynonymous RB1 and TP53 mutations (297 mutually exclusive mutations out of six nonsynonymous mutations from RB1 and 299 from TP53, P = 0.0141).

Figure 4.

Figure 4.

BRCA1 variant status versus sample age of diagnosis for the OV data set. A boxplot of the age of diagnosis of 315 OV patients grouped by their BRCA1 mutation status. Germline BRCA1 variant status is correlated with a lower age of diagnosis via the CCT (P = 2.456 × 10−5).

Figure 5.

Figure 5.

Proximity analysis mutation diagrams. These mutation diagrams show recurrent triplet mutations in both UBR4 and RB1CC1, both of which harbor a relationship with tumor suppressor gene RB1.

Figure 6.

Figure 6.

Pfam domains affected by OV mutations. (A) A histogram of the most highly mutated domains in the OV data set next to the number of genes affected in each domain. (B) A stacked bar-graph where the value 100% represents the total number of mutations in a particular Pfam domain. Lighter and darker sections of the bars represent which proportions of the total mutations are nonsynonymous and synonymous, respectively.

References

    1. Abdul M, Ramlal S, Hoosein N 2008. Ryanodine receptor expression correlates with tumor grade in breast cancer. Pathol Oncol Res 14: 157–160 - PubMed
    1. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR 2010. A method and server for predicting damaging missense mutations. Nat Methods 7: 248–249 - PMC - PubMed
    1. Ahmed AA, Etemadmoghadam D, Temple J, Lynch AG, Riad M, Sharma R, Stewart C, Fereday S, Caldas C, Defazio A, et al. 2010. Driver mutations in TP53 are ubiquitous in high grade serous carcinoma of the ovary. J Pathol 221: 49–56 - PMC - PubMed
    1. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE 2000. The Protein Data Bank. Nucleic Acids Res 28: 235–242 - PMC - PubMed
    1. The Cancer Genome Atlas Research Network 2011. Integrated genomic analyses of ovarian carcinoma. Nature 474: 609–615 - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources