Systematic Functional Annotation of Somatic Mutations in Cancer - PubMed (original) (raw)
. 2018 Mar 12;33(3):450-462.e10.
doi: 10.1016/j.ccell.2018.01.021.
Jun Li 2, Kang Jin Jeong 3, Shan Shao 1, Hu Chen 4, Yiu Huen Tsang 5, Sohini Sengupta 6, Zixing Wang 2, Venkata Hemanjani Bhavana 5, Richard Tran 1, Stephanie Soewito 1, Darlan Conterno Minussi 7, Daniela Moreno 5, Kathleen Kong 5, Turgut Dogruluk 5, Hengyu Lu 5, Jianjiong Gao 8, Collin Tokheim 9, Daniel Cui Zhou 6, Amber M Johnson 1, Jia Zeng 1, Carman Ka Man Ip 3, Zhenlin Ju 2, Matthew Wester 3, Shuangxing Yu 3, Yongsheng Li 3, Christopher P Vellano 3, Nikolaus Schultz 8, Rachel Karchin 10, Li Ding 11, Yiling Lu 3, Lydia Wai Ting Cheung 12, Ken Chen 2, Kenna R Shaw 1, Funda Meric-Bernstam 13, Kenneth L Scott 5, Song Yi 14, Nidhi Sahni 15, Han Liang 16, Gordon B Mills 3
Affiliations
- PMID: 29533785
- PMCID: PMC5926201
- DOI: 10.1016/j.ccell.2018.01.021
Systematic Functional Annotation of Somatic Mutations in Cancer
Patrick Kwok-Shing Ng et al. Cancer Cell. 2018.
Abstract
The functional impact of the vast majority of cancer somatic mutations remains unknown, representing a critical knowledge gap for implementing precision oncology. Here, we report the development of a moderate-throughput functional genomic platform consisting of efficient mutant generation, sensitive viability assays using two growth factor-dependent cell models, and functional proteomic profiling of signaling effects for select aberrations. We apply the platform to annotate >1,000 genomic aberrations, including gene amplifications, point mutations, indels, and gene fusions, potentially doubling the number of driver mutations characterized in clinically actionable genes. Further, the platform is sufficiently sensitive to identify weak drivers. Our data are accessible through a user-friendly, public data portal. Our study will facilitate biomarker discovery, prediction algorithm improvement, and drug development.
Keywords: TCGA; cellular assay; clinical marker; driver mutation; drug sensitivity; functional genomics; functional proteomics; therapeutic target.
Copyright © 2018 Elsevier Inc. All rights reserved.
Figures
Figure 1. Overview of the functional genomic platform and cancer mutations tested
(A) Mutations (muts), corresponding wild-type (WT) and fusion genes were selected from TCGA projects and MD Anderson Cancer Center patient databases. Clones were generated by the HiTMMoB approach, and tested in in vitro growth-factor dependent cell viability assays with Ba/F3 and MCF10A cell models. Mutations and wild-type variants were classified into functional categories based on these results. MCF10A cell lines stably expressing selected mutations were generated for reverse-phase protein array (RPPA) analysis. The numbers of mutant, wild-type and fusion constructs are annotated at each step. (B) Pie charts showing the proportions of the mutations annotated in OncoKB or Personalized Cancer Therapy (PCT) or PubMed literature among all the 1049 mutations tested. (C) Bar plots showing the literature coverage of mutations for the top 10 genes with the greatest number of mutations tested, as shown by the percentages of tested mutations per gene annotated in OncoKB or PCT or PubMed. See also Figure S1 and Table S1.
Figure 2. Functional annotation summary of wild-type genes and mutations
(A) The numbers of missense (purple), indel (orange), nonsense (red) and silent (white) mutations tested are shown in parentheses. (B) The distribution of mutation types tested per gene for the 21 genes with >10 mutations tested is shown. (C, D) The functional annotations for wild-type genes (C) and mutations (D) in Ba/F3 (blue) and MCF10A (red) cell line models are presented based on the growth-factor independent cell viability assay results. (E) The number of mutations in each functional annotation is shown in parentheses. Eleven mutations with inconclusive functional annotations in Ba/F3 and MCF10A models were excluded. See also Figure S2 and Table S2.
Figure 3. Comparison of our functional annotation with literature data and computational predictions
(A) Activating and neutral mutations from our (non-pooled) in vitro platform results were compared to oncogenic, likely oncogenic, and likely neutral mutations annotated from OncoKB. The percentage of mutations in each category is shown. Activating mutations were further classified into strong, moderate and weak based on the degree of activating comparing with the corresponding wild-type genes. Numbers on the bars indicate the mutation numbers in each group. (B) ROC curves of 21 commonly used computational algorithms based on the functional calls in this study, with AUC scores for the top 5 algorithms. (C) Enrichment of activating mutations in three 3D computational algorithms. Numbers on the bars indicate the mutation numbers in each group. ****, p < 10-5. See Figure also S3 and Table S3.
Figure 4. Functional proteomic profiling of select mutations in MCF10A
(A, B) A rank order plot showing the overall reverse-phase protein assay (RPPA) protein expression pattern of each BRAF mutation relative to _BRAF_L584F (A) or each EGFR mutation relative to EGFRG719D (B). Spearman rank correlations were calculated based on all the proteins profiled, and the mutants are color coded by their functional annotations. (C) RPPA unsupervised clustering analysis of 268 MCF10A cell lines stably overexpressing selected mutations was performed. Cluster names are annotated in the top row of the feature track. Gene names and functional calls are also presented in the feature track. Key differentially expressed proteins across clusters are highlighted on the right. (D) Differential cell cycle pathway activities among different clusters. (E) Differential PI3K/Akt pathway activities between activating and non-activating mutations in the PI3K cluster. (F) Differential EMT pathway activities between activating and non-activating mutations in the BRAF cluster. (D-F) The middle lines indicate median values, the top and bottom of the box indicate 25th and 75th percentiles, and whiskers indicate 10th and 90th percentiles. See also Figure S4 and Table S4.
Figure 5. Analysis of EGFR and BRAF mutation allelic series
(A) Functional annotations of EGFR (top) and BRAF (bottom) allelic series. Only recurrent mutations of the series are shown. The frequency (based on TCGA and GENIE databases) and location of mutations tested are shown in lollipop plots. In the heatmap (from top to bottom), the consensus functional annotation, OncoKB annotation, computational prediction by 3D structural cluster (HotSpot3D, HotMAPs), population-based (VEST) cancer-focus (CanDrA), Mutation Assessor and hotspot predictions (based on Chang et al., 2016) of mutations tested in this study are shown. (B) Structural clusters of activating mutations in EGFR (left) and BRAF (right). Filled color and border color of the mutation label indicate the OncoKB annotation and our consensus functional annotation, respectively. See also Figure S5.
Figure 6. Overview of FASMIC portal
(A) Data portal summary. (B) Mutation table of EGFR. (C) The 3D protein structure of p110α (encoded by PIK3CA) with residue K111 highlighted in red. (D) Bar plot of mutational frequency in different cancer types. (E) Functional predictions of various computational algorithms shown in a table with damaging mutations highlighted in dark red. (F) Differential protein expression profile of an EGFR mutant related to the wild-type gene is displayed in a sorted scatter plot.
Similar articles
- On the Road to Precision Cancer Medicine: Analysis of Genomic Biomarker Actionability in 439 Patients.
Schwaederle M, Daniels GA, Piccioni DE, Fanta PT, Schwab RB, Shimabukuro KA, Parker BA, Kurzrock R. Schwaederle M, et al. Mol Cancer Ther. 2015 Jun;14(6):1488-94. doi: 10.1158/1535-7163.MCT-14-1061. Epub 2015 Apr 7. Mol Cancer Ther. 2015. PMID: 25852059 - Knowledge base toward understanding actionable alterations and realizing precision oncology.
Takeuchi S, Okuda S. Takeuchi S, et al. Int J Clin Oncol. 2019 Feb;24(2):123-130. doi: 10.1007/s10147-018-1378-0. Epub 2018 Dec 12. Int J Clin Oncol. 2019. PMID: 30542800 Free PMC article. Review. - Next generation sequencing in cancer: opportunities and challenges for precision cancer medicine.
Paolillo C, Londin E, Fortina P. Paolillo C, et al. Scand J Clin Lab Invest Suppl. 2016;245:S84-91. doi: 10.1080/00365513.2016.1210331. Epub 2016 Aug 17. Scand J Clin Lab Invest Suppl. 2016. PMID: 27542004 - Genomic profiling in oncology clinical practice.
Rodríguez N, Viñal D, Rodríguez-Cobos J, De Castro J, Domínguez G. Rodríguez N, et al. Clin Transl Oncol. 2020 Sep;22(9):1430-1439. doi: 10.1007/s12094-020-02296-9. Epub 2020 Jan 24. Clin Transl Oncol. 2020. PMID: 31981077 Review. - Data resources for the identification and interpretation of actionable mutations by clinicians.
Prawira A, Pugh TJ, Stockley TL, Siu LL. Prawira A, et al. Ann Oncol. 2017 May 1;28(5):946-957. doi: 10.1093/annonc/mdx023. Ann Oncol. 2017. PMID: 28327901 Review.
Cited by
- Developmental mosaicism underlying EGFR-mutant lung cancer presenting with multiple primary tumors.
Burr R, Leshchiner I, Costantino CL, Blohmer M, Sundaresan T, Cha J, Seeger K, Guay S, Danysh BP, Gore I, Jacobs RA, Slowik K, Utro F, Rhrissorrakrai K, Levovitz C, Barth JL, Dubash T, Chirn B, Parida L, Sequist LV, Lennerz JK, Mino-Kenudson M, Maheswaran S, Naxerova K, Getz G, Haber DA. Burr R, et al. Nat Cancer. 2024 Oct 15. doi: 10.1038/s43018-024-00840-y. Online ahead of print. Nat Cancer. 2024. PMID: 39406916 - Machine learning optimized DriverDetect software for high precision prediction of deleterious mutations in human cancers.
Koh HYK, Lam UTF, Ban KH, Chen ES. Koh HYK, et al. Sci Rep. 2024 Sep 30;14(1):22618. doi: 10.1038/s41598-024-71422-2. Sci Rep. 2024. PMID: 39349509 Free PMC article. - Cell-Int: a cell-cell interaction assay to identify native membrane protein interactions.
Aymoz-Bressot T, Canis M, Meurisse F, Wijkhuisen A, Favier B, Mousseau G, Dupressoir A, Heidmann T, Bacquin A. Aymoz-Bressot T, et al. Life Sci Alliance. 2024 Sep 5;7(11):e202402844. doi: 10.26508/lsa.202402844. Print 2024 Nov. Life Sci Alliance. 2024. PMID: 39237366 Free PMC article. - A protein expression atlas on tissue samples and cell lines from cancer patients provides insights into tumor heterogeneity and dependencies.
Li J, Liu W, Mojumdar K, Kim H, Zhou Z, Ju Z, Kumar SV, Ng PK, Chen H, Davies MA, Lu Y, Akbani R, Mills GB, Liang H. Li J, et al. Nat Cancer. 2024 Sep 3. doi: 10.1038/s43018-024-00817-x. Online ahead of print. Nat Cancer. 2024. PMID: 39227745 - RAS-ON inhibition overcomes clinical resistance to KRAS G12C-OFF covalent blockade.
Nokin MJ, Mira A, Patrucco E, Ricciuti B, Cousin S, Soubeyran I, San José S, Peirone S, Caizzi L, Vietti Michelina S, Bourdon A, Wang X, Alvarez-Villanueva D, Martínez-Iniesta M, Vidal A, Rodrigues T, García-Macías C, Awad MM, Nadal E, Villanueva A, Italiano A, Cereda M, Santamaría D, Ambrogio C. Nokin MJ, et al. Nat Commun. 2024 Aug 30;15(1):7554. doi: 10.1038/s41467-024-51828-2. Nat Commun. 2024. PMID: 39215000 Free PMC article.
References
- Bustamante CD, Townsend JP, Hartl DL. Solvent accessibility and purifying selection within proteins of Escherichia coli and Salmonella enterica. Molecular biology and evolution. 2000;17:301–308. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- P30 CA016672/CA/NCI NIH HHS/United States
- U54 HG008100/HG/NHGRI NIH HHS/United States
- U24 CA210950/CA/NCI NIH HHS/United States
- P50 CA098258/CA/NCI NIH HHS/United States
- U01 CA168394/CA/NCI NIH HHS/United States
- U01 CA217842/CA/NCI NIH HHS/United States
- P30 CA008748/CA/NCI NIH HHS/United States
- F31 CA200266/CA/NCI NIH HHS/United States
- P50 CA070907/CA/NCI NIH HHS/United States
- R01 CA175486/CA/NCI NIH HHS/United States
- U01 CA176284/CA/NCI NIH HHS/United States
- U24 CA143883/CA/NCI NIH HHS/United States
- U24 CA209851/CA/NCI NIH HHS/United States
- R50 CA221675/CA/NCI NIH HHS/United States
- U24 CA204817/CA/NCI NIH HHS/United States
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous