Systematic investigation of cancer-associated somatic point mutations in SNP databases (original) (raw)

Nature Biotechnology volume 31, pages 787–789 (2013)Cite this article

Subjects

To the Editor:

Whole-genome and exome sequencing enables the discovery of druggable cancer-driver genes, most commonly through the analysis of somatic point mutations1. As a key step in detecting somatic point mutations, it is common practice to compare sequencing results against public single-nucleotide polymorphism (SNP) databases to help remove previously described variants that occur naturally in the human population. A problem with this approach is that public SNP databases include somatic mutations that may be cancer related, and indiscriminate filtering can often remove such SNPs, even though they may have relevance to disease biology. Here, we carry out a systematic investigation of somatic mutations in SNP databases and illustrate the importance of appropriate filtering. Finally, we propose an improved filtering workflow for the detection of cancer-related mutations.

This is a preview of subscription content, access via your institution

Access options

Subscribe to this journal

Receive 12 print issues and online access

$209.00 per year

only $17.42 per issue

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Additional access options:

Figure 1: Survey of SNP filtering approach and rescued mutations in two example articles.

References

  1. Shendure, J. & Aiden, E.L. Nat. Biotechnol. 30, 1084–1094 (2012).
    Article CAS Google Scholar
  2. Shah, S.P. et al. Nature 486, 395–399 (2012).
    Article CAS Google Scholar
  3. Sherry, S.T. et al. Nucleic Acids Res. 29, 308–311 (2001).
    Article CAS Google Scholar
  4. Abecasis, G.R. et al. Nature 491, 56–65 (2012).
    Article Google Scholar
  5. Hamosh, A., Scott, A.F., Amberger, J.S., Bocchini, C.A. & McKusick, V.A. Nucleic Acids Res. 33, D514–D517 (2005).
    Article CAS Google Scholar
  6. Forbes, S.A. et al. Nucleic Acids Res. 39, D945–D950 (2011).
    Article CAS Google Scholar
  7. Wang, N.J. et al. Proc. Natl. Acad. Sci. USA 108, 17761–17766 (2011).
    Article CAS Google Scholar
  8. Kumar, P., Henikoff, S. & Ng, P.C. Nat. Protoc. 4, 1073–1081 (2009).
    Article CAS Google Scholar
  9. Adzhubei, I.A. et al. Nat. Methods 7, 248–249 (2010).
    Article CAS Google Scholar
  10. Reva, B., Antipin, Y. & Sander, C. Nucleic Acids Res. 39, e118 (2011).
    Article CAS Google Scholar
  11. Pollard, K.S., Hubisz, M.J., Rosenbloom, K.R. & Siepel, A. Genome Res. 20, 110–121 (2010).
    Article CAS Google Scholar
  12. Petitjean, A. et al. Hum. Mutat. 28, 622–629 (2007).
    Article CAS Google Scholar
  13. Pugh, T.J. et al. Nat. Genet. 45, 279–284 (2013).
    Article CAS Google Scholar
  14. Barrett, J.H. et al. Nat. Genet. 43, 1108–1113 (2011).
    Article CAS Google Scholar
  15. Kim, H.S., Minna, J.D. & White, M.A. Cell 152, 387–389 (2013).
    Article CAS Google Scholar
  16. Fletcher, O. & Houlston, R.S. Nat. Rev. Cancer 10, 353–361 (2010).
    Article CAS Google Scholar
  17. Gui, Y. et al. Nat. Genet. 43, 875–878 (2011).
    Article CAS Google Scholar
  18. Kumar, A. et al. Proc. Natl. Acad. Sci. USA 108, 17087–17092 (2011).
    Article CAS Google Scholar
  19. Kompier, L.C. et al. PLoS ONE 5, e13821 (2010).
    Article Google Scholar
  20. Grasso, C.S. et al. Nature 487, 239–243 (2012).
    Article CAS Google Scholar

Download references

Acknowledgements

We thank Y.S. Ju and J.-I. Kim at Wellcome Trust Sanger Institute and Seoul National University for their personal comments regarding this manuscript, and W. Jun and J. Shendure at Beijing Genomics Institute (BGI) and the University of Washington for providing data and comments about data verification. This study was supported by a grant from the National Cancer Center, Republic of Korea (NCC-1110272 and NCC-1310190 to D.H.).

Author contributions D.H. and H.J. conceived of, planned and managed the project, and analyzed sequencing data. J.L. and H.J. developed the genome browser. J.L. assisted in the data analysis. D.H., H.J. and T.B. wrote the manuscript.

Author information

Authors and Affiliations

  1. Division of Convergence Technology, Cancer Genomics Branch, National Cancer Center, Gyeonggi-do, Korea
    HyunChul Jung, Jongkeun Lee & Dongwan Hong
  2. Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, California, USA
    HyunChul Jung
  3. College of Natural Sciences, Seoul National University Graduate School, Seoul, Korea
    Thomas Bleazard

Authors

  1. HyunChul Jung
    You can also search for this author inPubMed Google Scholar
  2. Thomas Bleazard
    You can also search for this author inPubMed Google Scholar
  3. Jongkeun Lee
    You can also search for this author inPubMed Google Scholar
  4. Dongwan Hong
    You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toDongwan Hong.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Jung, H., Bleazard, T., Lee, J. et al. Systematic investigation of cancer-associated somatic point mutations in SNP databases.Nat Biotechnol 31, 787–789 (2013). https://doi.org/10.1038/nbt.2681

Download citation