Defining the blueprint of the cancer genome (original) (raw)

Journal Article

Ludwig Center for Cancer Genetics and Therapeutics, The Johns Hopkins Kimmel Cancer Center, Baltimore, MD 21231, USA

* To whom correspondence should be addressed. Tel: +410 955 8878; Fax: +410 955 0548; Email: velculescu@jhmi.edu

Search for other works by this author on:

Revision received:

07 April 2008

Navbar Search Filter Mobile Enter search term Search

Abstract

It is widely accepted that cancer is a disease caused by accumulation of mutations in specific genes. These tumor-specific mutations provide clues to the cellular processes underlying tumorigenesis and have proven useful for diagnostic and therapeutic purposes. To date, however, only a small fraction of genes has been analyzed and the number and type of alterations responsible for the development of common tumor types are unknown. The determination of the human genome sequence coupled with improvements in sequencing and bioinformatic approaches have made it possible to examine the cancer cell genome in a comprehensive and unbiased manner. Systematic sequencing studies have been performed on gene families involved in signal transduction in several tumor types, and have now been extended to include the majority of protein-coding genes in breast and colorectal cancers. These analyses have identified new genes and pathways that had not been linked previously to human cancer. One example has been the discovery of genetic alterations in the PIK3CA gene encoding p110α phosphatidylinositol 3-kinase and in related pathway genes in >30% of colon and breast cancers. These mutational analyses provide a window into the genetic landscape of human cancer, indicate new targets for personalized diagnostic and therapeutic intervention, and suggest lessons for future large-scale genomic analyses in human tumors.

Introduction

Cancer research is poised for a transformation that will soon permit the comprehensive identification of genomic changes in any tumor type. In the past, identification of genes implicated in tumorigenesis was a long-term endeavor driven by the analysis of candidate genes in certain chromosomal regions, by clues from functional studies, or by linkage in families with hereditary syndromes ( 1 ). Though the results of such analyses represent the foundation of our current understanding of tumor initiation and progression, many molecular changes underlying human cancer remain to be discovered. Recent improvements in technologies for high-throughput sequencing and mutation detection together with the sequence of the human genome have now permitted rapid analyses of a large numbers of genes for somatic (i.e. tumor-specific) alterations ( 2 , 3 ).

Development of approaches for high-throughput DNA sequencing in human cancer

Several important advances have aided the development of high-throughput approaches for DNA sequencing and mutation detection in human cancer. The first has been the collection and isolation of high-quality tumor tissue for these analyses, either through generation of early passage tumor cell lines or through selective capture or microdissection of neoplastic tissue. This has permitted the sensitive detection of somatic mutations that would otherwise have been masked by contaminating normal tissue. The second advance has been the development of automated methods for large-scale sequence analysis of specific loci by polymerase chain reaction and Sanger sequencing. These methods have now been optimized to provide rapid and robust sequence analysis of nearly all exonic regions in the human genome ( 4 , 5 ). Finally, several methods for automated mutation detection have been developed and applied for analysis of somatic alterations in cancer ( 6 , 7 ). By direct comparison of sequence traces from tumor and normal tissues, these methods have allowed the sensitive identification of most types of somatic sequence alterations, including nucleotide substitutions, and small insertions, duplications and deletions.

Further improvements are likely to make these analyses even more facile in the future. These will include the use of next generation sequencing technologies that can potentially allow sequence analyses of entire human genomes through use of massively parallel short sequence reads ( 8 ). Currently, such approaches suffer from relatively high sequencing error rates, from the requirement for redundant analyses at each locus to ensure that both alleles have been accurately genotyped, and from the difficulty of assessing related regions in the genome using short sequences. Though these issues reduce the attractiveness of these methods for mutation detection, there are a variety of other applications, including analyses of expression and other epigenetic changes, that can be readily performed at this time ( 9 ). Given the pace at which sequencing technology has improved, it would be reasonable to expect that further progress in reducing error rates and increasing read lengths will ultimately lead to simpler and more sensitive methods of mutation detection in tumor DNA in the future.

Sequence analyses of gene families involved in signal transduction

The application of high-throughput sequencing methods for analysis of human cancer has already permitted analyses of increasingly larger numbers of genes for somatic mutations ( Table I ). These approaches were initially used to estimate the number of somatic alterations that one may expect to detect in a human cancer genome ( 10 ). Once a baseline for the number of background somatic changes in a tumor was thus established, efforts focused on analyses of groups of genes involved in signal transduction pathways, in particular protein kinases and phosphatases. The proteins encoded by such genes have been shown to play an important role in regulating cellular aspects related to tumorigenesis, including differentiation, cell cycle progression, apoptosis, motility and invasion ( 11 ). By virtue of their enzymatic activities, these genes were also attractive as they may be amenable to therapeutic intervention. Although a few kinases and phosphatase genes had been shown to be mutationally altered in specific human cancers ( 12 ), the involvement of the vast majority of these genes in neoplasia had not been explored.

Table I.

Large-scale mutation analyses in human cancer

Genes analyzed	Number of genes a	Tumor tissue analyzed	Mutated genes identified	Reference
Gene families
RAS-RAF pathway	4	Melanoma, others	BRAF	( 13 )
Tyrosine kinases	136	Colorectal	MLK4, others	( 6 )
Tyrosine phosphatases	87	Colorectal	PTPRT, others	( 15 )
Serine/theronine kinases	340	Colorectal	PI3K pathway genes	( 16 )
PI3Ks	16	Colorectal	PIK3CA	( 19 )
Tyrosine kinases	58	Lung	EGFR	( 25 , 26 )
Tyrosine kinases	85	Polycythemia vera	JAK2	( 30–33 )
Protein kinases	518	Lung, breast, others	ERBB2, others	( 34–37 , 40 )
Chromosomal instability	202	Colorectal	MRE11, cohesin genes, others	( 38 , 54 )
Genome-wide analyses
CCDS genes	13 023	Breast and colorectal	Hundreds of genes	( 4 )
RefSeq genes	18 191	Breast and colorectal	Hundreds of genes	( 5 )

Genes analyzed	Number of genes a	Tumor tissue analyzed	Mutated genes identified	Reference
Gene families
RAS-RAF pathway	4	Melanoma, others	BRAF	( 13 )
Tyrosine kinases	136	Colorectal	MLK4, others	( 6 )
Tyrosine phosphatases	87	Colorectal	PTPRT, others	( 15 )
Serine/theronine kinases	340	Colorectal	PI3K pathway genes	( 16 )
PI3Ks	16	Colorectal	PIK3CA	( 19 )
Tyrosine kinases	58	Lung	EGFR	( 25 , 26 )
Tyrosine kinases	85	Polycythemia vera	JAK2	( 30–33 )
Protein kinases	518	Lung, breast, others	ERBB2, others	( 34–37 , 40 )
Chromosomal instability	202	Colorectal	MRE11, cohesin genes, others	( 38 , 54 )
Genome-wide analyses
CCDS genes	13 023	Breast and colorectal	Hundreds of genes	( 4 )
RefSeq genes	18 191	Breast and colorectal	Hundreds of genes	( 5 )

CCD, consensus coding sequences.

Number of genes in the referenced studies. For the kinase genes analyzed in references ( 6 , 16 , 19 ), only the kinase domains were initially sequenced, whereas for those analyzed in references ( 25 , 30 ), only the activation loop was assessed. The analysis of RefSeq genes (ref. 5 ) includes the CCDS genes analyzed in reference ( 4 ).

Table I.

Large-scale mutation analyses in human cancer

Genes analyzed	Number of genes a	Tumor tissue analyzed	Mutated genes identified	Reference
Gene families
RAS-RAF pathway	4	Melanoma, others	BRAF	( 13 )
Tyrosine kinases	136	Colorectal	MLK4, others	( 6 )
Tyrosine phosphatases	87	Colorectal	PTPRT, others	( 15 )
Serine/theronine kinases	340	Colorectal	PI3K pathway genes	( 16 )
PI3Ks	16	Colorectal	PIK3CA	( 19 )
Tyrosine kinases	58	Lung	EGFR	( 25 , 26 )
Tyrosine kinases	85	Polycythemia vera	JAK2	( 30–33 )
Protein kinases	518	Lung, breast, others	ERBB2, others	( 34–37 , 40 )
Chromosomal instability	202	Colorectal	MRE11, cohesin genes, others	( 38 , 54 )
Genome-wide analyses
CCDS genes	13 023	Breast and colorectal	Hundreds of genes	( 4 )
RefSeq genes	18 191	Breast and colorectal	Hundreds of genes	( 5 )

Genes analyzed	Number of genes a	Tumor tissue analyzed	Mutated genes identified	Reference
Gene families
RAS-RAF pathway	4	Melanoma, others	BRAF	( 13 )
Tyrosine kinases	136	Colorectal	MLK4, others	( 6 )
Tyrosine phosphatases	87	Colorectal	PTPRT, others	( 15 )
Serine/theronine kinases	340	Colorectal	PI3K pathway genes	( 16 )
PI3Ks	16	Colorectal	PIK3CA	( 19 )
Tyrosine kinases	58	Lung	EGFR	( 25 , 26 )
Tyrosine kinases	85	Polycythemia vera	JAK2	( 30–33 )
Protein kinases	518	Lung, breast, others	ERBB2, others	( 34–37 , 40 )
Chromosomal instability	202	Colorectal	MRE11, cohesin genes, others	( 38 , 54 )
Genome-wide analyses
CCDS genes	13 023	Breast and colorectal	Hundreds of genes	( 4 )
RefSeq genes	18 191	Breast and colorectal	Hundreds of genes	( 5 )

CCD, consensus coding sequences.

In one of the first examples of these types of analyses, investigators at the Sanger Center examined genes in the RAS–RAF pathway for genetic alterations in a variety of tumor types. This analysis identified a high frequency of mutations in the v-raf murine sarcoma viral oncogene homolog protein kinase gene in melanomas and to a lesser degree in other tumors ( 13 ). This discovery was surprising because this pathway was well known and had been extensively characterized at the biochemical level. Mutations in v-rat murine sarcoma viral oncogene homolog were shown to be mutually exclusive with alterations in the Kirsten rat sarcoma viral oncogene homolog (KRAS) providing genetic evidence that these genes operated in the same pathway in human tumors and suggesting that mutation in either was sufficient to activate downstream signaling ( 13 , 14 ).

Using a sequencing-based approach, our group at Johns Hopkins University performed a series of mutational analyses of gene families encoding protein kinases and phosphatases in human colorectal cancers ( 6 , 15 , 16 ). To determine whether these genes were genetically altered, Hidden Markov models and previous reports in the literature were used to identify the genes containing kinase and phosphate domains in the human genome, and these were then directly analyzed by sequence analysis of tumor DNA. From the sequence information obtained, seven tyrosine kinases, eight serine/threonine kinases and six tyrosine phosphatases were identified that contained somatic mutations. In aggregate, these mutated genes affected a substantial fraction of the colorectal cancers analyzed. For example, of the tyrosine phosphatase genes identified, protein tyrosine phosphatase receptor type T was shown to be altered in >10% of colorectal cancers and was also mutated in lung and gastric cancers ( 15 ). Some of the alterations in protein tyrosine phosphatase receptor type T were predicted to result in truncated mutant proteins lacking the phosphatase domain, while missense mutations were shown to lead to reduced phosphatase activity. Recent analyses have identified STAT3 as being a substrate of protein tyrosine phosphatase receptor type T and suggest that dysregulation of this pathway may be an important feature of many colorectal tumors ( 17 ). Despite years of research on protein kinases and phosphatase genes, few of the genes identified had been linked previously to human cancer and pointed to new pathways that were involved in tumor development.

A similar yet smaller approach was undertaken for analysis of the phosphatidylinositol 3-kinase (PI3K) genes, a family of lipid kinases that mediate pathways important for proliferation, adhesion, survival and motility ( 18 ). To evaluate whether PI3Ks may be genetically implicated in tumorigenesis, sequence based analyses were used to identify 16 PI3K genes in the human genome. These were examined for sequence alterations in their kinase domains in a panel of colorectal cancers ( 19 ). PIK3CA , encoding the p110α catalytic subunit, was the only gene identified with somatic mutations affecting 32% of colorectal tumors examined. Analysis of PIK3CA in other tumor types identified somatic mutations in a smaller fraction of breast, brain, gastric and lung cancers. In subsequent studies, PIK3CA was shown to be altered in 36% of hepatocellular carcinomas, 36% of endometrial carcinomas, 25% of breast carcinomas, 15% of anaplastic oligodendrogliomas and 5% of medulloblastomas and anaplastic astrocytomas ( 20–23 ). Analysis of other members of the PI3K pathway has shown that a number of additional genes within the PI3K pathway are altered in colorectal, breast and other tumor types ( 16 , 18 , 24 ). In most cases, the mutations in these genes appeared to be mutually exclusive, suggesting that alterations in any one gene were sufficient to drive tumorigenesis. In colorectal cancers, >40% of tumors had alterations in one of eight PI3K pathway genes, demonstrating the importance of this pathway in colorectal cancer pathogenesis.

Additional screens for somatic alterations in human tumors have identified mutations in the epidermal growth factor receptor (EGFR) in a small fraction of lung cancers and have linked mutations in this gene with increased sensitivity to EGFR inhibitors such as gefitinib (Iressa) and erlotinib (Tarceva) ( 25 , 26 ). Although the frequency of these alterations was low, these observations were important to the field as they substantiated the hypothesis that tumor cells were dependent on specific pathways for continued proliferation ( 27 , 28 ) and provided fresh impetus for the development of novel therapeutic agents against EGFR and other protein kinases. Sequence analysis of the tyrosine kinase genes in other neoplasms have identified mutations in janus kinase 2 in polycythemia vera and in other myeloproliferative disorders ( 29–33 ). Interestingly, the mutations in janus kinase 2 mostly occurred at a single residue, V617F, providing a potentially facile and sensitive means of detecting alterations in individuals with these disorders. Additional analyses of protein kinases in lung, breast and other tumor types have revealed mutations of the HER-2/neu receptor (ERBB2) in a fraction of lung cancers, have identified a subset of breast tumors with an unusually high mutator phenotype and have shown that certain tumors, such as testicular germ cell tumors, have a very low prevalence of somatic mutations ( 34–37 ). Finally, systematic analyses have been performed on genes that may be involved in chromosomal instability, as this phenotype is an underlying characteristic of the vast majority of human cancers. These analyses identified somatic alterations in the DNA repair gene MRE11 and in a number of genes involved in the cohesin complex that are thought to be important for sister chromatid cohesion and accurate chromosome segregation ( 38 , 54 ).

Sequence analyses of cancer genomes

Taken together, the unexpected discoveries from these systematic analyses of gene families involved in signaling pathways provided a clear rationale for expanding these studies to examine the remaining genes in the human genome. Such analyses could reveal additional genes in known pathways that are significantly affected by genetic alterations as well as identify genes that may be pointing to entirely different cellular processes. To achieve such goals, we recently undertook an effort to sequence a large fraction of the protein-coding genes in the human genome in breast and colorectal cancers. We initially focused on a set of ∼13 000 genes comprising the consensus coding sequences as these represented the most highly curated gene set available ( 4 ). We have recently extended these analyses to include the remaining ∼5100 genes in the Reference Sequence database ( 5 ). The goals of these studies were to provide a methodological strategy that would allow genome-wide mutational analyses in human tumors, to identify the spectrum and extent of somatic mutations in human tumors and to identify new genes and molecular pathways that were important in these tumors.

From the combination of these two studies, a total of ∼200,000 coding genomic regions were analyzed in 11 samples of each tumor type (breast and colorectal carcinomas). Over 4 million polymerase chain reaction products were generated and directly sequenced, resulting in nearly 660 million bp of tumor sequence. Examination of sequence traces from these amplicons revealed over a million putative nucleotide changes. These changes could represent germ line variants, artifacts of polymerase chain reaction or sequencing, or bona fide somatic mutations. A variety of bioinformatic and experimental steps were employed to distinguish among these possibilities. The combination of these steps removed >99% of the potential alterations, resulting in 2185 confirmed somatic mutations in 1885 genes.

The great majority of the mutations observed were single-base substitutions. Though the fraction of these was similar in breast and colorectal cancers, the spectrum and nucleotide contexts of mutations were very different between the two tumor types. The most dramatic of these differences occurred at C:G base pairs (many of which were at 5′-CpG-3′ dinucleotide sites). Over half of the colorectal cancer mutations were C:G to T:A transitions, whereas <10% were C:G to G:C transversions. In breast cancers, however, only 35% of the mutations were C:G to T:A transitions, whereas 29% were C:G to G:C transversions. In contrast, mutations occurring at 5′-TpC-3′ sites (or complementary 5′-GpA-3′ sites) comprised nearly a third of alterations in breast cancers but a much smaller fraction in colorectal cancers. These observations have important implications for processes of carcinogenesis and suggest that the mechanisms underlying mutagenesis and repair in the two tumor types are probably different. These conclusions have been extended by analysis of somatic alterations in other tumor types and suggest that additional spectra of mutations may affect tumors derived from other tissues ( 39 ).

Somatic mutations in human tumors can arise either through selection of functionally important alterations via their effect on net cell growth or through accumulation of non-functional ‘passenger’ alterations that arise during repeated rounds of cell division in the tumor or in its progenitor stem cell. To distinguish between these possibilities, several statistical approaches have been developed to estimate the probability that the number of mutations in a given gene reflects a mutation frequency that is greater than expected from the background mutation rate ( 5 , 40 ). In general, these analyses incorporate the number of somatic alterations observed, the number of tumors studied, the number of nucleotides that were successfully analyzed and the nucleotide type and context of each mutation. We used such approaches to identify those genes in our genome-wide study that were most likely to have been selected during tumorigenesis. Over 200 such candidate genes were discovered in breast and colorectal tumors. The genes we identified that were previously known to be somatically mutated in human cancers represented the vast majority of genes that are thought to be important in these two tumor types, thereby providing an important validation of such unbiased approaches in genetic analyses of neoplasia.

This study also revealed a substantial number of genes that had not been suspected previously to be involved in cancer. The potential roles of these genes has been analyzed by their annotation in various functional databases, including Gene Ontology, kyoto encyclopedia of genes and genomes and GeneGo databases, or through previously published literature ( 4 , 5 , 41 , 55 ). Several of the groups identified in this way were of special interest, as a substantial fraction of the genes were transcriptional regulators, cell adhesion molecules and members of signal transduction pathways. At least one member of each of these gene groups was mutated in >70% of the tumors of each type. Subsets of these groups were also of interest and included metalloproteinases, ephrin receptors and G proteins and their regulators. Interestingly, additional members of the PI3K pathway were also identified that had not been detected previously. These data suggest that dysregulation of specific cellular processes are genetically selected during neoplasia and that distinct members of each group may serve similar roles in different tumors.

The observations from these genome-wide mutational analyses suggest that breast and colorectal cancers display a large degree of complexity at the genetic level. This is reflected by the fact that individual tumors harbor ∼80 non-silent mutations in the coding regions of different genes and that 15–20 of these are probably to be causally implicated in human cancer. Although a handful of these genes are mutated in a high fraction of tumors, the vast majority are mutated at relatively low frequencies and are different among tumors. This genomic landscape of a few highly mutated genes among a large number of less frequently mutated genes is a feature of both breast and colorectal cancers and is probably a defining characteristic of other solid cancers. This picture of genomic complexity intuitively suggests that the highly mutated genes provide a large selective advantage to the mutated cell, whereas the genes with low frequency mutations provide only a modest advantage. Mathematical modeling of tumor progression is consistent with this notion and shows that even small degrees of fitness advantage are sufficient for such mutations to be selected during tumorigenesis ( 42 ).

Diagnostic and therapeutic implications

A consequence of the genetic heterogeneity that we observed from these large-scale studies is that individual tumors are different with respect to the mutations that they contain within their genomes. These differences may in part be responsible for the clinical diversity that tumors display in development, response to therapy and clinical outcome. Taking advantage of these alterations will be challenging because of the multitude of changes observed and because of the current lack of understanding of the role of many of the mutations in the pathogenesis of the disease.

Nevertheless, some of the observed mutations may be useful for therapeutic targeting. It has been proposed that mutations in tumor cells may lead to ‘oncogenic addiction’ ( 27 , 28 ). This hypothesis suggests that tumor cells are addicted or dependent on mutated genes or pathways, and that inhibition of these can result in cellular arrest or death. This hypothesis has been supported by successful examples in the clinic, including use of imatinib (Gleevec) to inhibit BCR-ABL in patients with chronic myeloid leukemia and use of gefitinib (Iressa) and erlotinib (Tarceva) in patients with lung tumors containing EGFR mutations. Moreover, genetic and cellular analyses of two commonly mutated oncogenes, kirsten rat sarcoma viral oncogene homolog and PIK3CA, suggest that tumor cells depend on the activity of these mutant genes for continued cellular proliferation and that their disruption reduces the neoplastic potential of these cells ( 43 , 44 ).

A potential target identified through systematic genomic studies that may be amenable to therapeutic intervention is PIK3CA. The positions of the mutations within this gene immediately suggested that they were likely to increase kinase activity. Over 75% of alterations occurred in two small clusters in evolutionarily conserved regions of the helical and kinase domains. This clustering of somatic missense mutations in specific domains was similar to that observed for activating mutations in other oncogenes, such as kirsten rat sarcoma viral oncogene homolog, v-rat murine sarcoma viral oncogene homolog and tyrosine kinases. A number of studies have now shown that these alterations increase kinase activity compared with the wild-type protein and are oncogenic ( 19 , 44–46 ). To examine the function of mutated PIK3CA in human cancer cells, we have used homologous recombination to disrupt the PIK3CA locus, thereby generating isogenic cancer cell lines containing either the wild-type or the mutant version of PIK3CA ( 44 ). These studies show that mutant PIK3CA appears to be important for cell growth and invasion both in vitro and in vivo . Treatment of these cells with PI3K inhibitors have shown that those with mutant PIK3CA were preferentially inhibited, and suggested that PIK3CA may be a useful target in tumors with mutations in this gene. The combination of these genetic and functional properties of PIK3CA have spurred the development of PI3K inhibitors, leading to at least several compounds that are in phase I clinical trials ( 47 ). Additionally, as most of the genes altered in the PI3K pathway encode protein kinases, the encoded proteins of these genes could also serve as potential therapeutic targets. Targeting of the proteins that act downstream in the PI3K pathway may be effective in treating the larger fraction of tumors containing mutations in PIK3CA, in the phosphatase and tensin homolog, or in other components.

Another potential modality of therapeutic intervention that takes advantage of the large number of alterations observed in human tumors is based on immune recognition of novel mutant epitopes. In silico analyses of the tumors analyzed in our genome-wide study suggest that each accumulated on average 7–10 unique mutant peptides that could be presented as major histocompatibility complex, class IA HLA-A*0201 epitopes ( 48 ). As tumor cells have potentially six distinct HLA class I molecules, the average number of presented mutant epitopes may actually be closer to ∼60. If these predictions are confirmed experimentally, they suggest the possibility for the development of immunologic-based approaches for treatment cancers that would be broadly applicable yet highly specific to individual tumors.

These mutation data can also be useful for early detection of cancer. Diagnostic avenues may ultimately be more valuable than development of new therapies, as most tumors can be cured if they are detected and surgically excised at an early stage. Mutated tumor DNA, released from the primary tumor or from circulating tumor cells, can be used as markers of tumorigenesis that can be detected in the blood or other bodily fluids ( 49 , 50 ). Detection of such mutations would be highly specific for tumors as they would not be expected to be present in normal tissues at appreciable levels. The challenge of using mutation-based markers is that such methods need to be able to detect small numbers of mutant DNA molecules in the context of much larger amounts of normal DNA. Techniques have already been devised for fecal DNA mutation detection that permit screening of colorectal cancers with sensitivities nearing those of colonoscopies ( 51 , 52 ). The sensitivity of such approaches would be expected to increase with the inclusion of additional mutated genes within these assays. Though the use of a large number of gene mutations would currently make such tests expensive, it is likely that new next generation sequencing technologies will permit these approaches to be accessible in the future.

Lessons for the future

Several general lessons have emerged from these genome-wide mutational analyses. The first is that a relatively large number of previously uncharacterized mutated genes exist in human cancers and that these genes can be discovered by unbiased genomic approaches. These results support the prediction that large-scale mutational analyses of other tumor types will prove useful for identifying genes not currently known to be linked to tumorigenesis. Along these lines, The Cancer Genome Atlas Project has recently been initiated to provide mutational analyses of a large number of genes in brain, lung and ovarian cancers ( 53 ). Even larger international cancer genome sequencing efforts may arise to tackle these issues in other tumor types. Second, our results suggest that the number of mutational events occurring during the evolution of human tumors from a benign to a metastatic state is much larger than previously imagined and that breast and colorectal cancers show substantial differences in their mutation spectra and in the genes that are mutated. These data show that tumors have a substantial heterogeneity at the genetic level, and that the observed alterations may be reflected in the biologic and clinical differences between breast and colorectal tumors and between individual patients within the same tumor type. Finally, it appears that a substantial number of mutated genes may participate in common biologic groups or molecular pathways, thereby reducing the apparent genetic complexity. The enrichment of mutations in novel pathways will probably be facilitated in the future by unbiased genome-wide mutational analyses.

It is clear that the studies performed to date represent only an initial foray into the determination of the genetic understanding of human cancer. Additional analyses using complementary approaches, including those assessing copy number alterations, translocations and epigenetic modifications, in combination with genomic sequencing, will provide a more complete picture of the compendium of genetic alterations in human cancer. Improvements in next generation sequencing technologies may allow for more rapid analyses that provide insight into most of these genomic alterations simultaneously. Ideally, these data should be combined with information regarding patient prognosis, response to therapy and outcome in order to identify those changes that have important clinical implications. These results will undoubtedly stimulate widespread efforts to understand the functional effects of the observed genetic alterations and to develop targeted therapeutic and diagnostic approaches against cancers containing these changes. Such large-scale studies may ultimately allow us to envisage a future where patients would receive cancer diagnoses through non-invasive DNA analyses and based on the alterations identified would receive personalized therapies designed to be effective against the combination of mutations present in their tumors. Though such a scenario may not be immediately realizable, the systematic genomic studies and the targeted therapies already described provide a road map for the long-term management of human cancer.

Funding

Virginia and D.K. Ludwig Fund for Cancer Research; National Institutes of Health (CA121113, CA 43460, CA 57345); National Cancer Institute, Division of Cancer Prevention (HHSN261200433002C); Dr. Miriam and Sheldon G. Adelson Medical Research Foundation; Pew Charitable Trusts.

Abbreviations

EGFR
epidermal growth factor receptor
PI3K
phosphatidylinositol 3-kinase

Conflict of Interest Statement: None declared.

References

, et al.

Cancer genes and the pathways they control

Nat. Med.

2004

, vol.

(pg.

789

799

)

, et al.

A census of human cancer genes

Nat. Rev. Cancer

2004

, vol.

(pg.

177

183

)

, et al.

Mutational analysis of gene families in human cancer

Curr. Opin. Genet. Dev.

2005

, vol.

(pg.

)

, et al.

The consensus coding sequences of human breast and colorectal cancers

Science

2006

, vol.

314

(pg.

268

274

)

, et al.

The genomic landscapes of human breast and colorectal cancers

Science

2007

, vol.

318

(pg.

1108

1113

)

, et al.

Mutational analysis of the tyrosine kinome in colorectal cancers

Science

2003

, vol.

300

pg.

949

, et al.

Automating sequence-based detection and genotyping of SNPs from diploid samples

Nat. Genet.

2006

, vol.

(pg.

375

381

)

, et al.

Genomics: massively parallel sequencing

Nature

2005

, vol.

437

(pg.

326

327

)

, et al.

Gene expression analysis goes digital

Nat. Biotechnol.

2007

, vol.

(pg.

878

880

)

, et al.

Prevalence of somatic alterations in the colorectal cancer cell genome

Proc. Natl Acad. Sci. USA

2002

, vol.

(pg.

3076

3080

)

The Croonian Lecture 1997. The phosphorylation of proteins on tyrosine: its role in cell growth and disease

Philos. Trans. R. Soc. Lond. B Biol. Sci.

1998

, vol.

353

(pg.

583

605

)

, et al.

Oncogenic kinase signalling

Nature

2001

, vol.

411

(pg.

355

365

)

, et al.

Mutations of the BRAF gene in human cancer

Nature

2002

, vol.

417

(pg.

949

954

)

, et al.

Tumorigenesis: RAF/RAS oncogenes and mismatch-repair status

Nature

2002

, vol.

418

pg.

934

, et al.

Mutational analysis of the tyrosine phosphatome in colorectal cancers

Science

2004

, vol.

304

(pg.

1164

1166

)

, et al.

Colorectal cancer: mutations in a signalling pathway

Nature

2005

, vol.

436

pg.

792

, et al.

Identification of STAT3 as a substrate of receptor protein tyrosine phosphatase T

Proc. Natl Acad. Sci. USA

2007

, vol.

104

(pg.

4060

4064

)

, et al.

The evolution of phosphatidylinositol 3-kinases as regulators of growth and metabolism

Nat. Rev. Genet.

2006

, vol.

(pg.

606

619

)

, et al.

High frequency of mutations of the PIK3CA gene in human cancers

Science

2004

, vol.

304

pg.

554

, et al.

The PIK3CA gene is mutated with high frequency in human breast cancers

Cancer Biol. Ther.

2004

, vol.

(pg.

772

775

)

, et al.

Mutations of PIK3CA in anaplastic oligodendrogliomas, high-grade astrocytomas, and medulloblastomas

Cancer Res.

2004

, vol.

(pg.

5048

5050

)

, et al.

PIK3CA gene is frequently mutated in breast carcinomas and hepatocellular carcinomas

Oncogene

2005

, vol.

(pg.

1477

1480

)

, et al.

High frequency of coexistent mutations of PIK3CA and PTEN genes in endometrial carcinoma

Cancer Res.

2005

, vol.

(pg.

10669

10673

)

, et al.

PIK3CA mutations correlate with hormone receptors, node metastasis, and ERBB2, and are mutually exclusive with PTEN loss in human breast carcinoma

Cancer Res.

2005

, vol.

(pg.

2554

2559

)

, et al.

EGFR mutations in lung cancer: correlation with clinical response to gefitinib therapy

Science

2004

, vol.

304

(pg.

1497

1500

)

, et al.

Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib

N. Engl. J. Med.

2004

, vol.

350

(pg.

2129

2139

)

Cancer. Addiction to oncogenes–the Achilles heal of cancer

Science

2002

, vol.

297

(pg.

)

, et al.

Oncogenes come of age

Cold Spring Harb. Symp. Quant. Biol.

2005

, vol.

(pg.

)

, et al.

Role of JAK2 in the pathogenesis and therapy of myeloproliferative disorders

Nat. Rev. Cancer

2007

, vol.

(pg.

673

683

)

, et al.

Activating mutation in the tyrosine kinase JAK2 in polycythemia vera, essential thrombocythemia, and myeloid metaplasia with myelofibrosis

Cancer Cell

2005

, vol.

(pg.

387

397

)

, et al.

A unique clonal JAK2 mutation leading to constitutive signalling causes polycythaemia vera

Nature

2005

, vol.

434

(pg.

1144

1148

)

, et al.

Acquired mutation of the tyrosine kinase JAK2 in human myeloproliferative disorders

Lancet

2005

, vol.

365

(pg.

1054

1061

)

, et al.

A gain-of-function mutation of JAK2 in myeloproliferative disorders

N. Engl. J. Med.

2005

, vol.

352

(pg.

1779

1790

)

, et al.

Lung cancer: intragenic ERBB2 kinase mutations in tumours

Nature

2004

, vol.

431

(pg.

525

526

)

, et al.

Somatic mutations of the protein kinase gene family in human lung cancer

Cancer Res.

2005

, vol.

(pg.

7591

7595

)

, et al.

A screen of the complete protein kinase gene family identifies diverse patterns of somatic mutations in human breast cancer

Nat. Genet.

2005

, vol.

(pg.

590

592

)

, et al.

Sequence analysis of the protein kinase gene family in human testicular germ-cell tumors of adolescents and adults

Genes Chromosomes Cancer

2006

, vol.

(pg.

)

, et al.

Three classes of genes mutated in colorectal cancers with chromosomal instability

Cancer Res.

2004

, vol.

(pg.

2998

3001

)

, et al.

Patterns of somatic mutation in human cancer genomes

Nature

2007

, vol.

446

(pg.

153

158

)

, et al.

Statistical analysis of pathogenicity of somatic mutations in cancer

Genetics

2006

, vol.

173

(pg.

2187

2198

)

, et al.

A multidimensional analysis of genes mutated in breast and colorectal cancers

Genome Res.

2007

, vol.

(pg.

1304

1318

)

, et al.

Genetic progression and the waiting time to cancer

PLoS Comput. Biol.

2007

in press

, et al.

Altered growth of human colon cancer cell lines disrupted at activated Ki-ras

Science

1993

, vol.

260

(pg.

)

, et al.

Mutant PIK3CA promotes cell growth and invasion of human cancer cells

Cancer Cell

2005

, vol.

(pg.

561

573

)

, et al.

Cancer-specific mutations in PIK3CA are oncogenic in vivo

Proc. Natl Acad. Sci. USA

2006

, vol.

103

(pg.

1475

1479

)

, et al.

Phosphatidylinositol 3-kinase mutations identified in human cancer are oncogenic

Proc. Natl Acad. Sci. USA

2005

, vol.

102

(pg.

802

807

)

, et al.

Class IA phosphatidylinositol 3-kinase: from their biologic implication in human cancers to drug discovery

Expert Opin. Ther. Targets

2008

, vol.

(pg.

223

238

)

, et al.

Epitope landscape in breast and colorectal cancer

Cancer Res.

2008

, vol.

(pg.

889

892

)

, et al.

Digital quantification of mutant DNA in cancer patients

Curr. Opin. Oncol.

2007

, vol.

(pg.

)

, et al.

Circulating nucleic acids in plasma or serum (CNAPS) as prognostic and predictive markers in patients with solid neoplasias

Dis. Markers

2005

, vol.

(pg.

105

120

)

, et al.

Detection of APC mutations in fecal DNA from patients with colorectal tumors

N. Engl. J. Med.

2002

, vol.

346

(pg.

311

320

)

, et al.

Fecal DNA versus fecal occult blood for colorectal-cancer screening in an average-risk population

N. Engl. J. Med.

2004

, vol.

351

(pg.

2704

2714

)

, et al.

Mapping the cancer genome. Pinpointing the genes involved in cancer will help chart a new course across the complex landscape of human malignancies

Sci. Am.

2007

, vol.

296

(pg.

)

, et al.

Chromatid cohesion defects may underlie chromosome instability in human colorectal cancers

Proc. Natl. Acad. Sci. USA

2008

, vol.

105

(pg.

3443

3448

)

, et al.

Functional classification analysis of somatically mutated genes in human breast and colorectal cancers

Genomics

2008

Apr 21. (epub ahead of print)

Topic:

Citations

Views

Altmetric

Metrics

Total Views 777

467 Pageviews

310 PDF Downloads

Since 12/1/2016

Month:	Total Views:
December 2016	1
March 2017	5
April 2017	1
May 2017	3
June 2017	2
July 2017	1
August 2017	1
September 2017	2
October 2017	5
November 2017	3
December 2017	17
January 2018	11
February 2018	11
March 2018	14
April 2018	8
May 2018	7
June 2018	8
July 2018	13
August 2018	5
September 2018	2
October 2018	9
November 2018	5
December 2018	3
January 2019	6
February 2019	8
March 2019	8
April 2019	16
May 2019	9
June 2019	4
July 2019	10
August 2019	15
September 2019	14
October 2019	8
November 2019	13
December 2019	7
January 2020	15
February 2020	4
March 2020	7
April 2020	7
May 2020	4
June 2020	5
July 2020	7
August 2020	4
September 2020	19
October 2020	3
November 2020	2
December 2020	9
January 2021	3
February 2021	4
March 2021	11
April 2021	3
May 2021	6
June 2021	2
July 2021	8
August 2021	13
September 2021	7
October 2021	11
November 2021	4
December 2021	9
January 2022	9
February 2022	6
March 2022	6
April 2022	16
May 2022	20
June 2022	6
July 2022	12
August 2022	8
September 2022	16
October 2022	11
November 2022	1
December 2022	4
January 2023	8
February 2023	5
March 2023	9
April 2023	4
May 2023	3
June 2023	15
July 2023	12
August 2023	11
September 2023	11
October 2023	8
November 2023	11
December 2023	6
January 2024	17
February 2024	16
March 2024	14
April 2024	26
May 2024	17
June 2024	8
July 2024	18
August 2024	18
September 2024	3

Citations

52 Web of Science

Defining the blueprint of the cancer genome (original) (raw)

Abstract

Introduction

Development of approaches for high-throughput DNA sequencing in human cancer

Sequence analyses of gene families involved in signal transduction

Sequence analyses of cancer genomes

Diagnostic and therapeutic implications

Lessons for the future

Funding

Abbreviations

References

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Cited

Defining the blueprint of the cancer genome (original) (raw)

Abstract

Introduction

Development of approaches for high-throughput DNA sequencing in human cancer

Sequence analyses of gene families involved in signal transduction

Sequence analyses of cancer genomes

Diagnostic and therapeutic implications

Lessons for the future

Funding

Abbreviations

References

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited