U87MG decoded: the genomic sequence of a cytogenetically aberrant human cancer cell line - PubMed (original) (raw)

U87MG decoded: the genomic sequence of a cytogenetically aberrant human cancer cell line

Michael James Clark et al. PLoS Genet. 2010.

Erratum in

Abstract

U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30x genomic sequence coverage using a novel 50-base mate paired strategy with a 1.4kb mean insert library. A total of 1,014,984,286 mate-end and 120,691,623 single-end two-base encoded reads were generated from five slides. All data were aligned using a custom designed tool called BFAST, allowing optimal color space read alignment and accurate identification of DNA variants. The aligned sequence reads and mate-pair information identified 35 interchromosomal translocation events, 1,315 structural variations (>100 bp), 191,743 small (<21 bp) insertions and deletions (indels), and 2,384,470 single nucleotide variations (SNVs). Among these observations, the known homozygous mutation in PTEN was robustly identified, and genes involved in cell adhesion were overrepresented in the mutated gene list. Data were compared to 219,187 heterozygous single nucleotide polymorphisms assayed by Illumina 1M Duo genotyping array to assess accuracy: 93.83% of all SNPs were reliably detected at filtering thresholds that yield greater than 99.99% sequence accuracy. Protein coding sequences were disrupted predominantly in this cancer cell line due to small indels, large deletions, and translocations. In total, 512 genes were homozygously mutated, including 154 by SNVs, 178 by small indels, 145 by large microdeletions, and 35 by interchromosomal translocations to reveal a highly mutated cell line genome. Of the small homozygously mutated variants, 8 SNVs and 99 indels were novel events not present in dbSNP. These data demonstrate that routine generation of broad cancer genome sequence is possible outside of genome centers. The sequence analysis of U87MG provides an unparalleled level of mutational resolution compared to any cell line to date.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Figure 1

Figure 1. SNP chip and base sequence coverage compared.

Circos was used to plot base sequence coverage from ABI sequencing as a histogram (the outermost plot, green and black) alongside dot plots of the LogR-ratio (middle plot, blue) and B-allele frequency (innermost plot, red) from the Illumina Human 1M Duo BeadChip microarray. LogR-ratio represents copy number (CN) and B-allele frequency represents loss of heterozygosity (LOH).

Figure 2

Figure 2. Small insertion/deletion size distribution.

(A) Distribution of small deletion sizes as a percent of total, comparing amino-acid encoding deletions (blue) with non-coding deletions (red). (B) Distribution of small insertion sizes as a percent of total, comparing amino-acid encoding insertions (blue) with non-coding insertions (red).

Figure 3

Figure 3. Base substitution frequencies.

Ratio of specific nucleotide substitutions as a percent of total single nucleotide variants, comparing SNVs in coding regions (blue) to SNVs genome-wide (red).

Figure 4

Figure 4. Individual genome comparison.

(A) Venn diagram showing overlap in SNVs among the U87 genome, the Watson genome, and dbSNP 129. (B) Venn diagram showing overlap in SNVs among the U87 genome, the YanHuang genome, and dbSNP 129.

Figure 5

Figure 5. Structural variations in U87MG.

Structural variations detected by whole genome sequencing in the U87MG genome are plotted in the Circos program. Orange lines linking two chromosomes represent the 35 interchromosomal translocations. Blue lines around the edge of the circle represent microdeletions and intrachromosomal translocations. The outermost histogram represents sequence coverage and demonstrates how the boundaries of changes in coverage typically coincide with a significant structural variation.

Figure 6

Figure 6. Reads spanning interchromosomal translocation breakpoints.

Two genomic breakpoint events are highlighted between chromosomes 2 and 16. The outer ring represents the chromosomes displaying tick marks every 100 bases. The green plot shows base-coverage for each position. Each orange line represents a single mate-pair as a link between one end of a read and its mate-pair. Between the breakpoints on each chromosome (chr2:56792000–56953300 and chr16:8826200–8826700), base coverage drops to about half of what it is on the other side of the event, from two to one copy. This suggests an interchromosomal translocation between chromosomes 2 and 16 resulting in a loss of the genomic material between the translocation breakpoints.

Figure 7

Figure 7. Increased resolution of structural variations by sequencing.

Resolution of karyotyping and SKY approaches is not high enough to see the complex nature of this translocation event between chr1 and chr16. With high-resolution whole-genome sequencing, the true structure of the translocation is revealed as mutual translocations between a small fragment of chr2 with chr1 and chr16 on either end.

Similar articles

Cited by

References

    1. Lee Y, Scheck AC, Cloughesy TF, Lai A, Dong J, et al. Gene expression analysis of glioblastomas identifies the major molecular basis for the prognostic benefit of younger age. BMC Med Genomics. 2008;1:52. - PMC - PubMed
    1. CBTRUS 2007–2008 Statistical Report: Primary Brain Tumors in the United States Statistical Report, 2000–2004 (Years of Data Collected). 2008. CBTRUS.
    1. Stupp R, Mason WP, van den Bent MJ, Weller M, Fisher B, et al. Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. N Engl J Med. 2005;352:987–996. - PubMed
    1. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. 2008;455:1061–1068. - PMC - PubMed
    1. Ponten J, Macintyre EH. Long term culture of normal and neoplastic human glia. Acta Pathol Microbiol Scand. 1968;74:465–486. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources