Xenome--a tool for classifying reads from xenograft samples - PubMed (original) (raw)

Xenome--a tool for classifying reads from xenograft samples

Thomas Conway et al. Bioinformatics. 2012.

Abstract

Motivation: Shotgun sequence read data derived from xenograft material contains a mixture of reads arising from the host and reads arising from the graft. Classifying the read mixture to separate the two allows for more precise analysis to be performed.

Results: We present a technique, with an associated tool Xenome, which performs fast, accurate and specific classification of xenograft-derived sequence read data. We have evaluated it on RNA-Seq data from human, mouse and human-in-mouse xenograft datasets.

Availability: Xenome is available for non-commercial use from http://www.nicta.com.au/bioinformatics.

PubMed Disclaimer

Figures

Fig. 1.

A Venn diagram showing the different classes that a given _k_-mer may belong to. The marginal host (and marginal graft) partitions are for those host (and graft) _k_-mers that are Hamming distance 1 from a _k_-mer in the graft (and host) reference

Fig. 2.

Summary of the results with Human cDNA. Each of the classes of reads is divided into those reads assigned to the class only by Xenome (Xenome), only by the Tophat analysis (Tophat) or by both Xenome and the Tophat analysis (Concordant)

Fig. 3.

Summary of the results with Murine cDNA

Fig. 4.

Summary of the results with BM18 xenograft cDNA

Fig. 5.

Validation of the in silico classification of xenograft RNA-Seq data with qRT-PCR. The horizontal axis shows log10_FPKM_ for the _Xenome_-derived gene expression for the 18 test genes. The vertical axis shows the _C_t value for each gene relative to the C_t of actin. There were two RNA-Seq samples processed (biological replicates), and four replicates of the qRT-PCR. For each gene, an ellipse is shown centered on the mean log10_FPKM in the _x_-axis, and on the mean relative _C_t in the _y_-axis. The horizontal and vertical radii show the variance in the samples

Fig. 6.

An in silico analysis showing the degree of ambiguity in HG19 refGene, according to the _k_-mer based analysis used by Xenome. In this analysis, k = 25

Fig. 7.

A plot showing the distribution of human genes with respect to the proportion of xenograft reads which are classed as both by the _Tophat_-based analysis and the Xenome analysis. The reads considered are only those mapped by Tophat since Xenome does not yield mappings, so cannot be used to assign reads to genes. Only genes for which at least 20 reads mapped were considered. The horizontal axis corresponds to the number of reads classified as both or ambiguous by Xenome as a proportion of all the reads that might possibly be human (i.e. both, ambiguous or human). The vertical axis corresponds to the number of reads classified as both by the _Tophat_-based analysis, once again, as a proportion of all the reads that might possibly be human

Cited by

SUMO2 Inhibition Reverses Aberrant Epigenetic Rewiring Driven by Synovial Sarcoma Fusion Oncoproteins and Impairs Sarcomagenesis.
Iyer R, Deshpande A, Pedgaonkar A, Bala PA, Kim T, Brien GL, Finlay D, Vuori K, Soragni A, Murad R, Deshpande AJ. Iyer R, et al. bioRxiv [Preprint]. 2024 Sep 25:2024.09.23.614593. doi: 10.1101/2024.09.23.614593. bioRxiv. 2024. PMID: 39386552 Free PMC article. Preprint.
DNA demethylation triggers cell free DNA release in colorectal cancer cells.
Pessei V, Macagno M, Mariella E, Congiusta N, Battaglieri V, Battuello P, Viviani M, Gionfriddo G, Lamba S, Lorenzato A, Oddo D, Idrees F, Cavaliere A, Bartolini A, Guarrera S, Linnebacher M, Monteonofrio L, Cardone L, Milella M, Bertotti A, Soddu S, Grassi E, Crisafulli G, Bardelli A, Barault L, Di Nicolantonio F. Pessei V, et al. Genome Med. 2024 Oct 9;16(1):118. doi: 10.1186/s13073-024-01386-5. Genome Med. 2024. PMID: 39385243 Free PMC article.
Patient-derived xenografts and single-cell sequencing identifies three subtypes of tumor-reactive lymphocytes in uveal melanoma metastases.
Karlsson JW, Sah VR, Olofsson Bagge R, Kuznetsova I, Iqba M, Alsen S, Stenqvist S, Saxena A, Ny L, Nilsson LM, Nilsson JA. Karlsson JW, et al. Elife. 2024 Sep 23;12:RP91705. doi: 10.7554/eLife.91705. Elife. 2024. PMID: 39312285 Free PMC article.
Proteostasis perturbation of N-Myc leveraging HSP70 mediated protein turnover improves treatment of neuroendocrine prostate cancer.
Xu P, Yang JC, Chen B, Ning S, Zhang X, Wang L, Nip C, Shen Y, Johnson OT, Grigorean G, Phinney B, Liu L, Wei Q, Corey E, Tepper CG, Chen HW, Evans CP, Dall'Era MA, Gao AC, Gestwicki JE, Liu C. Xu P, et al. Nat Commun. 2024 Aug 5;15(1):6626. doi: 10.1038/s41467-024-50459-x. Nat Commun. 2024. PMID: 39103353 Free PMC article.
Genomic profiling of a multi-lineage and multi-passage patient-derived xenograft biobank reflects heterogeneity of ovarian cancer.
Qin T, Hu Z, Zhang L, Lu F, Xiao R, Liu Y, Fan J, Guo E, Yang B, Fu Y, Zhuang X, Kang X, Wu Z, Fang Z, Cui Y, Hu X, Yin J, Yan M, Li F, Song K, Chen G, Sun C. Qin T, et al. Cell Rep Med. 2024 Jul 16;5(7):101631. doi: 10.1016/j.xcrm.2024.101631. Epub 2024 Jul 9. Cell Rep Med. 2024. PMID: 38986623 Free PMC article.

References

1. Arbitman Y., et al. 2010 IEEE 51st Annual Symposium on Foundations of Computer Science. Los Alamos California: IEEE Computer Society; 2010. Backyard cuckoo hashing: Constant worst-case operations with a succinct representation; pp. 787–796.
1. Chung D., et al. Discovering transcription factor binding sites in highly repetitive regions of genomes with multi-read analysis of ChIP-Seq data. PLoS Comput. Biol. 2011;7:e1002111. - PMC - PubMed
1. Conway T.C., Bromage A.J. Succinct data structures for assembling large genomes. Bioinformatics. 2011;27:479–486. - PubMed
1. Ding L., et al. Genome remodelling in a basal-like breast cancer metastasis and xenograft. Nature. 2010;464:999–1005. - PMC - PubMed
1. Hormozdiari F., et al. Next-generation VariationHunter: combinatorial algorithms for transposon insertion discovery. Bioinformatics. 2010;26:i350–i357. - PMC - PubMed

Xenome--a tool for classifying reads from xenograft samples - PubMed (original) (raw)