Reference alignment of SNP microarray signals for copy number analysis of tumors - PubMed (original) (raw)
Reference alignment of SNP microarray signals for copy number analysis of tumors
Stan Pounds et al. Bioinformatics. 2009.
Abstract
A new procedure to align single nucleotide polymorphism (SNP) microarray signals for copy number analysis is proposed. For each individual array, this reference alignment procedure (RAP) uses a set of selected markers as internal references to direct the signal alignment. RAP aligns the signals so that each array has a similar signal distribution among its reference markers. An accompanying reference selection algorithm (RSA) uses genotype calls and initial signal intensities to choose two-copy markers as the internal references for each array. After RSA and RAP are applied, each array has a similar distribution of signals of two-copy markers so that across-array signal comparisons are biologically meaningful. An upper bound for a statistical metric of signal misalignment is derived and provides a theoretical basis to choose RSA-RAP over other alignment procedures for copy number analysis of cancers. In our study of acute lymphoblastic leukemia, RSA-RAP gives copy number analysis results that show substantially better concordance with cytogenetics than do two other alignment procedures.
Availability: Documented R code is freely available from www.stjuderesearch.org/depts/biostats/refnorm.
Figures
Fig. 1.
Alignment and segmentation results for a hyperdiploid tumor. (A) The quantiles of QA signals for markers on the amplified chromosomes (black dashed line) and chromosomes with no cytogenetic abnormality (solid black line) of a hyperdiploid tumor's array against the corresponding quantiles of QA signals for all markers of a female control tissue array. A diagonal gray line along _y_=x is included for reference. (C) The results of segmenting the standardized differences (
Supplementary Materials
, Section A) computed from QA signals. The _x_-axis represents markers (ordered by chromosome and position), and the _y_-axis represents the standardized differences. Each gray point represents the standardized difference for one marker, and the thick horizontal lines represent the median of the standardized differences in the determined segments. (B, D) panels show analogous results for the same tumor using RSA-RAP signals. RSA-RAP signals and QA signals are on different scales (A, B) because RSA-RAP maps signals to the targeted normal (0,1) distribution and QA maps to the empirically defined distribution of Equation (1).
Fig. 2.
ROC-type curves. (A–D) the concordance of each alignment procedure's results with cytogenetics across all tumor arrays as a function of the threshold γ among markers on amplified chromosomes, deleted chromosomes, chromosomes with no cytogenetically detected abnormality and all markers on chromosomes in one of these three categories. The solid black, dashed black, solid gray and dashed gray lines give the results for RSA-RAP, cyto-RAP, QA and ISA, respectively.
Similar articles
- SNiPer: improved SNP genotype calling for Affymetrix 10K GeneChip microarray data.
Huentelman MJ, Craig DW, Shieh AD, Corneveaux JJ, Hu-Lince D, Pearson JV, Stephan DA. Huentelman MJ, et al. BMC Genomics. 2005 Oct 31;6:149. doi: 10.1186/1471-2164-6-149. BMC Genomics. 2005. PMID: 16262895 Free PMC article. - A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.
Xiao Y, Segal MR, Yang YH, Yeh RF. Xiao Y, et al. Bioinformatics. 2007 Jun 15;23(12):1459-67. doi: 10.1093/bioinformatics/btm131. Epub 2007 Apr 25. Bioinformatics. 2007. PMID: 17459966 - Implementation of array based whole-genome high-resolution technologies confirms the absence of secondary copy-number alterations in MLL-AF4-positive infant ALL patients.
Bardini M, Galbiati M, Lettieri A, Bungaro S, Gorletta TA, Biondi A, Cazzaniga G. Bardini M, et al. Leukemia. 2011 Jan;25(1):175-8. doi: 10.1038/leu.2010.232. Epub 2010 Oct 14. Leukemia. 2011. PMID: 20944671 No abstract available. - CGHcall: calling aberrations for array CGH tumor profiles.
van de Wiel MA, Kim KI, Vosse SJ, van Wieringen WN, Wilting SM, Ylstra B. van de Wiel MA, et al. Bioinformatics. 2007 Apr 1;23(7):892-4. doi: 10.1093/bioinformatics/btm030. Epub 2007 Jan 31. Bioinformatics. 2007. PMID: 17267432 - SNP array analysis in constitutional and cancer genome diagnostics--copy number variants, genotyping and quality control.
de Leeuw N, Hehir-Kwa JY, Simons A, Geurts van Kessel A, Smeets DF, Faas BH, Pfundt R. de Leeuw N, et al. Cytogenet Genome Res. 2011;135(3-4):212-21. doi: 10.1159/000331273. Epub 2011 Sep 16. Cytogenet Genome Res. 2011. PMID: 21934286 Review.
Cited by
- Multi-platform profiling characterizes molecular subgroups and resistance networks in chronic lymphocytic leukemia.
Bloehdorn J, Braun A, Taylor-Weiner A, Jebaraj BMC, Robrecht S, Krzykalla J, Pan H, Giza A, Akylzhanova G, Holzmann K, Scheffold A, Johnston HE, Yeh RF, Klymenko T, Tausch E, Eichhorst B, Bullinger L, Fischer K, Weisser M, Robak T, Schneider C, Gribben J, Dahal LN, Carter MJ, Elemento O, Landau DA, Neuberg DS, Cragg MS, Benner A, Hallek M, Wu CJ, Döhner H, Stilgenbauer S, Mertens D. Bloehdorn J, et al. Nat Commun. 2021 Sep 13;12(1):5395. doi: 10.1038/s41467-021-25403-y. Nat Commun. 2021. PMID: 34518531 Free PMC article. - Network-based systems pharmacology reveals heterogeneity in LCK and BCL2 signaling and therapeutic sensitivity of T-cell acute lymphoblastic leukemia.
Gocho Y, Liu J, Hu J, Yang W, Dharia NV, Zhang J, Shi H, Du G, John A, Lin TN, Hunt J, Huang X, Ju B, Rowland L, Shi L, Maxwell D, Smart B, Crews KR, Yang W, Hagiwara K, Zhang Y, Roberts K, Wang H, Jabbour E, Stock W, Eisfelder B, Paietta E, Newman S, Roti G, Litzow M, Easton J, Zhang J, Peng J, Chi H, Pounds S, Relling MV, Inaba H, Zhu X, Kornblau S, Pui CH, Konopleva M, Teachey D, Mullighan CG, Stegmaier K, Evans WE, Yu J, Yang JJ. Gocho Y, et al. Nat Cancer. 2021 Mar;2(3):284-299. doi: 10.1038/s43018-020-00167-4. Epub 2021 Jan 21. Nat Cancer. 2021. PMID: 34151288 Free PMC article. - Single nucleotide polymorphism array-based signature of low hypodiploidy in acute lymphoblastic leukemia.
Creasey T, Enshaei A, Nebral K, Schwab C, Watts K, Cuthbert G, Vora A, Moppett J, Harrison CJ, Fielding AK, Haas OA, Moorman AV. Creasey T, et al. Genes Chromosomes Cancer. 2021 Sep;60(9):604-615. doi: 10.1002/gcc.22956. Epub 2021 May 17. Genes Chromosomes Cancer. 2021. PMID: 33938069 Free PMC article. - 13q12.2 deletions and FLT3 overexpression in acute leukemias.
Poubel CP, Boroni M, Emerenciano M. Poubel CP, et al. Blood Adv. 2021 Apr 27;5(8):2075-2078. doi: 10.1182/bloodadvances.2020003643. Blood Adv. 2021. PMID: 33877294 Free PMC article. No abstract available. - Mutational landscape and patterns of clonal evolution in relapsed pediatric acute lymphoblastic leukemia.
Waanders E, Gu Z, Dobson SM, Antić Ž, Crawford JC, Ma X, Edmonson MN, Payne-Turner D, van de Vorst M, Jongmans MCJ, McGuire I, Zhou X, Wang J, Shi L, Pounds S, Pei D, Cheng C, Song G, Fan Y, Shao Y, Rusch M, McCastlain K, Yu J, van Boxtel R, Blokzijl F, Iacobucci I, Roberts KG, Wen J, Wu G, Ma J, Easton J, Neale G, Olsen SR, Nichols KE, Pui CH, Zhang J, Evans WE, Relling MV, Yang JJ, Thomas PG, Dick JE, Kuiper RP, Mullighan CG. Waanders E, et al. Blood Cancer Discov. 2020 Jul;1(1):96-111. doi: 10.1158/0008-5472.BCD-19-0041. Blood Cancer Discov. 2020. PMID: 32793890 Free PMC article.
References
- Bolstad BM, et al. A comparison of normalization methods for high density oligonucleotide array based on variance and bias. Bioinformatics. 2003;19:185–193. - PubMed
- Mullighan CG, et al. Genes regulating B cell development are mutated in acute lymphoid leukaemia. Nature. 2007;446:758–764. - PubMed
- Pounds S, Cheng C. Statistical development and evaluation of gene expression data filters. J. Comput. Biol. 2005;12:482–495. - PubMed
- Raimondi SC, et al. Cytogenetics as a diagnostic aid for childhood hematologic disorders: conventional cytogenetic techniques, fluorescence in situ hybridization, and comparative genomic hybridization. In: Hanausek M, Walaszek Z, editors. Tumor Marker Protocols. Methods Molecular Medicine. Totowa, NJ: Humana Press; 1998. pp. 209–227. 1998.
Publication types
MeSH terms
Substances
Grants and funding
- R01 CA129541/CA/NCI NIH HHS/United States
- U01 GM061393/GM/NIGMS NIH HHS/United States
- U01GM61393/GM/NIGMS NIH HHS/United States
- R01 CA132946/CA/NCI NIH HHS/United States
- U01GM61374/GM/NIGMS NIH HHS/United States
- CA21765/CA/NCI NIH HHS/United States
- R01CA132946-01/CA/NCI NIH HHS/United States
- R01CA115422-01A1/CA/NCI NIH HHS/United States
- R01 CA115422/CA/NCI NIH HHS/United States
- R01CA129541-01/CA/NCI NIH HHS/United States
- P30 CA021765/CA/NCI NIH HHS/United States
- U01 GM061374/GM/NIGMS NIH HHS/United States