A sequence-based identification of the genes detected by probesets on the Affymetrix U133 plus 2.0 array - PubMed (original) (raw)
A sequence-based identification of the genes detected by probesets on the Affymetrix U133 plus 2.0 array
Jeremy Harbig et al. Nucleic Acids Res. 2005.
Abstract
One of the biggest problems facing microarray experiments is the difficulty of translating results into other microarray formats or comparing microarray results to other biochemical methods. We believe that this is largely the result of poor gene identification. We re-identified the probesets on the Affymetrix U133 plus 2.0 GeneChip array. This identification was based on the sequence of the probes and the sequence of the human genome. Using the BLAST program, we matched probes with documented and postulated human transcripts. This resulted in the redefinition of approximately 37% of the probes on the U133 plus 2.0 array. This updated identification specifically points out where the identification is complicated by cross-hybridization from splice variants or closely related genes. More than 5000 probesets detect multiple transcripts and therefore the exact protein affected cannot be readily concluded from the performance of one probeset alone. This makes naming difficult and impacts any downstream analysis such as associating gene ontologies, mapping affected pathways or simply validating expression changes. We have now automated the sequence-based identification and can more appropriately annotate any array where the sequence on each spot is known.
Figures
Figure 1
The signal captured by some probesets on the U133A array from 100 RNA samples collected from various tissues. Probeset 202029_x_at detects the expression of ribosomal protein L38. The other three probesets were designed to the complementary strand of the intended reference gene. Probeset 202028_s_at detects sequences complementary to the ribosomal protein L38. The plots for probesets 213619_at and 216868_s_at illustrate the difference between a probeset that detects a transcript and a probeset that does not detect a transcript. Although each plot is represented against a different scale, the relative expression levels are directly comparable.
Similar articles
- Quality assessment of the Affymetrix U133A&B probesets by target sequence mapping and expression data analysis.
Orlov YL, Zhou J, Lipovich L, Shahab A, Kuznetsov VA. Orlov YL, et al. In Silico Biol. 2007;7(3):241-60. In Silico Biol. 2007. PMID: 18415975 - Transcript-level annotation of Affymetrix probesets improves the interpretation of gene expression data.
Yu H, Wang F, Tu K, Xie L, Li YY, Li YX. Yu H, et al. BMC Bioinformatics. 2007 Jun 11;8:194. doi: 10.1186/1471-2105-8-194. BMC Bioinformatics. 2007. PMID: 17559689 Free PMC article. - Gene expression and isoform variation analysis using Affymetrix Exon Arrays.
Bemmo A, Benovoy D, Kwan T, Gaffney DJ, Jensen RV, Majewski J. Bemmo A, et al. BMC Genomics. 2008 Nov 7;9:529. doi: 10.1186/1471-2164-9-529. BMC Genomics. 2008. PMID: 18990248 Free PMC article. - Expression Profiling Using Affymetrix GeneChip Microarrays.
Auer H, Newsom DL, Kornacker K. Auer H, et al. Methods Mol Biol. 2009;509:35-46. doi: 10.1007/978-1-59745-372-1_3. Methods Mol Biol. 2009. PMID: 19212713 Review. - A comparison of analog and Next-Generation transcriptomic tools for mammalian studies.
Roy NC, Altermann E, Park ZA, McNabb WC. Roy NC, et al. Brief Funct Genomics. 2011 May;10(3):135-50. doi: 10.1093/bfgp/elr005. Epub 2011 Mar 9. Brief Funct Genomics. 2011. PMID: 21389008 Review.
Cited by
- Misspellings or "miscellings"-Non-verifiable and unknown cell lines in cancer research publications.
Oste DJ, Pathmendra P, Richardson RAK, Johnson G, Ao Y, Arya MD, Enochs NR, Hussein M, Kang J, Lee A, Danon JJ, Cabanac G, Labbé C, Davis AC, Stoeger T, Byrne JA. Oste DJ, et al. Int J Cancer. 2024 Oct 1;155(7):1278-1289. doi: 10.1002/ijc.34995. Epub 2024 May 15. Int J Cancer. 2024. PMID: 38751110 - Personalized targeted therapy prescription in colorectal cancer using algorithmic analysis of RNA sequencing data.
Sorokin M, Zolotovskaia M, Nikitin D, Suntsova M, Poddubskaya E, Glusker A, Garazha A, Moisseev A, Li X, Sekacheva M, Naskhletashvili D, Seryakov A, Wang Y, Buzdin A. Sorokin M, et al. BMC Cancer. 2022 Oct 31;22(1):1113. doi: 10.1186/s12885-022-10177-3. BMC Cancer. 2022. PMID: 36316649 Free PMC article. - Analysis of dynamic molecular networks: the progression from colorectal adenoma to cancer.
Jiang Y, Song F, Hu X, Guo D, Liu Y, Wang J, Jiang L, Huang P, Zhang Y. Jiang Y, et al. J Gastrointest Oncol. 2021 Dec;12(6):2823-2837. doi: 10.21037/jgo-21-674. J Gastrointest Oncol. 2021. PMID: 35070410 Free PMC article. - Identification of human gene research articles with wrongly identified nucleotide sequences.
Park Y, West RA, Pathmendra P, Favier B, Stoeger T, Capes-Davis A, Cabanac G, Labbé C, Byrne JA. Park Y, et al. Life Sci Alliance. 2022 Jan 12;5(4):e202101203. doi: 10.26508/lsa.202101203. Print 2022 Apr. Life Sci Alliance. 2022. PMID: 35022248 Free PMC article. - Higher content of microcystin-leucine-arginine promotes the survival of intrahepatic cholangiocarcinoma cells via regulating SET resulting in the poorer prognosis of patients.
Gu S, He W, Yan M, He J, Zhou Q, Yan X, Fu X, Chen J, Han X, Qiu Y. Gu S, et al. Cell Prolif. 2021 Feb;54(2):e12961. doi: 10.1111/cpr.12961. Epub 2020 Nov 25. Cell Prolif. 2021. PMID: 33241617 Free PMC article.
References
- Chuaqui R.F., Bonner R.F., Best C.J., Gillespie J.W., Flaig M.J., Hewitt S.M., Phillips J.L., Krizman D.B., Tangrea M.A., Ahram M., et al. Post-analysis follow-up and validation of microarray experiments. Nature Genet. 2002;32(Suppl.):509–514. - PubMed
- Ohyama H., Zhang X., Kohno Y., Alevizos I., Posner M., Wong D.T., Todd R. Laser capture microdissection-generated target sample for high-density oligonucleotide array hybridization. Biotechniques. 2000;29:530–536. - PubMed
- Iscove N.N., Barbara M., Gu M., Gibson M., Modi C., Winegarden N. Representation is faithfully preserved in global cDNA amplified exponentially from sub-picogram quantities of mRNA. Nat. Biotechnol. 2002;20:940–943. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials