Derivation of HLA types from shotgun sequence datasets - PubMed (original) (raw)
Derivation of HLA types from shotgun sequence datasets
René L Warren et al. Genome Med. 2012.
Abstract
The human leukocyte antigen (HLA) is key to many aspects of human physiology and medicine. All current sequence-based HLA typing methodologies are targeted approaches requiring the amplification of specific HLA gene segments. Whole genome, exome and transcriptome shotgun sequencing can generate prodigious data but due to the complexity of HLA loci these data have not been immediately informative regarding HLA genotype. We describe HLAminer, a computational method for identifying HLA alleles directly from shotgun sequence datasets (http://www.bcgsc.ca/platform/bioinfo/software/hlaminer). This approach circumvents the additional time and cost of generating HLA-specific data and capitalizes on the increasing accessibility and affordability of massively parallel sequencing.
Figures
Figure 1
Computational predictions of HLA-I from shotgun data by targeted assembly (left) or read alignment (right). For targeted assembly, NGS reads having their first fifteen 5' bases matching one of HLA CDS (RNA-Seq) or genomic (WGS/exon capture) sequences are recruited and assembled de novo with TASR. Resulting sequence contigs are aligned against a database sequence of all predicted HLA CDS (RNA-Seq) or genomic sequences (WGS/exon capture), tracking best HLA hit(s). Reciprocal best alignments are considered in the same manner. Putative allele assignments from shotgun datasets (HLAminer) are informed by contig length, depth of coverage and similarity to reference sequences, when applicable. The probability of each prediction being correct is estimated by determining the probability of that prediction being observed by chance.
Figure 2
HLAminer performance. HLA allele group and protein coding allele predictions derived from targeted read assembly (black symbols) or direct read alignment (grey symbols) of simulated 100-nucleotide RNA-Seq, WGS and exon capture (ExCap) datasets were compared to original, spiked-in, HLA sequences and performance metrics evaluated (ambiguity, sensitivity and specificity represented by circle, triangle and square symbols, respectively). HLAminer predictions were also obtained from targeted assembly of colorectal cancer (CRC; blue symbols), lymphoma (DLBCL; red, orange and yellow symbols), 1000 Genomes (1KG; green symbols) and ovarian cancer (OV; violet and magenta symbols) patient tumor (T) and/or matched normal (N) shotgun datasets and compared to PCR-based HLA types to calculate performance metrics.
Similar articles
- Evaluation of computational programs to predict HLA genotypes from genomic sequencing data.
Bauer DC, Zadoorian A, Wilson LOW; Melbourne Genomics Health Alliance; Thorne NP. Bauer DC, et al. Brief Bioinform. 2018 Mar 1;19(2):179-187. doi: 10.1093/bib/bbw097. Brief Bioinform. 2018. PMID: 27802932 Free PMC article. - Using Exome and Amplicon-Based Sequencing Data for High-Resolution HLA Typing with ATHLATES.
Liu C, Yang X. Liu C, et al. Methods Mol Biol. 2018;1802:203-213. doi: 10.1007/978-1-4939-8546-3_14. Methods Mol Biol. 2018. PMID: 29858811 - Comparison of Sequence Specific Primers in the Next Generation Sequencing in Human Leukocyte Antigen Typing for Transplant Recipients.
Tipu HN. Tipu HN. J Coll Physicians Surg Pak. 2020 Nov;30(11):1138-1142. doi: 10.29271/jcpsp.2020.11.1138. J Coll Physicians Surg Pak. 2020. PMID: 33222728 - HLA DNA typing: past, present, and future.
Erlich H. Erlich H. Tissue Antigens. 2012 Jul;80(1):1-11. doi: 10.1111/j.1399-0039.2012.01881.x. Tissue Antigens. 2012. PMID: 22651253 Review.
Cited by
- High-Throughput Sequencing to Investigate Associations Between HLA Genes and Metamizole-Induced Agranulocytosis.
Cismaru AL, Grimm L, Rudin D, Ibañez L, Liakoni E, Bonadies N, Kreutz R, Hallberg P, Wadelius M; EuDAC Collaborators; Haschke M, Largiadèr CR, Amstutz U. Cismaru AL, et al. Front Genet. 2020 Aug 21;11:951. doi: 10.3389/fgene.2020.00951. eCollection 2020. Front Genet. 2020. PMID: 32973882 Free PMC article. - Differential haplotype expression in class I MHC genes during SARS-CoV-2 infection of human lung cell lines.
Francisco Junior RDS, Temerozo JR, Ferreira CDS, Martins Y, Souza TML, Medina-Acosta E, de Vasconcelos ATR. Francisco Junior RDS, et al. Front Immunol. 2023 Feb 1;13:1101526. doi: 10.3389/fimmu.2022.1101526. eCollection 2022. Front Immunol. 2023. PMID: 36818472 Free PMC article. - ProGeo-Neo v2.0: A One-Stop Software for Neoantigen Prediction and Filtering Based on the Proteogenomics Strategy.
Liu C, Zhang Y, Jian X, Tan X, Lu M, Ouyang J, Liu Z, Li Y, Xu L, Chen L, Lin Y, Xie L. Liu C, et al. Genes (Basel). 2022 Apr 28;13(5):783. doi: 10.3390/genes13050783. Genes (Basel). 2022. PMID: 35627168 Free PMC article. - Co-occurrence of anaerobic bacteria in colorectal carcinomas.
Warren RL, Freeman DJ, Pleasance S, Watson P, Moore RA, Cochrane K, Allen-Vercoe E, Holt RA. Warren RL, et al. Microbiome. 2013 May 15;1(1):16. doi: 10.1186/2049-2618-1-16. Microbiome. 2013. PMID: 24450771 Free PMC article. - Artificial intelligence and neoantigens: paving the path for precision cancer immunotherapy.
Bulashevska A, Nacsa Z, Lang F, Braun M, Machyna M, Diken M, Childs L, König R. Bulashevska A, et al. Front Immunol. 2024 May 29;15:1394003. doi: 10.3389/fimmu.2024.1394003. eCollection 2024. Front Immunol. 2024. PMID: 38868767 Free PMC article. Review.
References
- Mizuki N, Meguro A, Ota M, Ohno S, Shiota T, Kawagoe T, Ito N, Kera J, Okada E, Yatsu K, Song YW, Lee EB, Kitaichi N, Namba K, Horie Y, Takeno M, Sugita S, Mochizuki M, Bahram S, Ishigatsubo Y, Inoko H. Genome-wide association studies identify IL23R-IL12RB2 and IL10 as Behçet's disease susceptibility loci. Nat Genet. 2010;42:703–706. doi: 10.1038/ng.624. - DOI - PubMed
- Rioux JD, Goyette P, Vyse TJ, Hammarström L, Fernando MM, Green T, De Jager PL, Foisy S, Wang J, de Bakker PI, Leslie S, McVean G, Padyukov L, Alfredsson L, Annese V, Hafler DA, Pan-Hammarström Q, Matell R, Sawcer SJ, Compston AD, Cree BA, Mirel DB, Daly MJ, Behrens TW, Klareskog L, Gregersen PK, Oksenberg JR, Hauser SL. Mapping of multiple susceptibility variants within the MHC region for 7 immune-mediated diseases. Proc Natl Acad Sci USA. 2009;106:18680–18685. - PMC - PubMed
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials