Inference of high resolution HLA types using genome-wide RNA or DNA sequencing reads
- PMID: 24884790
- PMCID: PMC4035057
- DOI: 10.1186/1471-2164-15-325
Abstract
Background: Accurate HLA typing at amino acid level (four-digit resolution) is critical in hematopoietic and organ transplantation, in pathogenesis studies of autoimmune and infectious diseases, and in the development of immuno-oncology therapies. With the rapid adoption of genome-wide sequencing in biomedical research, HLA typing based on transcriptome and whole exome/genome sequencing data is becoming increasingly attractive because of its high throughput and convenience. However, unlike targeted amplicon sequencing, genome-wide sequencing often uses shorter reads and lower coverage, which makes it much harder to resolve the highly homologous HLA alleles. Several algorithms exist and have been applied to four-digit typing, but some deliver low to moderate accuracy and others output ambiguous predictions. Moreover, few methods accommodate diverse read lengths and depths together with both RNA and DNA sequencing inputs. New algorithms are therefore needed to improve the accuracy and flexibility of high-resolution HLA typing from genome-wide sequencing data.
Results: We have developed a new algorithm named PHLAT to discover the most probable pair of HLA alleles at four-digit resolution or higher by combining candidate allele selection with likelihood scoring. Over a comprehensive set of benchmarking data (a total of 768 HLA alleles) from both RNA and DNA sequencing, covering a broad range of read lengths and coverage, PHLAT consistently achieves high accuracy at four-digit (92%-95%) and two-digit (96%-99%) resolution, outperforming most of the existing methods. It also supports targeted amplicon sequencing data from Illumina MiSeq.
Conclusions: PHLAT substantially improves the accuracy and flexibility of high-resolution HLA typing based on genome-wide sequencing data. It may benefit both basic and applied research in immunology and related fields, as well as numerous clinical applications.
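The abstract describes the method only as candidate allele selection followed by likelihood scoring of allele pairs. The general idea of likelihood scoring over diploid pairs can be illustrated with a minimal Python sketch. This is not PHLAT's implementation: the error rate, the toy four-base sequences attached to the allele names, the observations, and the function names `base_prob`, `pair_log_likelihood`, `best_pair`, and `truncate` are all assumptions made for illustration.

```python
import math
from itertools import combinations_with_replacement

# Assumed per-base sequencing error rate (illustrative value only).
ERROR_RATE = 0.01


def base_prob(observed, allele_base):
    """Probability of observing a base given the base carried by an allele."""
    return 1.0 - ERROR_RATE if observed == allele_base else ERROR_RATE / 3.0


def pair_log_likelihood(observations, seq1, seq2):
    """Log-likelihood of the observed bases under a diploid pair of alleles.

    Each observation is (site_index, base); a read is assumed to come from
    either allele of the pair with equal probability.
    """
    total = 0.0
    for site, base in observations:
        p = 0.5 * base_prob(base, seq1[site]) + 0.5 * base_prob(base, seq2[site])
        total += math.log(p)
    return total


def best_pair(observations, candidates):
    """Return the candidate allele pair with the highest log-likelihood."""
    pairs = combinations_with_replacement(sorted(candidates), 2)
    return max(
        pairs,
        key=lambda pr: pair_log_likelihood(
            observations, candidates[pr[0]], candidates[pr[1]]
        ),
    )


def truncate(allele, fields):
    """Trim an allele name to a number of fields (2 fields = four-digit)."""
    gene, _, rest = allele.partition("*")
    return gene + "*" + ":".join(rest.split(":")[:fields])


# Hypothetical candidates: toy bases at four polymorphic sites, not real
# HLA sequences.
candidates = {
    "A*01:01": "ACGT",
    "A*02:01": "ATGA",
    "A*03:01": "GCGA",
}

# Hypothetical observed bases from a sample heterozygous at sites 1 and 3.
observations = [
    (0, "A"), (0, "A"),
    (1, "C"), (1, "T"), (1, "C"), (1, "T"),
    (2, "G"), (2, "G"),
    (3, "T"), (3, "A"), (3, "T"), (3, "A"),
]

print(best_pair(observations, candidates))
print(truncate("A*02:01:01", 2), truncate("A*02:01:01", 1))
```

Scoring pairs rather than single alleles is what lets a heterozygous sample be explained: an even mix of two bases at one site is unlikely under any homozygous pair but well supported by the correct heterozygous one. The `truncate` helper shows how a typing result could be compared at four-digit or two-digit resolution.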