Bioinformatics. 2014 Dec 1;30(23):3310-6.
doi: 10.1093/bioinformatics/btu548. Epub 2014 Aug 20.

OptiType: precision HLA typing from next-generation sequencing data

András Szolek et al. Bioinformatics.

Abstract

Motivation: The human leukocyte antigen (HLA) gene cluster plays a crucial role in adaptive immunity and is thus relevant in many biomedical applications. While next-generation sequencing data are often available for a patient, deducing the HLA genotype is difficult because of substantial sequence similarity within the cluster and exceptionally high variability of the loci. Established approaches, therefore, rely on specific HLA enrichment and sequencing techniques, coming at an additional cost and extra turnaround time.

Result: We present OptiType, a novel HLA genotyping algorithm based on integer linear programming, capable of producing accurate predictions from NGS data not specifically enriched for the HLA cluster. We also present a comprehensive benchmark dataset consisting of RNA, exome and whole-genome sequencing data. OptiType significantly outperformed previously published in silico approaches with an overall accuracy of 97% enabling its use in a broad range of applications.

Figures

Fig. 1.
OptiType’s four-digit HLA typing pipeline. Reference libraries for genomic and coding (CDS) sequences are generated by extracting exons 2 and 3 from each known HLA-I allele. For genomic sequences, flanking intronic regions are also extracted. If some of these regions are missing, phylogenetic information is used to reconstruct the missing segments from the closest relative HLA-I allele. NGS reads are mapped against the HLA allele reference constructed in this way (A). From the mapping result, a binary hit matrix $C$ of dimension $|R| \times |A|$ is constructed for all reads $r \in R$ mapping to at least one allele $a \in A$ of the reference, with $C_{r,a} = 1$ if read $r$ could be mapped to allele $a$ and $C_{r,a} = 0$ otherwise (B). Based on this hit matrix, an ILP is formulated that maximizes the number of explainable reads by selecting up to two alleles (columns of the hit matrix) for each HLA-I locus (C). The selected alleles represent the most probable genotype.
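The selection step described in the Fig. 1 caption can be illustrated on a toy example. The sketch below is not OptiType's implementation: it replaces the ILP solver with exhaustive enumeration, which is only feasible for a handful of alleles, and it uses only the objective stated in the caption (the number of reads hitting at least one selected allele). The read names, allele sets and the function names `explained_reads` and `best_genotype` are hypothetical, and the published formulation may include details the caption does not mention.

```python
from itertools import combinations_with_replacement, product

# Hypothetical toy hit data: read -> set of alleles it maps to.
# This is the sparse form of the binary hit matrix C (C[r, a] = 1 iff a in hits[r]).
hits = {
    "r1": {"A*01:01"},
    "r2": {"A*01:01", "A*03:01"},
    "r3": {"A*02:01"},
    "r4": {"A*02:01", "A*03:01"},
    "r5": {"B*07:02"},
    "r6": {"B*07:02", "B*08:01"},
}
alleles_by_locus = {
    "A": {"A*01:01", "A*02:01", "A*03:01"},
    "B": {"B*07:02", "B*08:01"},
}


def explained_reads(hits, chosen):
    """Count reads that map to at least one of the chosen alleles."""
    return sum(1 for alleles in hits.values() if alleles & chosen)


def best_genotype(hits, alleles_by_locus):
    """Brute-force stand-in for the ILP: try every choice of one or two
    alleles per locus and keep the first one explaining the most reads."""
    per_locus = [
        # A pair (a, a) represents a homozygous call, i.e. a single allele.
        list(combinations_with_replacement(sorted(alleles), 2))
        for alleles in alleles_by_locus.values()
    ]
    best, best_score = None, -1
    for combo in product(*per_locus):
        chosen = {allele for pair in combo for allele in pair}
        score = explained_reads(hits, chosen)
        if score > best_score:
            best, best_score = combo, score
    return best, best_score


genotype, score = best_genotype(hits, alleles_by_locus)
print(genotype, score)
```

On this toy input, the pair A*01:01/A*02:01 explains all four reads at locus A, whereas any pair containing A*03:01 leaves at least one read unexplained. At locus B, adding B*08:01 explains no additional reads, so the tie is resolved in favor of the first candidate enumerated; this tie-breaking rule is an artifact of the sketch, not a property of the method.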
Fig. 2.
Performance comparison of HLA typing algorithms. OptiType’s average prediction accuracy for major HLA-I loci was compared with four other published HLA typing methods capable of four-digit typing on publicly available datasets previously used to evaluate these methods
Fig. 3.
Coverage and read length dependence of prediction accuracy. To determine the influence of coverage depth on HLA typing accuracy, reads of 253 exome sequencing runs of the 1000 Genomes Project were subsampled >4000 times to simulate different coverage depth conditions. To investigate the impact of read length on performance, original reads were trimmed to 37 bp and evaluated with the same subsampling procedure. Read length alone shows little effect on prediction accuracy, and an average coverage depth greater than 10× over the HLA-I loci was already found to yield maximal accuracy
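The subsampling and trimming procedure in the Fig. 3 caption can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' evaluation code: reads are plain strings held in memory, the helper names `subsample_reads` and `trim_reads` are hypothetical, and the sampling fraction is chosen arbitrarily. Only the 37 bp trim length comes from the caption.

```python
import random


def subsample_reads(reads, fraction, seed=0):
    """Draw a random subset of reads (without replacement) to mimic
    a lower coverage depth. The seed makes the draw reproducible."""
    rng = random.Random(seed)
    k = round(len(reads) * fraction)
    return rng.sample(reads, k)


def trim_reads(reads, length=37):
    """Truncate every read to at most `length` bases."""
    return [read[:length] for read in reads]


# Hypothetical input: 100 synthetic reads of 100 bp each.
reads = ["ACGT" * 25 for _ in range(100)]

low_coverage = subsample_reads(reads, fraction=0.25)
short_reads = trim_reads(low_coverage, length=37)
print(len(low_coverage), len(short_reads[0]))
```

Repeating the draw with different seeds and fractions, then typing each subset, would give an accuracy-versus-coverage curve of the kind the caption describes.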

