Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Dec 22;4(12):102.
doi: 10.1186/gm403. eCollection 2012.

HLA typing from RNA-Seq sequence reads

Affiliations

HLA typing from RNA-Seq sequence reads

Sebastian Boegel et al. Genome Med. .

Abstract

We present a method, seq2HLA, for obtaining an individual's human leukocyte antigen (HLA) class I and II type and expression using standard next generation sequencing RNA-Seq data. RNA-Seq reads are mapped against a reference database of HLA alleles, and HLA type, confidence score and locus-specific expression level are determined. We successfully applied seq2HLA to 50 individuals included in the HapMap project, yielding 100% specificity and 94% sensitivity at a P-value of 0.1 for two-digit HLA types. We determined HLA type and expression for previously un-typed Illumina Body Map tissues and a cohort of Korean patients with lung cancer. Because the algorithm uses standard RNA-Seq reads and requires no change to laboratory protocols, it can be used for both existing datasets and future studies, thus adding a new dimension for HLA typing and biomarker studies.

PubMed Disclaimer

Figures

Figure 1
Figure 1
HLA sequence variability. (a) The helices encoded by exons 2 (α1- chain: orange) and 3 (α2-chain: red) of the HLA class I alleles bind peptides (crystal structure 3OXR from PDB). (b) HLA locus is polygenic (A, B, C) and highly polymorphic within and across the three class I genes, showing the nucleotide variability at each position of a multiple sequence alignment of all HLA class I alleles, where 0 and 2 represent minimum and maximum variability, as defined using Shannon's Entropy (2 - Information Content; see Methods). (c) Example sequences are shown for four regions in seven HLA groups.
Figure 2
Figure 2
Workflow of seq2HLA. (a) RNA-Seq reads are mapped against the reference HLA sequences. (b) For each locus, the allele with the greatest number of reads is determined. (c) Reads associated with the first determined group are removed and the mapping procedure is repeated. (d) The second digital haplotype is called. (e) Zygosity is determined. (f) The genotype and the associated P-values are reported.
Figure 3
Figure 3
Sensitivity versus specificity for different mapping and technical parameters applied to the 50 Montgomery test samples. Bowtie mapping parameters include -a (report all mappings), -m1 (report only unique mappings), -v<0|1|2> (allow zero, one or two mismatches), using the initial reference dataset consisting of exons 1 to 3, plus 75 nucleotides of exon 4. Analysis of different technical parameters comprised varying the average number of reads per sample (1e6, 5e6 and 10e6) while mapping against the initial reference dataset with '-a -v1', shortening the read length (from 73 nucleotides to 30 nucleotides) and using only one of the read pairs to simulate a single-end read dataset. nt, nucleotide; PE, paired-end; SE, single-end
Figure 4
Figure 4
Summed expression of MHC class I and class II in normal tissues in RPKM units [19]. Seq2HLA was applied to the 50 nucleotide paired-end RNA-Seq dataset from the Illumina Body Map 2.0 project. HLA, human leukocyte antigen; RPKM, reads per kilobase of exon model per million mapped reads.

References

    1. Cao K, Chopek M, Fernandez-Vina MA. High and intermediate resolution DNA typing systems for class I HLA-A, B, C genes by hybridization with sequence-specific oligonucleotide probes (SSOP) Rev Immunogenet. 1999;4:177–208. - PubMed
    1. Olerup O, Zetterquist H. HLA-DR typing by PCR amplification with sequence-specific primers (PCR-SSP) in 2 hours: an alternative to serological DR typing in clinical practice including donor-recipient matching in cadaveric transplantation. Tissue Antigens. 1992;4:225–235. doi: 10.1111/j.1399-0039.1992.tb01940.x. - DOI - PubMed
    1. Santamaria P, Boyce-Jacino MT, Lindstrom AL, Barbosa JJ, Faras AJ, Rich SS. HLA class II 'typing': direct sequencing of DRB, DQB, and DQA genes. Hum Immunol. 1992;4:69–81. doi: 10.1016/0198-8859(92)90056-S. - DOI - PubMed
    1. Terasaki PI, McClelland JD. Microdroplet assay of human serum cytotoxins. Nature. 1964;4:998–1000. - PubMed
    1. Erlich RL, Jia X, Anderson S, Banks E, Gao X, Carrington M, Gupta N, DePristo MA, Henn MR, Lennon NJ, de Bakker PI. Next-generation sequencing for HLA typing of class I loci. BMC Genomics. 2011;4:42. doi: 10.1186/1471-2164-12-42. - DOI - PMC - PubMed