Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Aug 1;369(1):fnac064.
doi: 10.1093/femsle/fnac064.

Examination of phase-variable haemoglobin-haptoglobin binding proteins in non-typeable Haemophilus influenzae reveals a diverse distribution of multiple variants

Affiliations

Examination of phase-variable haemoglobin-haptoglobin binding proteins in non-typeable Haemophilus influenzae reveals a diverse distribution of multiple variants

Zachary N Phillips et al. FEMS Microbiol Lett. .

Abstract

Non-typeable Haemophilus influenzae (NTHi) is a major human pathogen for which there is no globally licensed vaccine. NTHi has a strict growth requirement for iron and encodes several systems to scavenge elemental iron and heme from the host. An effective NTHi vaccine would target conserved, essential surface factors, such as those involved in iron acquisition. Haemoglobin-haptoglobin binding proteins (Hgps) are iron-uptake proteins localized on the outer-membrane of NTHi. If the Hgps are to be included as components of a rationally designed subunit vaccine against NTHi, it is important to understand their prevalence and diversity. Following analysis of all available Hgp sequences, we propose a standardized grouping method for Hgps, and demonstrate increased diversity of these proteins than previously determined. This analysis demonstrated that genes encoding variants HgpB and HgpC are present in all strains examined, and almost 40% of strains had a duplicate, nonidentical hgpB gene. Hgps are also phase-variably expressed; the encoding genes contain a CCAA(n) simple DNA sequence repeat tract, resulting in biphasic ON-OFF switching of expression. Examination of the ON-OFF state of hgpB and hgpC genes in a collection of invasive NTHi isolates demonstrated that 58% of isolates had at least one of hgpB or hgpC expressed (ON). Varying expression of a diverse repertoire of hgp genes would provide strains a method of evading an immune response while maintaining the ability to acquire iron via heme. Structural analysis of Hgps also revealed high sequence variability at the sites predicted to be surface exposed, demonstrating a further mechanism to evade the immune system-through varying the surface, immune-exposed regions of the membrane anchored protein. This information will direct and inform the choice of candidates to include in a vaccine against NTHi.

Keywords: Hgp; NTHi; invasive disease; iron acquisition; phase variation.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
(A) The primary NTHi hgp gene cluster is located immediately downstream of the fucI gene (encoding a fucose isomerase), with variable distance (30–80 kb) between multiple hgp genes located in this region. Our analysis demonstrated that there were at least two hgp genes within this primary cluster, but the number of hgps varies in number and orientation in individual strains. Additionally, a secondary hgp gene can be located between the bioA (encoding adenosylmethionine-8-amino-7-oxononanoate aminotransferase) and the a pyk gene (encoding pyruvate kinase). This secondary site contains only a single hgp gene, and is not present in all strains. (B) Alignment of Hgp amino acid sequences in H. influenzae NCBI fully annotated genomes. Protein sequences were aligned by CLUSTAL OMEGA (v1.2.4) and viewed using default JalView (v2.1.1.7) settings, visually representing % identity (% ID) between sequences. The number of sequences aligned is under the ‘No.’ column. Amino acids are coloured according to the percentage in each column that agree with the consensus sequence, with % identity shown as blue, ranging from > 80% to > 40% identity. Grey areas represent gaps, and white areas indicate < 40% identity with the consensus sequence. We have categorized the previously broad Hgp groups (HgpA–D) using a > 70% identity cut-off to separate Hgps into groups (HgpA–G) and 80% identity to separate alleles (e.g HgpA1 vs. A2).
Figure 2.
Figure 2.
(A) The location of the surface domains and heme-binding core within aligned HgpB protein sequences. The structure of HgpB (from strain NCTC13377) was predicted using AlphaFold (v2.1.2), with (B) side and (C) top-down view provided. The VDs of the Hgps are located in surface-exposed areas (white). The β-barrel structure was highly conserved within Hgp groups (blue). The heme-binding core was surface accessible and highly conserved between Hgp groups (red). (D) Variable (surface) Domains (VD1–5) of HgpB contain highly variable sequences. Individual sequences were identified by aligning all the sequences present from each of the VDs (separate from the whole sequence) in CLUSTAL OMEGA (v1.2.4) and viewed using default settings in JalView overview (v2.1.1.7). A total of 102 HgpB protein sequences were included in the alignments. Amino acids are coloured according to the percentage in each column that agree with the consensus sequence, with % identity shown as blue, ranging from > 80% to > 40% identity. Grey areas represent gaps, and white areas indicate < 40% identity with the consensus sequence. VD1–the largest surface domain—had the highest sequence variability, and was not separated into individual conserved sequences. VD2–VD5 had a lower amount of diversity than VD1, and as such we have been able to individually identify the number of variants within each of these VDs (numbered 1–6) indicated on the left-hand side of each individual VD alignment.

References

    1. Atack JM, Srikhanta YN, Fox KLet al. . A biphasic epigenetic switch controls immunoevasion, virulence and niche adaptation in non-typeable Haemophilus influenzae. Nat Commun. 2015;6:7828. - PMC - PubMed
    1. Choby JE, Skaar EP. Heme synthesis and acquisition in bacterial pathogens. J Mol Biol. 2016;428:3408–28. - PMC - PubMed
    1. Cope LD, Hrkal Z, Hansen EJ. Detection of phase variation in expression of proteins involved in hemoglobin and hemoglobin-haptoglobin binding by nontypeable Haemophilus influenzae. Infect Immun. 2000;68:4092–101. - PMC - PubMed
    1. Dixon K, Bayliss CD, Makepeace Ket al. . Identification of the functional initiation codons of a phase-variable gene of Haemophilus influenzae, lic2A, with the potential for differential expression. J Bacteriol. 2007;189:511–21. - PMC - PubMed
    1. Fox KL, Atack JM, Srikhanta YNet al. . Selection for phase variation of LOS biosynthetic genes frequently occurs in progression of non-typeable Haemophilus influenzae infection from the nasopharynx to the middle ear of human patients. PLoS ONE. 2014;9:e90505. - PMC - PubMed

Publication types

MeSH terms