Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Sep 3:8:e42463.
doi: 10.7554/eLife.42463.

Interferon lambda 4 impacts the genetic diversity of hepatitis C virus

Collaborators, Affiliations

Interferon lambda 4 impacts the genetic diversity of hepatitis C virus

M Azim Ansari et al. Elife. .

Abstract

Hepatitis C virus (HCV) is a highly variable pathogen that frequently establishes chronic infection. This genetic variability is affected by the adaptive immune response but the contribution of other host factors is unclear. Here, we examined the role played by interferon lambda-4 (IFN-λ4) on HCV diversity; IFN-λ4 plays a crucial role in spontaneous clearance or establishment of chronicity following acute infection. We performed viral genome-wide association studies using human and viral data from 485 patients of white ancestry infected with HCV genotype 3a. We demonstrate that combinations of host genetic variants, which determine IFN-λ4 protein production and activity, influence amino acid variation across the viral polyprotein - not restricted to specific viral proteins or HLA restricted epitopes - and modulate viral load. We also observed an association with viral di-nucleotide proportions. These results support a direct role for IFN-λ4 in exerting selective pressure across the viral genome, possibly by a novel mechanism.

Keywords: genetics; genome-to-genome analysis; genomics; hepatitis C virus; host-pathogen interactions; human; infectious disease; innate immunity; interferon lambda 4; microbiology; virus.

PubMed Disclaimer

Conflict of interest statement

MA, EA, CI, Ad, SL, CB, DB, AT, PP, VS, VC, EH, RB, AP, ET, PS, PK, CH, EB, CS, JM, VP No competing interests declared, GF Grants Consulting and Speaker/Advisory Board: AbbVie, Alcura, Bristol-Myers Squibb, Gilead, Janssen, GlaxoSmithKline, Merck, Roche, Springbank, Idenix, Tekmira, Novartis, WI Grants, Consulting and Advisory/ Speaker Board: Roche, Janssen Cilag, Gilead Sciences, Novartis, GlaxoSmithKline, Pfizer, Abbvie and Bristol-Myers Squibb, KA Grants, Consulting and Advisory/ Speaker Board: Achillion, Alnylam, Astellas, Abbvie, Bristol-Myers Squibb, Gilead, GlaxoSmithKline, Janssen, Merck, Roche, Novartis, Vir

Figures

Figure 1.
Figure 1.. HCV genome-wide association study with IFNL4 SNP rs12979860 genotypes (CC vs. non-CC).
(a) Manhattan plot. The dashed line indicates 5% FDR. At this level 42 sites on the virus polyprotein are significantly associated with IFNL4 SNP. (b) Schematic of the HCV polyprotein.
Figure 1—figure supplement 1.
Figure 1—figure supplement 1.. Host and viral principal components.
(a) Host first and second PCs. (b) Proportion of variance explained by the viral principal components (PCs). The first and second PCs explain 3% and 2% of variation in the nucleotide sequences respectively which indicates there is little clustering of the viral sequences. This is also consistent with the virus phylogenetic tree, which is star-like.
Figure 1—figure supplement 2.
Figure 1—figure supplement 2.. Association between viral PCs and IFNL4 SNP rs12979860 genotypes in the combined cohort (N = 485).
(a) Virus phylogenetic tree, cohort (black EAP, grey BOSON), IFNL4 SNP rs12979860 (CC white, non-CC black) and the first 10 viral PCs (the colours are mapped such that dark blue represents the smallest number and bright yellow represents the largest number for each PC). (b) P-value of univariate association tests between viral PCs and the host SNPs. Black and grey dots are for association tests between the viral PCs and the IFNL4 SNP rs12979860 and 500 SNPs with minor allele frequency (MAF) similar to IFNL4 SNP rs12979860 MAF, respectively. Dashed line shows the 10% FDR line and the dotted line shows the nominal significance of p=0.05. Distribution of the fifth (c) and seventh (d) PCs stratified by the host IFNL4 SNP rs12979860 genotypes. Black dot and lines show the mean and 95% confidence interval for each group.
Figure 1—figure supplement 3.
Figure 1—figure supplement 3.. Distribution of (a) cohort from which the sequences were obtained (BOSON N = 411 or EAP N = 74) and (b) host IFNL4 SNP rs12979860 genotypes (CC or non-CC) on the virus phylogenetic tree.
The thickness and redness of the branches are proportional to the posterior probability that the distribution of the trait of interest on the tips of the tree is different under that clade. (a) Bayes factor of the alternative (where there is one or more branches which have a different distribution of sequences from the BOSON and EAP cohorts) to null model (where the distribution of sequences from the BOSON and EAP cohorts is the same everywhere on the tree) is 249, indicating that the alternative model is supported. There is a branch (thick and red) on the tree, representing a clade within which there are very few EAP sequences as well as sequences from UK patients in the BOSON cohort. (b) There is no evidence that any part of the tree has a distinct distribution in terms of the host IFNL4 SNP rs12979860 genotypes, the estimated Bayes factor for alternative model (where there is one or more branches which have a different distribution of host IFNL4 genotypes) to null model (where the distribution of IFNL4 SNP rs12979860 genotypes is the same everywhere on the tree) is 1.1 which indicates that the null model that host IFNL4 SNP rs12979860 genotypes are randomly distributed on the virus tree is better supported.
Figure 1—figure supplement 4.
Figure 1—figure supplement 4.. Viral allele frequencies in the BOSON (N = 411) and EAP (N = 74) cohorts.
(a) Viral nucleotide and (b) amino acid frequencies in the BOSON and EAP cohorts. The red dots represent the 10 amino acids previously reported to be associated with IFNL4 genotype. The black dot represents position 2576, which was identified in our previous study as a site associated with IFNL4 genotype (Ansari et al., 2017) but was not tested in the present study.
Figure 1—figure supplement 5.
Figure 1—figure supplement 5.. IFNL4-associated residues on the core E2 structure.
The core E2 structure (PDB 4MWF) is shown in grey, (A) shows the protein surface as a mesh (B) shows the ribbon structure. IFNL4 associated residues are highlighted as follows: L438 and F442 in Epitope 2 (red), K500 and S501 in orange, R521 in cyan, A524 and L546 in Epitope 3 (blue), T558 in purple and D576, N577, T578 and L580 in the igVR (green).
Figure 1—figure supplement 6.
Figure 1—figure supplement 6.. QQ-plots for association studies between viral amino acids and viral codons and host IFNL4 SNP rs12979860 and 500 host SNPs chosen across the human genome with a minor allele frequency (MAF) similar to the IFNL4 SNP rs12979860 MAF.
QQ-plots for association tests between the host SNPs and viral amino acid (a) host SNPs and change from the most common viral codon to (b) non-synonymous codons and (c) synonymous codons. First two viral PCs and first three host PCs were used as covariates in all three analyses. The black circles show the QQ-plot for the virus GWAS against IFNL4 SNP rs12979860 and the grey circles show the QQ-plot for the virus GWASs against 500 frequency-matched SNPs.
Figure 1—figure supplement 7.
Figure 1—figure supplement 7.. Effect size (beta) of IFNL4 SNP rs12979860 genotypes (CC vs. non-CC) on the proportion of dinucleotide frequencies in the combined cohort (N = 485).
The effect size is measured for non-CC relative to CC group.
Figure 1—figure supplement 8.
Figure 1—figure supplement 8.. Effect size (beta) of IFNL4 SNP rs12979860 genotypes (CC vs. non-CC) on the proportion of dinucleotide frequencies in the BOSON (N = 411) and EAP (N = 74) cohorts.
The effect size is measured for non-CC relative to CC group. The square indicates the effect size in the BOSON cohort and the circle indicates the effect size in the EAP cohort.
Figure 2.
Figure 2.. Comparison of the effect sizes (log(OR)) of host IFN-λ4 variants (IFN-λ4-S70 and IFN-λ4-P70 relative to IFN-λ4-Null) on HCV-encoded amino acids.
The circles show the log(OR) estimates and the grey lines indicate the 95% confidence intervals. The dashed line is the y = x line which has a slope of one. The solid black line shows the linear regression line, which has a slope of 0.77 that is significantly different from one (y = x line, p=9.6×10−7).
Figure 3.
Figure 3.. Bayesian model comparison of effect sizes of IFNL-λ4 variants on viral load in the BOSON cohort (N = 411).
(a) Pretreatment viral load stratified by the host IFN-λ4 variants. The black dots and lines indicate the mean and 95% confidence interval (CI) for each group. (b) The posterior probability of the five tested models from (a) stacked on top of each other (from model 1 to model 5; posterior probabilities of models 1, 2 and 5 are too small to be labelled on this plot). Models where the posterior probability is higher or lower than the prior probability are coloured as dark grey and light grey respectively. Only model 3 (as indicated) has a posterior probability bigger than its prior probability and it assumes that the mean viral load is identical in IFN-λ4-Null and IFN-λ4-S70 groups and different from the mean viral load of IFN-λ4-P70 group. (c) Viral load stratified by the host IFNL-λ4 variants and the presence and absence of serine at the viral amino acid site 2414. The black dots and lines indicate the mean and 95% CI for each group. (d) The posterior probability of the 58 tested models from (c) stacked on top of each other (from model 1 to model 58; only models where the posterior probability is higher than the prior probability are labelled on this plot). Models where the posterior probability is higher or lower than the prior probability are coloured as dark grey and light grey respectively. Model 5 has the highest posterior probability and it assumes that the mean viral load is only different in ‘IFN-λ4-P70 + 2414 serine’ group and identical in other groups.
Figure 3—figure supplement 1.
Figure 3—figure supplement 1.. Pre-treatment viral load stratified by cohort.
The viral load in the EAP cohort (N = 74) is significantly lower than that of the BOSON cohort (N = 411) (p=1.49×10−13).
Author response image 1.
Author response image 1.
Author response image 2.
Author response image 2.
Author response image 3.
Author response image 3.

Comment in

References

    1. Abraham G, Inouye M. Fast principal component analysis of large-scale genome-wide data. PLOS ONE. 2014;9:e93766. doi: 10.1371/journal.pone.0093766. - DOI - PMC - PubMed
    1. Albecka A, Montserret R, Krey T, Tarr AW, Diesis E, Ball JK, Descamps V, Duverlie G, Rey F, Penin F, Dubuisson J. Identification of new functional regions in hepatitis C virus envelope glycoprotein E2. Journal of Virology. 2011;85:1777–1792. doi: 10.1128/JVI.02170-10. - DOI - PMC - PubMed
    1. Alhammad YM, Maharajh S, Butcher R, Eden JS, White PA, Poumbourios P, Drummer HE. Longitudinal sequence and functional evolution within glycoprotein E2 in hepatitis C virus genotype 3a infection. PLOS ONE. 2015;10:e0126397. doi: 10.1371/journal.pone.0126397. - DOI - PMC - PubMed
    1. Angulo J, Pino K, Echeverría-Chagas N, Marco C, Martínez-Valdebenito C, Galeno H, Villagra E, Vera L, Lagos N, Becerra N, Mora J, Bermúdez A, Cárcamo M, Díaz J, Miquel JF, Ferrés M, López-Lastra M. Association of Single-Nucleotide polymorphisms in IL28B, but not TNF-α, with severity of disease caused by andes virus. Clinical Infectious Diseases : An Official Publication of the Infectious Diseases Society of America. 2015;61:e62–e69. doi: 10.1093/cid/civ830. - DOI - PMC - PubMed
    1. Ansari MA, Pedergnana V, L C Ip C, Magri A, Von Delft A, Bonsall D, Chaturvedi N, Bartha I, Smith D, Nicholson G, McVean G, Trebes A, Piazza P, Fellay J, Cooke G, Foster GR, Hudson E, McLauchlan J, Simmonds P, Bowden R, Klenerman P, Barnes E, Spencer CCA, STOP-HCV Consortium Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus. Nature Genetics. 2017;49:666–673. doi: 10.1038/ng.3835. - DOI - PMC - PubMed

Publication types