Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jan;119(1):225-36.
doi: 10.1172/JCI37085. Epub 2008 Dec 22.

Genome-wide hepatitis C virus amino acid covariance networks can predict response to antiviral therapy in humans

Affiliations

Genome-wide hepatitis C virus amino acid covariance networks can predict response to antiviral therapy in humans

Rajeev Aurora et al. J Clin Invest. 2009 Jan.

Abstract

Hepatitis C virus (HCV) is a common RNA virus that causes hepatitis and liver cancer. Infection is treated with IFN-alpha and ribavirin, but this expensive and physically demanding therapy fails in half of patients. The genomic sequences of independent HCV isolates differ by approximately 10%, but the effects of this variation on the response to therapy are unknown. To address this question, we analyzed amino acid covariance within the full viral coding region of pretherapy HCV sequences from 94 participants in the Viral Resistance to Antiviral Therapy of Chronic Hepatitis C (Virahep-C) clinical study. Covarying positions were common and linked together into networks that differed by response to therapy. There were 3-fold more hydrophobic amino acid pairs in HCV from nonresponding patients, and these hydrophobic interactions were predicted to contribute to failure of therapy by stabilizing viral protein complexes. Using our analysis to detect patterns within the networks, we could predict the outcome of therapy with greater than 95% coverage and 100% accuracy, raising the possibility of a prognostic test to reduce therapeutic failures. Furthermore, the hub positions in the networks are attractive antiviral targets because of their genetic linkage with many other positions that we predict would suppress evolution of resistant variants. Finally, covariance network analysis could be applicable to any virus with sufficient genetic variation, including most human RNA viruses.

PubMed Disclaimer

Figures

Figure 1
Figure 1. The HCV ORF.
The approximately 9,600-nt positive-polarity HCV RNA genome encodes a long ORF. The ORF is translated into an approximately 3,010–amino acid polyprotein that is cleaved to 10 mature proteins. The known or predicted functions of the proteins are indicated.
Figure 2
Figure 2. The All covariance networks.
Each network is composed of 47 sequences per genotype, 1a (A and B) and 1b (C and D), totaling 94 sequences. (A and C) Networks formed by the covariances. The nodes represent covarying amino acid positions, and the edges represent covariances between the nodes. The most highly connected nodes are in yellow. (B and D) Edge distribution for the genotype networks.
Figure 3
Figure 3. The connectivity of the response-specific covariance networks is different.
Shown are 1a Marked (A and B) and 1a Poor (C and D) response classes. (A and C) Networks formed by the covariant pairs of residue positions. The most highly connected nodes are in yellow. (B and D) Positions that directly covary with position 463 (square node) are shown to highlight the differences between the networks.
Figure 4
Figure 4. Segregation of the edges and nodes by phenotype.
The overlap and segregation of the covarying nodes (A and B) and edges (C and D) by response class is shown for genotype 1a (A and C) and 1b (B and D).
Figure 5
Figure 5. Random shuffling of sequences causes loss of information in the response-specific networks.
We generated 10 independent alignments, in which 1a Marked sequences were randomly replaced with 1a Poor sequences at 3 levels of replacement: 2, 4, or 6 sequences, giving 12.5%, 25%, and 37.5% sequences shuffled, respectively. Shown are proportions of true positive covarying pairs relative to the unshuffled Marked sequences. P values showing significance of the differences in conserved edges relative to the unshuffled network were determined by Student’s t test.
Figure 6
Figure 6. Example subnetworks associated with outcome of anti-HCV therapy, with 100% accuracy and 0% false coverage rates, that are potential biomarkers for prediction of therapy outcome.
The subnetworks contain 1 or more covariances that together are found in the indicated fraction (coverage) of the appropriate response class and are never found in the opposing response class. Polyprotein residue numbers are shown for each subnetwork, and the identity of the mature HCV protein is indicated in subscript.

Comment in

Similar articles

Cited by

References

    1. Armstrong G.L., et al. The prevalence of hepatitis C virus infection in the United States, 1999 through 2002. Ann. Intern. Med. 2006;144:705–714. - PubMed
    1. Hadziyannis S.J., et al. Peginterferon-alpha2a and ribavirin combination therapy in chronic hepatitis C: a randomized study of treatment duration and ribavirin dose. Ann. Intern. Med. 2004;140:346–355. - PubMed
    1. Manns M.P., et al. Peginterferon alfa-2b plus ribavirin compared with interferon alfa-2b plus ribavirin for initial treatment of chronic hepatitis C: a randomised trial. Lancet. 2001;358:958–965. doi: 10.1016/S0140-6736(01)06102-5. - DOI - PubMed
    1. McHutchison J.G., et al. Interferon alfa-2b alone or in combination with ribavirin as initial treatment for chronic hepatitis C. N. Engl. J. Med. 1998;339:1485–1492. doi: 10.1056/NEJM199811193392101. - DOI - PubMed
    1. Poynard T., et al. Randomised trial of interferon alpha2b plus ribavirin for 48 weeks or for 24 weeks versus interferon alpha2b plus placebo for 48 weeks for treatment of chronic infection with hepatitis C virus. International Hepatitis Interventional Therapy Group (IHIT). Lancet. 1998;352:1426–1432. doi: 10.1016/S0140-6736(98)07124-4. - DOI - PubMed

Publication types

Substances