Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Sep;82(18):9228-35.
doi: 10.1128/JVI.01535-07. Epub 2008 Jul 2.

Analysis of natural sequence variation and covariation in human immunodeficiency virus type 1 integrase

Affiliations

Analysis of natural sequence variation and covariation in human immunodeficiency virus type 1 integrase

Richard E Myers et al. J Virol. 2008 Sep.

Abstract

Human immunodeficiency virus type 1 (HIV-1) integrase inhibitors are in clinical trials, and raltegravir and elvitegravir are likely to be the first licensed drugs of this novel class of HIV antivirals. Understanding resistance to these inhibitors is important to maximize their efficacy. It has been shown that natural variation and covariation provide valuable insights into the development of resistance for established HIV inhibitors. Therefore, we have undertaken a study to fully characterize natural polymorphisms and amino acid covariation within an inhibitor-naïve sequence set spanning all defined HIV-1 subtypes. Inter- and intrasubtype variation was greatest in a 50-amino-acid segment of HIV-1 integrase incorporating the catalytic aspartic acid codon 116, suggesting that polymorphisms affect inhibitor binding and pathways to resistance. The critical mutations that determine the resistance pathways to raltegravir and elvitegravir (N155H, Q148K/R/H, and E92Q) were either rare or absent from the 1,165-sequence data set. However, 25 out of 41 mutations associated with integrase inhibitor resistance were present. These mutations were not subtype associated and were more prevalent in the subtypes that had been sampled frequently within the database. A novel modification of the Jaccard index was used to analyze amino acid covariation within HIV-1 integrase. A network of 10 covarying resistance-associated mutations was elucidated, along with a further 15 previously undescribed mutations that covaried with at least two of the resistance positions. The validation of covariation as a predictive tool will be dependent on monitoring the evolution of HIV-1 integrase under drug selection pressure.

PubMed Disclaimer

Figures

FIG. 1.
FIG. 1.
Amino acid variation in HIV-1 integrase. Subtype-specific alignments of HIV-1 were converted into frequency-based PSSMs, such that the amino acid diversity within each position of the subtype alignments was represented as a fraction of one. The variation within each position of a subtype PSSM was calculated relative to the equivalent position in the subtype B PSSM, and these values were averaged across a sliding window of 20 amino acid positions, with a step interval of 10.
FIG. 2.
FIG. 2.
Natural occurrence of HIV-1 integrase mutations by subtype (a) and inhibitor (b). (a) The frequency of integrase mutations associated with resistance to inhibitors (defined in Table 1) was calculated for the alignments of HIV-1 group M, N, and O subtypes and CRF_0101 and CRF_02 independently and within the whole 1,165-sequence data set (all). Multiple instances of differing amino acids associated with resistance at the same position were represented. White was used to indicate the absence of a resistance mutation in each alignment. The frequency of resistance mutations, if present within an alignment, was represented using a colored scale. (b) HIV-1 polymorphisms that have been associated with integrase inhibitor resistance were represented in black for primary mutations and gray for mutations associated with HIV-1 integrase inhibitor resistance (Table 1).
FIG. 3.
FIG. 3.
Neighbor-joining phylogenies of HIV-1 integrase subtype B sequences. (a) Gray dots were used to indicate the positions within the phylogeny that contained amino acid polymorphisms at 72 and 201. (b and c) Black dots indicate sequences that contained polymorphisms at positions 154 and 201. Phylogenies were generated using either real subtype B sequences from the Los Alamos database (n = 189) (a and b) or from a random sequence data set generated by permuting the original subtype B sequences (n = 1,000) (c).
FIG. 4.
FIG. 4.
(a) Correlation between original and corrected Z scores (Z and ZD, respectively). Statistically significant amino acid covarying pairs (black diamonds) were distinguished from nonsignificant results (gray diamonds) using the corrected ZD score, FDR, and a probability threshold of P < 0.001. The correlation coefficient between Z and ZD was low (r2 = 0.63). (b) Simulations of sequences constrained by the distance scaling factor D. Six sets of 1,000 simulated sequences were analyzed for covarying amino acids, where D was constrained between 0.1 to 0.3, 0.2 to 0.4, 0.3 to 0.5, 0.4 to 0.6, 0.5 to 0.7, and 0.6 to 0.8 within a set of sequences. The ZD and Z scores were plotted against each other for each set, and a binomial line of best fit was calculated. The data generated using subtype B HIV-1 integrase sequences are shown as both a scatter plot and a liner of best fit. (c) Relationship between the Z score and P value for simulated data sets. FDR-corrected P values were estimated from the ZD scores relating to each of the six simulated data sets. The minimum ZD score required to produce a P value lower than a range of P values was plotted (the average D of six data sets is shown as a solid line; D = 0.1 to 0.3, dashed line; D = 0.6 to 0.8, dotted line).
FIG. 5.
FIG. 5.
Network of selected covarying amino acids within HIV-1 integrase. The relationships between polymorphisms associated with resistance to HIV-1 integrase inhibitors defined in Table 1 (white circles, black lines) were calculated. Circular nodes were used to represent an amino acid position, and edges (lines) joining two nodes indicate that those two amino acids covaried. The network was expanded to include amino acids that previously have not been considered resistance mutations (gray circles, gray lines). Additional integrase positions were included in the network if a new node had two more edges connecting to an established resistance-associated node. There were two instances in which two resistance-associated amino acid positions were linked by multiple novel amino acid positions; these nodes were represented by letters that relate to these amino acid positions.
FIG. 6.
FIG. 6.
Crystallographic structure of the HIV-1 integrase tetramer (chains A to D) containing the catalytic core domain and the N-terminal domain (Protein Database accession no. 1K6Y). Two chains (B, C) are colored (red, high diversity; dark blue, low diversity) using the subtype diversity measure (Fig. 1; also see Materials and Methods) and show the average diversity of windows of 5 amino acids within the integrase sequence. The remaining chains (A, D) are colored gray. Residues forming the active-site triad, Asp 64, Asp 112, and Glu 152, are shown as space-filled atoms and are annotated on the integrase B chain. The positions of the N-terminal domains within the crystal structure are shown as the chain N; however, the linker between the N-terminal domain and the catalytic core was not resolved. The C-terminal domains of HIV-1 integrase also were not present within this structure.

Similar articles

Cited by

References

    1. Billich, A. 2003. S-1360 Shionogi-GlaxoSmithKline. Curr. Opin. Investig. Drugs 4206-209. - PubMed
    1. Brown, A. J., B. T. Korber, and J. H. Condra. 1999. Associations between amino acids in the evolution of HIV type 1 protease sequences under indinavir therapy. AIDS Res. Hum Retrovir. 15247-253. - PubMed
    1. D'Aquila, R. T., J. M. Schapiro, F. Brun-Vezinet, B. Clotet, B. Conway, L. M. Demeter, R. M. Grant, V. A. Johnson, D. R. Kuritzkes, C. Loveday, R. W. Shafer, and D. D. Richman. 2003. Drug resistance mutations in HIV-1. Top. HIV Med. 1192-96. - PubMed
    1. Embrey, M. W., J. S. Wai, T. W. Funk, C. F. Homnick, D. S. Perlow, S. D. Young, J. P. Vacca, D. J. Hazuda, P. J. Felock, K. A. Stillmock, M. V. Witmer, G. Moyer, W. A. Schleif, L. J. Gabryelski, L. Jin, I. W. Chen, J. D. Ellis, B. K. Wong, J. H. Lin, Y. M. Leonard, N. N. Tsou, and L. Zhuang. 2005. A series of 5-(5,6)-dihydrouracil substituted 8-hydroxy-[1,6]naphthyridine-7-carboxylic acid 4-fluorobenzylamide inhibitors of HIV-1 integrase and viral replication in cells. Bioorg. Med. Chem. Lett. 154550-4554. - PubMed
    1. Engelman, A., F. D. Bushman, and R. Craigie. 1993. Identification of discrete functional domains of HIV-1 integrase and their organization within an active multimeric complex. EMBO J. 123269-3275. - PMC - PubMed

Publication types