Comparative Study

PLoS One. 2007 Aug 29;2(8):e814. doi: 10.1371/journal.pone.0000814.

Distinguishing functional amino acid covariation from background linkage disequilibrium in HIV protease and reverse transcriptase

Qi Wang¹, Christopher Lee

Affiliation

¹ Center for Computational Biology, Molecular Biology Institute, Institute for Genomics and Proteomics, University of California at Los Angeles, Los Angeles, United States of America.

PMID: 17726544
PMCID: PMC1950573
DOI: 10.1371/journal.pone.0000814


Qi Wang et al. PLoS One. 2007.


Abstract

Correlated amino acid mutation analysis has been widely used to infer functional interactions between different sites in a protein. However, this analysis can be confounded by important phylogenetic effects broadly classifiable as background linkage disequilibrium (BLD). We have systematically separated the covariation induced by selective interactions between amino acids from background LD, using synonymous (S) vs. amino acid (A) mutations. Covariation between two amino acid mutations, (A,A), can be affected by selective interactions between amino acids, whereas covariation within (A,S) pairs or (S,S) pairs cannot. Our analysis of the HIV pol gene, which includes the protease and reverse transcriptase genes, reveals that (A,A) covariation levels are far higher than those of either (A,S) or (S,S) pairs, and thus cannot be attributed to phylogenetic effects. The magnitude of these effects suggests that a large portion of (A,A) covariation in the HIV pol gene results from selective interactions. Inspection of the most prominent (A,A) interactions in the HIV pol gene showed that they occur at known sites of independently identified drug resistance mutations and physically cluster around the drug binding site. Moreover, the specific set of (A,A) interaction pairs was reproducible in different drug treatment studies, and vanished in untreated HIV samples. The (S,S) covariation curves measured a low but detectable level of background LD in HIV.
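The figure captions in this record report covariation as D′. As a rough illustration of how such a value can be obtained for one mutation pair, the sketch below computes the standard normalized linkage disequilibrium D′ (Lewontin's definition) from presence/absence indicators across aligned sequences. The function name `d_prime` and the input encoding are assumptions made for this example; the exact procedure in the paper's Materials and Methods may differ.

```python
def d_prime(has_x, has_y):
    """Signed Lewontin's D' for two mutations.

    has_x, has_y: equal-length sequences of 0/1 flags, one per sampled
    viral sequence, marking whether mutation x (or y) is present.
    This encoding is an assumption of the sketch, not the paper's format.
    """
    n = len(has_x)
    if n == 0 or n != len(has_y):
        raise ValueError("inputs must be non-empty and of equal length")

    p_x = sum(has_x) / n                       # frequency of mutation x
    p_y = sum(has_y) / n                       # frequency of mutation y
    p_xy = sum(1 for a, b in zip(has_x, has_y) if a and b) / n  # joint frequency

    d = p_xy - p_x * p_y                       # raw disequilibrium D
    if d >= 0:
        d_max = min(p_x * (1 - p_y), (1 - p_x) * p_y)
    else:
        d_max = min(p_x * p_y, (1 - p_x) * (1 - p_y))

    if d_max == 0:
        return 0.0                             # a site with no variation carries no signal
    return d / d_max
```

Applying the same function separately to (A,A), (A,S) and (S,S) pairs would give the three distributions that the abstract compares; only the (A,A) values can reflect selective interactions between amino acids.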

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

**Figure 1. Schema of Separating Selective Interactions from Background Linkage Disequilibrium (BLD).**
(A) Mutation covariation due to BLD. Covariation of mutation A and R (shown in multiple sequence alignment, right) is caused by co-inheritance of the two mutations from a common ancestor (shown in the phylogenetic tree, left). (B) Mutation covariation due to selective interactions. Relative fitness models for mutations x and y, the double mutant (xy), and wildtype (0). Two models are contrasted: top, independent (additive) fitness effects don't cause amino acid mutation covariation; bottom, selective interactions cause covariation of x and y. (C) Distinguishing BLD *vs.* fitness using pairs of amino acid mutations (A) and synonymous (S) mutations.

**Figure 2. (A,A) Covariation Is Dramatically Higher Than (A,S) and (S,S) Covariation in the Specialty Dataset.**
(A) Sliding window results of average D′. All mutation pairs, black; silent mutation pairs (S,S) only, green. Each sliding window contains 4% of the data points in the set. (B) Sliding window results of average D′. Amino acid mutation pairs (A,A), red; amino acid mutations to silent mutations (A,S), blue; silent mutation pairs (S,S), green. Each sliding window contains 2% of the data points in the set. (C–E) Plots of D′ against the physical distance (base) within the mutation pair for C) (A,A), D) (A,S) and E) (S,S).

**Figure 3. (A,A) Covariation Is Dramatically Higher than (A,S) and (S,S) Covariation in the Stanford-Treated Dataset but not the Stanford-Untreated Dataset.**
Sliding window results of average D′ in A) Stanford-Treated Dataset and B) Stanford-Untreated Dataset. Amino acid mutation pairs (A,A), red; amino acid mutations to silent mutations (A,S), blue; silent mutation pairs (S,S), green. Each sliding window contains 4% of the data points in the set.

**Figure 4. Amino Acid Mutation Pairs that Show Strong Covariation Are Close to Active Sites in RT.**
HIV-1 reverse transcriptase (RT) structure (PDB accession number 3HVTA) is shown using Protein Explorer (www.proteinexplorer.org). RT positions 41, 43 and 44, red; RT positions 67 and 70, green; RT positions 208, 210, 218 and 219, yellow; active-site residues 110, 185 and 186, magenta. The grey sphere cluster is nevirapine, a non-nucleoside reverse transcriptase inhibitor.

**Figure 5. The Covariation Maps of Three Different Types of Mutation Pairs in HIV Protease.**
The covariation maps of A) amino acid mutation pairs (A,A), B) amino acid mutations to silent mutations (A,S) and C) silent mutation pairs (S,S). The X and Y axes represent the codon positions in protease. Each cell represents the strongest covariation value (θ; see Materials and Methods) measured for any mutation pair of the designated type between the two positions. The strength of the covariation is depicted on a color scale, with yellow indicating covariation score (θ) larger than 1 and varying shades up to blue indicating covariation score (θ) larger than 5 (the covariation of two mutations is at least five times greater than random). White indicates no evidence of covariation.
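The captions for Figures 2 and 3 describe sliding-window averages of D′ in which each window holds a fixed percentage of the data points. A minimal sketch of that kind of smoothing follows. It assumes the pairs are ordered by physical distance and that the window advances one point at a time; neither detail is stated in this record, and the name `sliding_window_mean` is hypothetical.

```python
def sliding_window_mean(pairs, fraction=0.04):
    """Average D' in windows that each hold a fixed fraction of the points.

    pairs: list of (distance, d_prime) tuples, one per mutation pair.
    fraction: share of all points per window (0.04 mirrors the 4% in the
    captions; ordering by distance and a step of one point are assumptions).
    Returns a list of (mean_distance, mean_d_prime) tuples.
    """
    if not pairs:
        return []

    ordered = sorted(pairs, key=lambda p: p[0])        # order by distance
    size = max(1, round(len(ordered) * fraction))      # points per window

    averaged = []
    for start in range(len(ordered) - size + 1):
        window = ordered[start:start + size]
        mean_distance = sum(dist for dist, _ in window) / size
        mean_d_prime = sum(value for _, value in window) / size
        averaged.append((mean_distance, mean_d_prime))
    return averaged
```

Running this once per pair type, (A,A), (A,S) and (S,S), would yield three curves comparable in form to those described in the captions.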

Cited by

A multifaceted analysis of HIV-1 protease multidrug resistance phenotypes.
Doherty KM, Nakka P, King BM, Rhee SY, Holmes SP, Shafer RW, Radhakrishnan ML. BMC Bioinformatics. 2011 Dec 15;12:477. doi: 10.1186/1471-2105-12-477. PMID: 22172090.
Deep sequencing of protease inhibitor resistant HIV patient isolates reveals patterns of correlated mutations in Gag and protease.
Flynn WF, Chang MW, Tan Z, Oliveira G, Yuan J, Okulicz JF, Torbett BE, Levy RM. PLoS Comput Biol. 2015 Apr 20;11(4):e1004249. doi: 10.1371/journal.pcbi.1004249. PMID: 25894830.
Synthetic lethals in HIV: ways to avoid drug resistance : Running title: Preventing HIV resistance.
Petitjean M, Badel A, Veitia RA, Vanet A. Biol Direct. 2015 Apr 17;10:17. doi: 10.1186/s13062-015-0044-y. PMID: 25888435.
Correlated evolution of nearby residues in Drosophilid proteins.
Callahan B, Neher RA, Bachtrog D, Andolfatto P, Shraiman BI. PLoS Genet. 2011 Feb;7(2):e1001315. doi: 10.1371/journal.pgen.1001315. PMID: 21383965.
CoVaMa: Co-Variation Mapper for disequilibrium analysis of mutant loci in viral populations using next-generation sequence data.
Routh A, Chang MW, Okulicz JF, Johnson JE, Torbett BE. Methods. 2015 Dec;91:40-47. doi: 10.1016/j.ymeth.2015.09.021. PMID: 26408523.

Grants and funding

U54 RR021813/RR/NCRR NIH HHS/United States
