Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 7;39(1):msab309.
doi: 10.1093/molbev/msab309.

A Mutation-Selection Model of Protein Evolution under Persistent Positive Selection

Affiliations

A Mutation-Selection Model of Protein Evolution under Persistent Positive Selection

Asif U Tamuri et al. Mol Biol Evol. .

Abstract

We use first principles of population genetics to model the evolution of proteins under persistent positive selection (PPS). PPS may occur when organisms are subjected to persistent environmental change, during adaptive radiations, or in host-pathogen interactions. Our mutation-selection model indicates protein evolution under PPS is an irreversible Markov process, and thus proteins under PPS show a strongly asymmetrical distribution of selection coefficients among amino acid substitutions. Our model shows the criteria ω>1 (where ω is the ratio of nonsynonymous over synonymous codon substitution rates) to detect positive selection is conservative and indeed arbitrary, because in real proteins many mutations are highly deleterious and are removed by selection even at positively selected sites. We use a penalized-likelihood implementation of the PPS model to successfully detect PPS in plant RuBisCO and influenza HA proteins. By directly estimating selection coefficients at protein sites, our inference procedure bypasses the need for using ω as a surrogate measure of selection and improves our ability to detect molecular adaptation in proteins.

Keywords: RuBisCO; cytochrome b; distribution of fitness effects; influenza; mutation–selection model; positive selection.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Analysis of proteins under the PPS mutation–selection model. (AA″) Estimates of ω at protein sites. (BB″) Distribution of selection coefficients among nonsynonymous substitutions. (CC″) Relationship between ω and average selection (S¯) at protein sites. Sites under PPS (Zk>0) are indicated in red in AA″ and CC″, and their contribution to the distribution of selection coefficients indicated in red in BB″. In CC′, the solid line is equation (1). The penalty on Zk is λ=0.001.
Fig. 2.
Fig. 2.
Relationship between Zk and evolutionary parameters for PPS sites in HA and rbcL. (A) Irreversibility index, Ik, versus Zk. The index is normalized to give the expected excess number of substitutions from detailed balance. (B) Site substitution rate, rk=iπkqii,k, versus Zk. Note the qij,k are scaled so that they give the relative rate with respect to a neutral sequence (Tamuri et al. 2014). Thus, if rk = 1, then the site evolves at the same rate as, say, a pseudogene. (C) Nonsynonymous rate, ωk versus Zk. The penalty on Zk is λ=0.001 in all cases.
Fig. 3.
Fig. 3.
Pattern of amino acid substitution in PPS sites of human influenza (H1N1) HA protein between 1918 and 2009. The penalty on Zk is λ=0.001. Each colored dot represents a particular amino acid as indicated in the legend.

References

    1. Benjamini Y, Hochberg Y.. 1995. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B. 57(1):289–300.
    1. Bulmer M. 1991. The selection-mutation-drift theory of synonymous codon usage. Genetics 129(3):897–907. - PMC - PubMed
    1. Cox DD, O’Sullivan F.. 1990. Asymptotic analysis of penalized likelihood and related estimators. Ann Statist. 18(4):1676–1695.
    1. Cox DR. 1962. Further results on tests of separate families of hypotheses. J R Stat Soc B. 24(2):406–424.
    1. De Maio N, Schrempf D, Kosiol C.. 2015. Pomo: an allele frequency-based approach for species tree estimation. Syst Biol. 64(6):1018–1031. - PMC - PubMed

Publication types