The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins
- PMID: 38747388
- PMCID: PMC11094778
- DOI: 10.1002/pro.5011
The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins
Abstract
A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the consensus protein derived from our full Ribonuclease H sequence alignment is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted set of sequences is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order sequence correlations using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.
Keywords: ancestral sequence reconstruction; consensus design; protein folding; protein stability; singular value decomposition.
© 2024 The Protein Society.
Figures









Update of
-
The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins.bioRxiv [Preprint]. 2023 Jul 1:2023.06.29.547063. doi: 10.1101/2023.06.29.547063. bioRxiv. 2023. Update in: Protein Sci. 2024 Jun;33(6):e5011. doi: 10.1002/pro.5011. PMID: 37425932 Free PMC article. Updated. Preprint.
References
-
- Casari G, Sander C, Valencia A. A method to predict functional residues in proteins. Nat Struct Biol. 1995;22(2):171–178. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources