ACS Omega. 2022 Sep 27;7(40):35532-35537.
doi: 10.1021/acsomega.2c02513. eCollection 2022 Oct 11.

Succinct Amyloid and Nonamyloid Patterns in Hexapeptides

László Keresztes et al. ACS Omega.

Abstract

Hexapeptides are widely used as a model system for studying the amyloid-forming properties of polypeptides, including proteins. Recently, large experimental databases with amyloidogenicity labels have become publicly available. Using these data sets for training and testing, one may build artificial intelligence (AI)-based classifiers that predict the amyloid state of peptides. In our previous work (Biomolecules 2021, 11, 500), we described the Support Vector Machine (SVM)-based Budapest Amyloid Predictor (https://pitgroup.org/bap). Here, we apply the Budapest Amyloid Predictor to discover numerous amyloidogenic and nonamyloidogenic hexapeptide patterns, with accuracy between 80% and 84%, as surprising and succinct novel rules for further understanding the amyloid state of peptides. For example, we show that for any independently mutated residue (position marked by "x"), the patterns CxFLWx, FxFLFx, and xxIVIV are predicted to be amyloidogenic, while PxDxxx, xxKxEx, and xxPQxx are predicted to be nonamyloidogenic. We note that each amyloidogenic pattern with two x's (e.g., CxFLWx) succinctly describes 20^2 = 400 hexapeptides, while each nonamyloidogenic pattern with four x's (e.g., PxDxxx) describes 20^4 = 160 000 hexapeptides. We also examine restricted substitutions at the "x" positions, drawn from subclasses of the proteinogenic amino acid residues; for example, if "x" is substituted only with hydrophobic amino acids, then there exist patterns containing three x's, such as MxVVxx, that are predicted to be amyloidogenic. If the x positions may take any hydrophobic amino acid except the "structure breaker" proline, then we obtain amyloid patterns with five x positions, for example, xxxFxx, each corresponding to 32 768 hexapeptides. To our knowledge, no similar applications of artificial intelligence tools and no similar succinct amyloid patterns have been described before the present work.
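
The pattern counts quoted in the abstract can be checked with a short sketch. It is only an illustration of the counting argument, not the Budapest Amyloid Predictor and not the authors' code: the names `AMINO_ACIDS`, `count_matches`, and `expand` are hypothetical helpers introduced here. The alphabet size of 8 used for the restricted case is inferred from the arithmetic (8^5 = 32 768); the abstract does not list which residues make up that class.

```python
from itertools import product

# The 20 proteinogenic amino acids as one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def count_matches(pattern: str, alphabet_size: int = len(AMINO_ACIDS)) -> int:
    """Number of hexapeptides a pattern describes.

    Each "x" can be filled independently, so the count is
    alphabet_size raised to the number of "x" positions.
    """
    return alphabet_size ** pattern.count("x")


def expand(pattern: str, alphabet: str = AMINO_ACIDS):
    """Yield every concrete peptide matching the pattern (hypothetical helper)."""
    # A fixed residue contributes a single choice; an "x" contributes the alphabet.
    slots = [alphabet if ch == "x" else ch for ch in pattern]
    for combo in product(*slots):
        yield "".join(combo)


if __name__ == "__main__":
    print(count_matches("CxFLWx"))      # two free positions, 20 choices each
    print(count_matches("PxDxxx"))      # four free positions, 20 choices each
    print(count_matches("xxxFxx", 8))   # five free positions, assumed 8-residue class
    print(next(expand("CxFLWx")))       # first concrete peptide in the expansion
```

Enumerating with `expand` is practical for the two-x patterns (400 peptides each); for the four-x patterns, `count_matches` gives the size without generating all 160 000 sequences.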


Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Examples of amyloid and nonamyloid patterns.

