Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep 3;4(9):3567-3580.
doi: 10.1021/jacsau.4c00501. eCollection 2024 Sep 23.

Aggregation Rules of Short Peptides

Affiliations

Aggregation Rules of Short Peptides

Jiaqi Wang et al. JACS Au. .

Abstract

The elucidation of aggregation rules for short peptides (e.g., tetrapeptides and pentapeptides) is crucial for the precise manipulation of aggregation. In this study, we derive comprehensive aggregation rules for tetrapeptides and pentapeptides across the entire sequence space based on the aggregation propensity values predicted by a transformer-based deep learning model. Our analysis focuses on three quantitative aspects. First, we investigate the type and positional effects of amino acids on aggregation, considering both the first- and second-order contributions. By identifying specific amino acids and amino acid pairs that promote or attenuate aggregation, we gain insights into the underlying aggregation mechanisms. Second, we explore the transferability of aggregation propensities between tetrapeptides and pentapeptides, aiming to explore the possibility of enhancing or mitigating aggregation by concatenating or removing specific amino acids at the termini. Finally, we evaluate the aggregation morphologies of over 20,000 tetrapeptides, regarding the morphology distribution and type and positional contributions of each amino acid. This work extends the existing aggregation rules from tripeptide sequences to millions of tetrapeptide and pentapeptide sequences, offering experimentalists an explicit roadmap for fine-tuning the aggregation behavior of short peptides for diverse applications, including hydrogels, emulsions, or pharmaceuticals.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing financial interest.

Figures

Figure 1
Figure 1
Analyses of the aggregation rules of short peptides from three perspectives. A deep learning-based TRN model predicts the AP values of tetrapeptides (APte) and pentapeptides (APpe) across the entire sequence space (i.e., 160,000 tetrapeptides and 3,200,000 pentapeptides). Based on the predicted AP values, we analyze the (a) type and positional contribution of 20 amino acids to the AP, (b) transferability relationship between the AP of a pentapeptide and the averaged AP of two corresponding tetrapeptides (APpe versus APave), and (c) distribution of the morphologies of aggregated tetrapeptides as well as the type and positional contributions of 20 amino acids to different morphologies.
Figure 2
Figure 2
Prediction and distribution of APte (and APte′) and APpe (and APpe′). (a,b) Predicted AP values of tetrapeptides (APte) and pentapeptides (APpe) compared to simulation-generated AP of tetrapeptides (APsim,te) and pentapeptides (APsim,pe). (c,d) Violin distribution of APte′ (normalized APte) and APpe′ (normalized APpe) within four ranges of A ∈ [0.00,0.25), B ∈ [0.25,0.50), C ∈ [0.50,0.75), and D ∈ [0.75,1.00), with the number of peptides counted in each range. The purple and yellow dots overlapping the “Total” distribution in (c) indicate the comparison results with experimental TEM images, i.e., consistent and inconsistent on a qualitative level, respectively.
Figure 3
Figure 3
Aggregation rules in terms of individual amino acids (i.e., first-order aggregation rules). (a,b) Percentage of each amino acid at each position within the pool of aggregating peptides (i.e., the normalized AP values within the range D ∈ [0.75, 1.00]) for (a) tetrapeptides and (b) pentapeptides. (c,d) Averaged AP values of peptides with a fixed amino acid at a specific position for (c) tetrapeptides and (d) pentapeptides. (e,f) Averaged AP values of peptides containing (e) 1–4 and (f) 1–5 specific amino acids.
Figure 4
Figure 4
Aggregation rules in terms of 400 amino acid pairs (i.e., second-order aggregation rules). (a,b) Percentage of amino acid pairs in the high AP range of D (AP′ ∈ [0.75, 1.00]) for (a) tetrapeptides and (b) pentapeptides. (c,d) Averaged AP values of (c) tetrapeptides and (d) pentapeptides with amino acid pairs fixed at specific “positions”.
Figure 5
Figure 5
Transferability relationship between APpe′ and averaged APave′ (APave′ are the average of two APte′). (a) Relationship between APpe′ and APave′, with the color indicating the difference ε between APpe′ and the averaged APave′ (ε = APpe′ – APave′). (b) Violin distribution (left) and number distribution (right) of ε. (c–e) AP values of 10 groups of peptides with positive maximum ε between APpe and APave. (f–h) AP values of the 10 groups of peptides with negative maximum ε between APpe and APave.
Figure 6
Figure 6
Computational and experimental results of AP and morphologies of peptide Aβ16–22 with the addition of amino acids DD and DDD at the N-terminus. The table lists the attribution of each amino acid with the increasing number of D, as well as the predicted (APPRD) and simulation AP (APSIM).
Figure 7
Figure 7
Distribution of morphologies after aggregation. (a–c) Possible morphologies formed in simulations, such as fibers or tubes, intermediate structures such as net and curved sheet, and spherical or vesicle structures. (d) Violin distributions of morphologies. (e–g) Positional percentage of each amino acid within each RMOI range, calculated through dividing the number of each amino acid at each position within a RMOI range, by the total number of the amino acid at the position across all ranges.

Similar articles

Cited by

References

    1. Zapadka K. L.; Becher F. J.; Gomes dos Santos A.; Jackson S. E. Factors affecting the physical stability (aggregation) of peptide therapeutics. Interface Focus 2017, 7 (6), 20170030.10.1098/rsfs.2017.0030. - DOI - PMC - PubMed
    1. Nguyen P. H.; Sterpone F.; Derreumaux P. Aggregation of disease-related peptides. Prog. Mol. Biol. Transl. Sci. 2020, 170, 435–460. 10.1016/bs.pmbts.2019.12.002. - DOI - PubMed
    1. Tao K.; Makam P.; Aizen R.; Gazit E. Self-assembling peptide semiconductors. Science 2017, 358 (6365), eaam975610.1126/science.aam9756. - DOI - PMC - PubMed
    1. Nguyen T. P.; Easley A. D.; Kang N.; Khan S.; Lim S.-M.; Rezenom Y. H.; Wang S.; Tran D. K.; Fan J.; Letteri R. A.; et al. Polypeptide organic radical batteries. Nature 2021, 593 (7857), 61–66. 10.1038/s41586-021-03399-1. - DOI - PubMed
    1. Wen Q.; Zhang Y.; Li C.; Ling S.; Yang X.; Chen G.; Yang Y.; Wang Q. NIR-II fluorescent self-assembled peptide nanochain for ultrasensitive detection of peritoneal metastasis. Angew. Chem. 2019, 131 (32), 11117–11122. 10.1002/ange.201905643. - DOI - PubMed

LinkOut - more resources