Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
- PMID: 37220419
- PMCID: PMC10278063
- DOI: 10.1021/acschembio.3c00159
Identification of Macrocyclic Peptide Families from Combinatorial Libraries Containing Noncanonical Amino Acids Using Cheminformatics and Bioinformatics Inspired Clustering
Abstract
In the past decade, macrocyclic peptides gained increasing interest as a new therapeutic modality to tackle intracellular and extracellular therapeutic targets that had been previously classified as "undruggable". Several technological advances have made discovering macrocyclic peptides against these targets possible: 1) the inclusion of noncanonical amino acids (NCAAs) into mRNA display, 2) increased availability of next generation sequencing (NGS), and 3) improvements in rapid peptide synthesis platforms. This type of directed-evolution based screening can produce large numbers of potential hit sequences given that DNA sequencing is the functional output of this platform. The current standard for selecting hit peptides from these selections for downstream follow-up relies on the frequency counting and sorting of unique peptide sequences which can result in the generation of false negatives due to technical reasons including low translation efficiency or other experimental factors. To overcome our inability to detect weakly enriched peptide sequences among our large data sets, we wanted to develop a clustering method that would enable the identification of peptide families. Unfortunately, utilizing traditional clustering algorithms, such as ClustalW, is not possible for this technology due to the incorporation of NCAAs in these libraries. Therefore, we developed a new atomistic clustering method with a Pairwise Aligned Peptide (PAP) chemical similarity metric to perform sequence alignments and identify macrocyclic peptide families. With this method, low enriched peptides, including isolated sequences (singletons), can now be clustered into families providing a comprehensive analysis of NGS data resulting from macrocycle discovery selections. Additionally, upon identification of a hit peptide with the desired activity, this clustering algorithm can be used to identify derivatives from the initial data set for structure-activity relationship (SAR) analysis without requiring additional selection experiments.
Conflict of interest statement
The authors declare the following competing financial interest(s): M-L.L, A.G., and C.N.C. are current employees of Genentech, Inc. and shareholders of Roche.
Figures





Similar articles
-
Ribosomal Synthesis of Macrocyclic Peptides with β2- and β2,3-Homo-Amino Acids for the Development of Natural Product-Like Combinatorial Libraries.ACS Chem Biol. 2021 Jun 18;16(6):1011-1018. doi: 10.1021/acschembio.1c00062. Epub 2021 May 19. ACS Chem Biol. 2021. PMID: 34008946
-
The RaPID Platform for the Discovery of Pseudo-Natural Macrocyclic Peptides.Acc Chem Res. 2021 Sep 21;54(18):3604-3617. doi: 10.1021/acs.accounts.1c00391. Epub 2021 Sep 10. Acc Chem Res. 2021. PMID: 34505781
-
Biosynthetic Strategies for Macrocyclic Peptides.Molecules. 2021 Jun 1;26(11):3338. doi: 10.3390/molecules26113338. Molecules. 2021. PMID: 34206124 Free PMC article. Review.
-
Diversification of Phage-Displayed Peptide Libraries with Noncanonical Amino Acid Mutagenesis and Chemical Modification.Chem Rev. 2024 May 8;124(9):6051-6077. doi: 10.1021/acs.chemrev.4c00004. Epub 2024 Apr 30. Chem Rev. 2024. PMID: 38686960 Free PMC article. Review.
-
Clustering of disulfide-rich peptides provides scaffolds for hit discovery by phage display: application to interleukin-23.BMC Bioinformatics. 2016 Nov 23;17(1):481. doi: 10.1186/s12859-016-1350-9. BMC Bioinformatics. 2016. PMID: 27881076 Free PMC article.
Cited by
-
New approaches for challenging therapeutic targets.Drug Discov Today. 2024 Apr;29(4):103942. doi: 10.1016/j.drudis.2024.103942. Epub 2024 Mar 5. Drug Discov Today. 2024. PMID: 38447929 Free PMC article. Review.
-
Reaching New Heights in Genetic Code Manipulation with High Throughput Screening.Chem Rev. 2024 Nov 13;124(21):12145-12175. doi: 10.1021/acs.chemrev.4c00329. Epub 2024 Oct 17. Chem Rev. 2024. PMID: 39418482 Review.
-
An mRNA Display Approach for Covalent Targeting of a Staphylococcus aureus Virulence Factor.J Am Chem Soc. 2025 Mar 12;147(10):8312-8325. doi: 10.1021/jacs.4c15713. Epub 2025 Feb 27. J Am Chem Soc. 2025. PMID: 40013487
-
An mRNA Display Approach for Covalent Targeting of a Staphylococcus aureus Virulence Factor.bioRxiv [Preprint]. 2024 Nov 8:2024.11.06.622387. doi: 10.1101/2024.11.06.622387. bioRxiv. 2024. Update in: J Am Chem Soc. 2025 Mar 12;147(10):8312-8325. doi: 10.1021/jacs.4c15713. PMID: 39574702 Free PMC article. Updated. Preprint.
References
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous