Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Nov 5;10(45):54389-54404.
doi: 10.1021/acsomega.5c07055. eCollection 2025 Nov 18.

Half-Space Proximal Networks (HSPNs): A Proxy for Multi-Query Similarity Searching Models Predicting Tumor-Homing Peptides

Affiliations

Half-Space Proximal Networks (HSPNs): A Proxy for Multi-Query Similarity Searching Models Predicting Tumor-Homing Peptides

Maylin Romero et al. ACS Omega. .

Abstract

Tumor-homing peptides (THPs) have emerged as promising agents in cancer treatments. These short sequences can specifically target tumor cells and vasculature. Here, a nontrained machine learning (ML) method based on network science and multiquery similarity searching to predict THPs is presented. We leverage the network-based representation of THPs' chemical space to extract valuable information by employing a novel similarity-based, yet sparse, network known as the half-space proximal network (HSPN). The HSPN of the THPs' giant component is composed of 12 communities that represent distinct modes of action and/or targets, as well as sequence templates (scaffolds). In the HSPN analysis, various centrality measures were employed to identify the most significant and nonredundant THPs. These central THPs were then used as queries (Qs) in group fusion similarity-based searches against an established collection of known THPs. The performance of the resulting multiquery similarity-based search models (MQSSMs) was assessed using three benchmarking datasets of THPs/non-THPs. The MQSSMs derived from the HSPNs (THP2) demonstrated superior discrimination performance compared to the classical chemical space networks (CSNs, namely THP1) when applied to the THPs/non-THPs datasets Remarkably, exceptional MCC values (>0.887) were achieved when utilizing Qs from both CSN and HSPN networks to construct MQSSMs (THP3), employing a similarity threshold of 0.6, in external datasets. Next, we conducted a statistical comparison between the performance of our top-performing MQSSM, THP3, and several THP prediction servers, including TumorHPD, THPep, SCMTHP, and NEPTUNE. Our proposed model demonstrated its superiority by surpassing the state-of-the-art supervised and trained ML methods for THP prediction with statistically significant differences. These results provide strong evidence that network-based similarity searches are highly effective and reliable for identifying THPs.

PubMed Disclaimer

Figures

1
1. General Approach Followed in the Study is Based on Three steps: 1) HSPN Construction and Analysis, and 2) MQSSMs Selection and Comparison
2
2. Group Fusion and Similarity Searching
1
1
Density, modularity, and average clustering coefficient (ACC) as functions of the similarity threshold applied to the 627 THPs projected with the CSN (blue) and HSPN (pink).
2
2
Density and average clustering coefficient (ACC) functions of similarity threshold applied to the 627 THPs projected with the HSPN.
3
3
Similarity of sparse networks of THPs: (A) HSPN giant components. (B) HSPN singletons (outliers). (C) HSPN singletons after 30% similarity extraction scaffold. In all networks, nodes are colored by their community membership and sized by their weighted degree.
4
4
Average ranks obtained by each method in the Friedman test. Friedman statistic (distributed according to chi-square with 6 degrees of freedom): 55.651786.

References

    1. WHO Fact-sheets of Cancer; https://www.who.int/news-room/fact-sheets/detail/cancer (Accessed 07 June 2022).
    1. Ferlay, J. ; Ervik, M. ; Lam, F. ; Colombet, M. ; Mery, L. ; Piñeros, M. ; Znaor, A. ; Soerjomataram, I. ; Bray, F. . Global Cancer Observatory: Cancer Today; International Agency for Research on Cancer: Lyon, France, https://gco.iarc.fr/today/about#references (Accessed 07 June 2022).
    1. Miller K. D., Nogueira L., Mariotto A. B., Rowland J. H., Yabroff K. R., Alfano C. M., Jemal A., Kramer J. L., Siegel R. L.. Cancer Treatment and Survivorship Statistics, 2019. Ca-Cancer J. Clin. 2019;69(5):363–385. doi: 10.3322/caac.21565. - DOI - PubMed
    1. Wu D., Gao Y., Qi Y., Chen L., Ma Y., Li Y.. Peptide-Based Cancer Therapy: Opportunity and Challenge. Cancer Lett. 2014;351(1):13–22. doi: 10.1016/j.canlet.2014.05.002. - DOI - PubMed
    1. Wei L., Zhou C., Chen H., Song J., Su R.. ACPred-FL: A Sequence-Based Predictor Using Effective Feature Representation to Improve the Prediction of Anti-Cancer Peptides. Bioinformatics. 2018;34(23):4007–4016. doi: 10.1093/bioinformatics/bty451. - DOI - PMC - PubMed

LinkOut - more resources