Skip to main navigation Skip to search Skip to main content

Half-Space Proximal Networks (HSPNs): A Proxy for Multi-Query Similarity Searching Models Predicting Tumor-Homing Peptides

  • Maylin Romero
  • , Yovani Marrero-Ponce*
  • , Felix Martinez-Rios
  • , Guillermin Agüero-Chapin
  • , Longendri Aguilera-Mendoza
  • , Edgar Chavez
  • , Edgar A. Márquez
  • , Noel Pérez-Pérez
  • , José R. Mora
  • , Ernesto Contreras-Torres
  • , Stephen J. Barigye
  • *Corresponding author for this work
  • Universidad Yachay Tech
  • Universidad Panamericana (UP)
  • Universitat de València
  • University of Porto
  • Faculdade de Ciências da Universidade do Porto
  • Universidad San Francisco de Quito
  • Centro de Investigación Científicay de Educación Superior de Ensenada (CICESE)
  • Universidad del Norte
  • Universidad Autónoma de Madrid
  • McGill University

Research output: Contribution to journalArticlepeer-review

Abstract

Tumor-homing peptides (THPs) have emerged as promising agents in cancer treatments. These short sequences can specifically target tumor cells and vasculature. Here, a nontrained machine learning (ML) method based on network science and multiquery similarity searching to predict THPs is presented. We leverage the network-based representation of THPs’ chemical space to extract valuable information by employing a novel similarity-based, yet sparse, network known as the half-space proximal network (HSPN). The HSPN of the THPs’ giant component is composed of 12 communities that represent distinct modes of action and/or targets, as well as sequence templates (scaffolds). In the HSPN analysis, various centrality measures were employed to identify the most significant and nonredundant THPs. These central THPs were then used as queries (Qs) in group fusion similarity-based searches against an established collection of known THPs. The performance of the resulting multiquery similarity-based search models (MQSSMs) was assessed using three benchmarking datasets of THPs/non-THPs. The MQSSMs derived from the HSPNs (THP2) demonstrated superior discrimination performance compared to the classical chemical space networks (CSNs, namely THP1) when applied to the THPs/non-THPs datasets Remarkably, exceptional MCC values (>0.887) were achieved when utilizing Qs from both CSN and HSPN networks to construct MQSSMs (THP3), employing a similarity threshold of 0.6, in external datasets. Next, we conducted a statistical comparison between the performance of our top-performing MQSSM, THP3, and several THP prediction servers, including TumorHPD, THPep, SCMTHP, and NEPTUNE. Our proposed model demonstrated its superiority by surpassing the state-of-the-art supervised and trained ML methods for THP prediction with statistically significant differences. These results provide strong evidence that network-based similarity searches are highly effective and reliable for identifying THPs.

Original languageEnglish
Pages (from-to)54389-54404
Number of pages16
JournalACS Omega
Volume10
Issue number45
DOIs
StatePublished - 18 Nov 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Fingerprint

Dive into the research topics of 'Half-Space Proximal Networks (HSPNs): A Proxy for Multi-Query Similarity Searching Models Predicting Tumor-Homing Peptides'. Together they form a unique fingerprint.

Cite this