TY - JOUR
T1 - Multiquery Similarity Searching Models
T2 - An Alternative Approach for Predicting Hemolytic Activity from Peptide Sequence
AU - Castillo-Mendieta, Kevin
AU - Agüero-Chapin, Guillermin
AU - Marquez, Edgar
AU - Perez-Castillo, Yunierkis
AU - Barigye, Stephen J.
AU - Pérez-Cárdenas, Mariela
AU - Peréz-Giménez, Facundo
AU - Marrero-Ponce, Yovani
N1 - Publisher Copyright:
© 2024 American Chemical Society.
PY - 2024/4/15
Y1 - 2024/4/15
N2 - The desirable pharmacological properties and a broad number of therapeutic activities have made peptides promising drugs over small organic molecules and antibody drugs. Nevertheless, toxic effects, such as hemolysis, have hampered the development of such promising drugs. Hence, a reliable computational tool to predict peptide hemolytic toxicity is enormously useful before synthesis and experimental evaluation. Currently, four web servers that predict hemolytic activity using machine learning (ML) algorithms are available; however, they exhibit some limitations, such as the need for a reliable negative set and limited application domain. Hence, we developed a robust model based on a novel theoretical approach that combines network science and a multiquery similarity searching (MQSS) method. A total of 1152 initial models were constructed from 144 scaffolds generated in a previous report. These were evaluated on external data sets, and the best models were fused and improved. Our best MQSS model I1 outperformed all state-of-the-art ML-based models and was used to characterize the prevalence of hemolytic toxicity on therapeutic peptides. Based on our model’s estimation, the number of hemolytic peptides might be 3.9-fold higher than the reported.
AB - The desirable pharmacological properties and a broad number of therapeutic activities have made peptides promising drugs over small organic molecules and antibody drugs. Nevertheless, toxic effects, such as hemolysis, have hampered the development of such promising drugs. Hence, a reliable computational tool to predict peptide hemolytic toxicity is enormously useful before synthesis and experimental evaluation. Currently, four web servers that predict hemolytic activity using machine learning (ML) algorithms are available; however, they exhibit some limitations, such as the need for a reliable negative set and limited application domain. Hence, we developed a robust model based on a novel theoretical approach that combines network science and a multiquery similarity searching (MQSS) method. A total of 1152 initial models were constructed from 144 scaffolds generated in a previous report. These were evaluated on external data sets, and the best models were fused and improved. Our best MQSS model I1 outperformed all state-of-the-art ML-based models and was used to characterize the prevalence of hemolytic toxicity on therapeutic peptides. Based on our model’s estimation, the number of hemolytic peptides might be 3.9-fold higher than the reported.
UR - http://www.scopus.com/inward/record.url?scp=85188239349&partnerID=8YFLogxK
U2 - 10.1021/acs.chemrestox.3c00408
DO - 10.1021/acs.chemrestox.3c00408
M3 - Artículo
AN - SCOPUS:85188239349
SN - 0893-228X
VL - 37
SP - 580
EP - 589
JO - Chemical Research in Toxicology
JF - Chemical Research in Toxicology
IS - 4
ER -