Skip to main navigation Skip to search Skip to main content

A comparative study of nonlinear machine learning for the "in silico" depiction of tyrosinase inhibitory activity from molecular structure

  • Huong Le-Thi-Thu*
  • , Yovani Marrero-Ponce
  • , Gerardo M. Casañola-Martin
  • , Gladys Casas Cardoso
  • , Maria Del Carmen Chávez
  • , María M. Garcia
  • , Carlos Morell
  • , Francisco Torrens
  • , Concepción Abad
  • *Corresponding author for this work
  • Universidad Central Marta Abreu de Las Villas
  • Universitat de València

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

In the preset report, for the first time, support vector machine (SVM), artificial neural network (ANN), Bayesian networks (BNs), k-nearest neighbor (k-NN) are applied and compared on two "in-house" datasets to describe the tyrosinase inhibitory activity from the molecular structure. The data set Data I is used for the identification of tyrosinase inhibitors (TIs) including 701 active and 728 inactive compounds. Data II consists of active chemicals for potency estimation of TIs. The 2D TOMOCOMD-CARDD atom-based quadratic indices are used as molecular descriptors. The derived models show rather encouraging results with the areas under the Receiver Operating Characteristic (AURC) curve in the test set above 0.943 and 0.846 for the Data I and Data II, respectively. Multiple comparison tests are carried out to compare the performance of the models and reveal the improvement of machine learning (ML) techniques with respect to statistical ones (see Chemometr. Intell. Lab. Syst. 2010, 104, 249). In some cases, these ameliorations are statistically significant. The tests also demostrate that k-NN, despite being a rather simple approach, presents the best behavior in both data. The obtained results suggest that the ML-based models could help to improve the virtual screening procedures and the confluence of these different techniques can increase the practicality of data mining procedures of chemical databases for the discovery of novel TIs as possible depigmenting agents.

Original languageEnglish
Pages (from-to)527-537
Number of pages11
JournalMolecular Informatics
Volume30
Issue number6-7
DOIs
StatePublished - Jun 2011
Externally publishedYes

Keywords

  • Atom-based quadratic index
  • Machine learning technique
  • Multiple comparison test
  • Tyrosinase inhibitor

Fingerprint

Dive into the research topics of 'A comparative study of nonlinear machine learning for the "in silico" depiction of tyrosinase inhibitory activity from molecular structure'. Together they form a unique fingerprint.

Cite this