Skip to main navigation Skip to search Skip to main content

Novel similarity measures for the effective and efficient retrieval of pharmacological datasets

  • Oscar Miguel Rivera Borroto*
  • , Yoandy Hernández Díaz
  • , José Manuel García De La Vega
  • , Ricardo Del Corazón Grau Ábalo
  • , Yovani Marrero Ponce
  • *Corresponding author for this work
  • Universidad Central Marta Abreu de Las Villas
  • Universidad Autónoma de Madrid

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Similarity searching is an important facility in modern chemical information management systems to accede the rich information contained in currently enormous chemical repositories. Basically, given a molecular representation, a similarity measure, and a matching algorithm, the technique output returns an ordered list of dataset molecules in decreasing order of similarity with respect to a query or reference molecule specified by the user. As a consequence, researchers have put their interest in molecular representations and similarity measures performance. However, their studies have been predominantly focused in binary representations and the corresponding resemblance measures, and little work has been done taking into account other types of numerical description. Also, Machine Learning techniques have been applied for descriptor selection, though not consistently with the neighbourhood principle. These precedents, together with the need of new methods suitable for each chemical context, constitute the motivation for this work. It comprises the computational implementation, in the Java environment, and comparison of two novel measures of similarity to other proximity models established in the literature at effectively retrieving eight pharmacological datasets from Medicinal Chemistry, represented by machine learning-selected real descriptors, and some efficient matching algorithm.

Original languageEnglish
Pages (from-to)50-56
Number of pages7
JournalAfinidad
Volume68
Issue number551
StatePublished - Jan 2011
Externally publishedYes

Keywords

  • Machine learning descriptor selection
  • Medicinal chemistry datasets
  • Nearest neighbours
  • Similarity measures
  • Similarity search

Fingerprint

Dive into the research topics of 'Novel similarity measures for the effective and efficient retrieval of pharmacological datasets'. Together they form a unique fingerprint.

Cite this