Optimum search strategies or novel 3D molecular descriptors: Is there a stalemate?

Yovani Marrero-Ponce, César R. García-Jacas, Stephen J. Barigye, José R. Valdés-Martiní, Oscar Miguel Rivera-Borroto, Ricardo W. Pino-Urias, Néstor Cubillán, Ysaías J. Alvarado, Huong Le-Thi-Thu

Producción científica: Contribución a una revistaArtículorevisión exhaustiva

15 Citas (Scopus)


The present manuscript describes a novel 3D-QSAR alignment free method (QuBiLS-MIDAS Duplex) based on algebraic bilinear, quadratic and linear forms on the kth two-tuple spatial-(dis)similarity matrix. Generalization schemes for the inter-atomic spatial distance using diverse (dis)-similarity measures are discussed. On the other hand, normalization approaches for the two-tuple spatial-(dis)similarity matrix by using simple-and double-stochastic and mutual probability schemes are introduced. With the aim of taking into consideration particular inter-atomic interactions in total or local-fragment indices, path and length cut-off constraints are used. Also, in order to generalize the use of the linear combination of atom-level indices to yield global (molecular) definitions, a set of aggregation operators (invariants) are applied. A Shannon’s entropy based variability study for the proposed 3D algebraic form-based indices and the DRAGON molecular descriptor families demonstrates superior performance for the former. A principal component analysis reveals that the novel indices codify structural information orthogonal to those captured by the DRAGON indices. Finally, a QSAR study for the binding affinity to the corticosteroid-binding globulin using Cramer’s steroid database is performed. From this study, it is revealed that the QuBiLS-MIDAS Duplex approach yields similar-to-superior performance statistics than all the 3D-QSAR methods reported in the literature reported so far, even with lower degree of freedom, using both the 31 steroids as the training set and the popular division of Cramer’s database in training [1-21] and test sets [22-31]. It is thus expected that this methodology provides useful tools for the diversity analysis of compound datasets and high-throughput screening structure–activity data.

Idioma originalInglés
Páginas (desde-hasta)533-564
Número de páginas32
PublicaciónCurrent Bioinformatics
EstadoPublicada - 1 dic. 2015
Publicado de forma externa


Profundice en los temas de investigación de 'Optimum search strategies or novel 3D molecular descriptors: Is there a stalemate?'. En conjunto forman una huella única.

Citar esto