TY - JOUR
T1 - Novel global and local 3D atom-based linear descriptors of the Minkowski distance matrix
T2 - theory, diversity–variability analysis and QSPR applications
AU - Cubillán, Néstor
AU - Marrero-Ponce, Yovani
AU - Ariza-Rico, Harold
AU - Barigye, Stephen J.
AU - García-Jacas, César R.
AU - Valdes-Martini, José R.
AU - Alvarado, Ysaías J.
N1 - Publisher Copyright:
© 2015, Springer International Publishing Switzerland.
PY - 2015/10/4
Y1 - 2015/10/4
N2 - A new family of alignment-free 3D descriptors based on TOMOCOMD-CARDD framework has been designed, namely 3D-linear indices. In this report, we have proposed the use of a generalized form of the geometric pairwise atom-atom distance matrix as structural information matrix. This matrix, denominated as non-stochastic, uses as matrix form of linear maps as well as their algebraic transformations: stochastic, double stochastic and mutual probabilities matrices. The methodology for 3D-QSAR studies is based on the combined use of global and local approaches. Principal component analysis reveals that the novel indices are capable of capturing structural information not codified by the indices implemented in the DRAGON’s software. Moreover, Shannon’s entropy based variability analysis comparing the 3D-linear indices with some relevant descriptors suggests that the former encode similar-to-better amount of structural information than these descriptors. Finally, a search for the best regressions for congeneric databases in QSPR modeling was performed. The overall results demonstrates satisfactory behavior.
AB - A new family of alignment-free 3D descriptors based on TOMOCOMD-CARDD framework has been designed, namely 3D-linear indices. In this report, we have proposed the use of a generalized form of the geometric pairwise atom-atom distance matrix as structural information matrix. This matrix, denominated as non-stochastic, uses as matrix form of linear maps as well as their algebraic transformations: stochastic, double stochastic and mutual probabilities matrices. The methodology for 3D-QSAR studies is based on the combined use of global and local approaches. Principal component analysis reveals that the novel indices are capable of capturing structural information not codified by the indices implemented in the DRAGON’s software. Moreover, Shannon’s entropy based variability analysis comparing the 3D-linear indices with some relevant descriptors suggests that the former encode similar-to-better amount of structural information than these descriptors. Finally, a search for the best regressions for congeneric databases in QSPR modeling was performed. The overall results demonstrates satisfactory behavior.
KW - 3D-linear index
KW - QSPR study
KW - TOMOCOMD-CARDD
KW - Variability analysis
UR - http://www.scopus.com/inward/record.url?scp=84940895558&partnerID=8YFLogxK
U2 - 10.1007/s10910-015-0533-3
DO - 10.1007/s10910-015-0533-3
M3 - Artículo
AN - SCOPUS:84940895558
SN - 0259-9791
VL - 53
SP - 2028
EP - 2064
JO - Journal of Mathematical Chemistry
JF - Journal of Mathematical Chemistry
IS - 9
ER -