TY - JOUR
T1 - Multi-server approach for high-throughput molecular descriptors calculation based on multi-linear algebraic maps
AU - García-Jacas, César R.
AU - Aguilera-Mendoza, Longendri
AU - González-Pérez, Reisel
AU - Marrero-Ponce, Yovani
AU - Acevedo-Martínez, Liesner
AU - Barigye, Stephen J.
AU - Avdeenko, Tatiana
N1 - Publisher Copyright:
© 2015 Wiley-VCH Verlag GmbH & Co. KGaA.
PY - 2015/1
Y1 - 2015/1
N2 - The present report introduces a novel module of the QuBiLS-MIDAS software for the distributed computation of the 3D Multi-Linear algebraic molecular indices. The main motivation for developing this module is to deal with the computational complexity experienced during the calculation of the descriptors over large datasets. To accomplish this task, a multi-server computing platform named Tarenal was developed, which is suited for institutions with many workstations interconnected through a local network and without resources particularly destined for computation tasks. This new system was deployed in 337 workstations and it was perfectly integrated with the QuBiLSMIDAS software. To illustrate the usability of the T-arenal platform, performance tests over a dataset comprised of 15000 compounds are carried out, yielding a 52 and 60 fold reduction in the sequential processing time for the 2-Linear and 3-Linear indices, respectively. Therefore, it can be stated that the T-arenal based distribution of computation tasks constitutes a suitable strategy for performing high-throughput calculations of 3D Multi-Linear descriptors over thousands of chemical structures for posterior QSAR and/or ADME-Tox studies.
AB - The present report introduces a novel module of the QuBiLS-MIDAS software for the distributed computation of the 3D Multi-Linear algebraic molecular indices. The main motivation for developing this module is to deal with the computational complexity experienced during the calculation of the descriptors over large datasets. To accomplish this task, a multi-server computing platform named Tarenal was developed, which is suited for institutions with many workstations interconnected through a local network and without resources particularly destined for computation tasks. This new system was deployed in 337 workstations and it was perfectly integrated with the QuBiLSMIDAS software. To illustrate the usability of the T-arenal platform, performance tests over a dataset comprised of 15000 compounds are carried out, yielding a 52 and 60 fold reduction in the sequential processing time for the 2-Linear and 3-Linear indices, respectively. Therefore, it can be stated that the T-arenal based distribution of computation tasks constitutes a suitable strategy for performing high-throughput calculations of 3D Multi-Linear descriptors over thousands of chemical structures for posterior QSAR and/or ADME-Tox studies.
KW - 3D N-linear algebraic descriptors
KW - Distributed computing system
KW - Multi-server architecture
KW - Platform of distributed tasks
KW - QuBiLS-MIDAS
KW - T-arenal
KW - TOMOCOMD-CARDD
UR - http://www.scopus.com/inward/record.url?scp=84921050627&partnerID=8YFLogxK
U2 - 10.1002/minf.201400086
DO - 10.1002/minf.201400086
M3 - Artículo
C2 - 27490863
AN - SCOPUS:84921050627
SN - 1868-1743
VL - 34
SP - 60
EP - 69
JO - Molecular Informatics
JF - Molecular Informatics
IS - 1
ER -