TY - JOUR
T1 - Chemometric and chemoinformatic analyses of anabolic and androgenic activities of testosterone and dihydrotestosterone analogues
AU - Alvarez-Ginarte, Yoanna María
AU - Crespo-Otero, Rachel
AU - Marrero-Ponce, Yovani
AU - Noheda-Marin, Pedro
AU - Garcia de la Vega, Jose Manuel
AU - Montero-Cabrera, Luis Alberto
AU - Ruiz García, José Alberto
AU - Caldera-Luzardo, José A.
AU - Alvarado, Ysaias J.
N1 - Funding Information:
This research was supported by the Center for Pharmaceutical Chemistry (CQF), Cuba and the Faculty of Chemistry, Universidad de La Habana, and computational facilities were provided by Deutscher Akademischer Austauschdienst (DAAD) in Bonn, Germany. The Universidad Autónoma de Madrid—Universidad de La Habana program under the auspices of CajaMadrid, Spain, also supported part of this work. One of the authors (M.-P.Y.) thanks the program ‘Estades Temporals per a Investigadors Convidats’ for a fellowship to work at Valencia University (2008). Finally, but very importantly, M.-P.Y. thanks the Flemish Interuniversity Council (VLIR) of Belgium for partial support of this research through a part of the fund of the project ‘Strengthening postgraduate education and research in Pharmaceutical Sciences’. Anonymous reviewers are gratefully acknowledged for their helpful suggestions that have led to improving the paper.
PY - 2008/6/15
Y1 - 2008/6/15
N2 - Predictive quantitative structure-activity relationship (QSAR) models of anabolic and androgenic activities for the testosterone and dihydrotestosterone steroid analogues were obtained by means of multiple linear regression using quantum and physicochemical molecular descriptors (MD) as well as a genetic algorithm for the selection of the best subset of variables. Quantitative models found for describing the anabolic (androgenic) activity are significant from a statistical point of view: R2 of 0.84 (0.72 and 0.70). A leave-one-out cross-validation procedure revealed that the regression models had a fairly good predictability [q2 of 0.80 (0.60 and 0.59)]. In addition, other QSAR models were developed to predict anabolic/androgenic (A/A) ratios and the best regression equation explains 68% of the variance for the experimental values of AA ratio and has a rather adequate q2 of 0.51. External validation, by using test sets, was also used in each experiment in order to evaluate the predictive power of the obtained models. The result shows that these QSARs have quite good predictive abilities (R2 of 0.90, 0.72 (0.55), and 0.53) for anabolic activity, androgenic activity, and A/A ratios, respectively. Last, a Williams plot was used in order to define the domain of applicability of the models as a squared area within ±2 band for residuals and a leverage threshold of h = 0.16. No apparent outliers were detected and the models can be used with high accuracy in this applicability domain. MDs included in our QSAR models allow the structural interpretation of the biological process, evidencing the main role of the shape of molecules, hydrophobicity, and electronic properties. Attempts were made to include lipophilicity (octanol-water partition coefficient (log P)) and electronic (hardness (η)) values of the whole molecules in the multivariate relations. It was found from the study that the log P of molecules has positive contribution to the anabolic and androgenic activities and high values of η produce unfavorable effects. The found MDs can also be efficiently used in similarity studies based on cluster analysis. Our model for the anabolic/androgenic ratio (expressed by weight of levator ani muscle, LA, and seminal vesicle, SV, in mice) predicts that the 2-aminomethylene-17α-methyl-17β-hydroxy-5α-androstan-3- one (43) compound is the most potent anabolic steroid, and the 17α-methyl-2β,17β-dihydroxy-5α-androstane (31) compound is the least potent one of this series. The approach described in this report is an alternative for the discovery and optimization of leading anabolic compounds among steroids and analogues. It also gives an important role to electron exchange terms of molecular interactions to this kind of steroid activity.
AB - Predictive quantitative structure-activity relationship (QSAR) models of anabolic and androgenic activities for the testosterone and dihydrotestosterone steroid analogues were obtained by means of multiple linear regression using quantum and physicochemical molecular descriptors (MD) as well as a genetic algorithm for the selection of the best subset of variables. Quantitative models found for describing the anabolic (androgenic) activity are significant from a statistical point of view: R2 of 0.84 (0.72 and 0.70). A leave-one-out cross-validation procedure revealed that the regression models had a fairly good predictability [q2 of 0.80 (0.60 and 0.59)]. In addition, other QSAR models were developed to predict anabolic/androgenic (A/A) ratios and the best regression equation explains 68% of the variance for the experimental values of AA ratio and has a rather adequate q2 of 0.51. External validation, by using test sets, was also used in each experiment in order to evaluate the predictive power of the obtained models. The result shows that these QSARs have quite good predictive abilities (R2 of 0.90, 0.72 (0.55), and 0.53) for anabolic activity, androgenic activity, and A/A ratios, respectively. Last, a Williams plot was used in order to define the domain of applicability of the models as a squared area within ±2 band for residuals and a leverage threshold of h = 0.16. No apparent outliers were detected and the models can be used with high accuracy in this applicability domain. MDs included in our QSAR models allow the structural interpretation of the biological process, evidencing the main role of the shape of molecules, hydrophobicity, and electronic properties. Attempts were made to include lipophilicity (octanol-water partition coefficient (log P)) and electronic (hardness (η)) values of the whole molecules in the multivariate relations. It was found from the study that the log P of molecules has positive contribution to the anabolic and androgenic activities and high values of η produce unfavorable effects. The found MDs can also be efficiently used in similarity studies based on cluster analysis. Our model for the anabolic/androgenic ratio (expressed by weight of levator ani muscle, LA, and seminal vesicle, SV, in mice) predicts that the 2-aminomethylene-17α-methyl-17β-hydroxy-5α-androstan-3- one (43) compound is the most potent anabolic steroid, and the 17α-methyl-2β,17β-dihydroxy-5α-androstane (31) compound is the least potent one of this series. The approach described in this report is an alternative for the discovery and optimization of leading anabolic compounds among steroids and analogues. It also gives an important role to electron exchange terms of molecular interactions to this kind of steroid activity.
KW - Anabolic and androgenic activities
KW - Genetic algorithm
KW - QSAR model
KW - Quantum and physicochemical molecular descriptor
KW - Testosterone and dihydrotestosterone steroid analogues
UR - http://www.scopus.com/inward/record.url?scp=44849101437&partnerID=8YFLogxK
U2 - 10.1016/j.bmc.2008.04.001
DO - 10.1016/j.bmc.2008.04.001
M3 - Artículo
C2 - 18514531
AN - SCOPUS:44849101437
SN - 0968-0896
VL - 16
SP - 6448
EP - 6459
JO - Bioorganic and Medicinal Chemistry
JF - Bioorganic and Medicinal Chemistry
IS - 12
ER -