TY - JOUR
T1 - Nucleotide's bilinear indices
T2 - Novel bio-macromolecular descriptors for bioinformatics studies of nucleic acids. I. Prediction of paromomycin's affinity constant with HIV-1 Ψ-RNA packaging region
AU - Marrero-Ponce, Yovani
AU - Ortega-Broche, Sadiel E.
AU - Díaz, Yunaimy Echeverría
AU - Alvarado, Ysaias J.
AU - Cubillan, Nestor
AU - Cardoso, Gladys Casas
AU - Torrens, Francisco
AU - Pérez-Giménez, Facundo
N1 - Funding Information:
Sadiel Ortega-Broche (O-B. S) acknowledges to Bioinformatics Research Center of Central University “Marta Abreu” of Las Villas for kind hospitality during the 2006–2007. Yovani Marrero-Ponce (M-P. Y) thanks are given to the Valencia University, (Spain) for partial financial support as well as the program “Estades Temporals per an Investigadors Convidats” for a fellowship to work at Pharmacy Faculty (2009). M-P. Y also thanks support from Spanish MEC (Project Reference: SAF2006-04698). Finally, but not less, this work was supported in part by VLIR (Vlaamse InterUniversitaire Raad, Flemish Interuniversity Council, Belgium) under the IUC Program VLIR-UCLV.
PY - 2009/7/21
Y1 - 2009/7/21
N2 - A new set of nucleotide-based bio-macromolecular descriptors are presented. This novel approach to bio-macromolecular design from a linear algebra point of view is relevant to nucleic acids quantitative structure-activity relationship (QSAR) studies. These bio-macromolecular indices are based on the calculus of bilinear maps on ℜn [bmk (over(x, -)m, over(y, -)m) : ℜn × ℜn → ℜ] in canonical basis. Nucleic acid's bilinear indices are calculated from kth power of non-stochastic and stochastic nucleotide's graph-theoretic electronic-contact matrices, Mmk and s Mmk, respectively. That is to say, the kth non-stochastic and stochastic nucleic acid's bilinear indices are calculated using Mmk and s Mmk as matrix operators of bilinear transformations. Moreover, biochemical information is codified by using different pair combinations of nucleotide-base properties as weightings (experimental molar absorption coefficient ε{lunate}260 at 260 nm and pH=7.0, first (Δ E1) and second (Δ E2) single excitation energies in eV, and first (f1) and second (f2) oscillator strength values (of the first singlet excitation energies) of the nucleotide DNA-RNA bases. As example of this approach, an interaction study of the antibiotic paromomycin with the packaging region of the HIV-1 Ψ-RNA have been performed and it have been obtained several linear models in order to predict the interaction strength. The best linear model obtained by using non-stochastic bilinear indices explains about 91% of the variance of the experimental Log K (R=0.95 and s=0.08×10-4 M-1) as long as the best stochastic bilinear indices-based equation account for 93% of the Log K variance (R=0.97 and s=0.07×10-4 M-1). The leave-one-out (LOO) press statistics, evidenced high predictive ability of both models (q2=0.86 and scv=0.09×10-4 M-1 for non-stochastic and q2=0.91 and scv=0.08×10-4 M-1 for stochastic bilinear indices). The nucleic acid's bilinear indices-based models compared favorably with other nucleic acid's indices-based approaches reported nowadays. These models also permit the interpretation of the driving forces of the interaction process. In this sense, developed equations involve short-reaching (k≤3), middle-reaching (4
AB - A new set of nucleotide-based bio-macromolecular descriptors are presented. This novel approach to bio-macromolecular design from a linear algebra point of view is relevant to nucleic acids quantitative structure-activity relationship (QSAR) studies. These bio-macromolecular indices are based on the calculus of bilinear maps on ℜn [bmk (over(x, -)m, over(y, -)m) : ℜn × ℜn → ℜ] in canonical basis. Nucleic acid's bilinear indices are calculated from kth power of non-stochastic and stochastic nucleotide's graph-theoretic electronic-contact matrices, Mmk and s Mmk, respectively. That is to say, the kth non-stochastic and stochastic nucleic acid's bilinear indices are calculated using Mmk and s Mmk as matrix operators of bilinear transformations. Moreover, biochemical information is codified by using different pair combinations of nucleotide-base properties as weightings (experimental molar absorption coefficient ε{lunate}260 at 260 nm and pH=7.0, first (Δ E1) and second (Δ E2) single excitation energies in eV, and first (f1) and second (f2) oscillator strength values (of the first singlet excitation energies) of the nucleotide DNA-RNA bases. As example of this approach, an interaction study of the antibiotic paromomycin with the packaging region of the HIV-1 Ψ-RNA have been performed and it have been obtained several linear models in order to predict the interaction strength. The best linear model obtained by using non-stochastic bilinear indices explains about 91% of the variance of the experimental Log K (R=0.95 and s=0.08×10-4 M-1) as long as the best stochastic bilinear indices-based equation account for 93% of the Log K variance (R=0.97 and s=0.07×10-4 M-1). The leave-one-out (LOO) press statistics, evidenced high predictive ability of both models (q2=0.86 and scv=0.09×10-4 M-1 for non-stochastic and q2=0.91 and scv=0.08×10-4 M-1 for stochastic bilinear indices). The nucleic acid's bilinear indices-based models compared favorably with other nucleic acid's indices-based approaches reported nowadays. These models also permit the interpretation of the driving forces of the interaction process. In this sense, developed equations involve short-reaching (k≤3), middle-reaching (4
KW - Footprinting
KW - HIV-1 Ψ-RNA packaging region
KW - Multiple linear regression
KW - Nucleic acid and nucleotide bilinear indices
KW - Paromomycin
KW - QSPR
KW - TOMOCOMD-CANAR software
UR - http://www.scopus.com/inward/record.url?scp=67349167717&partnerID=8YFLogxK
U2 - 10.1016/j.jtbi.2009.02.021
DO - 10.1016/j.jtbi.2009.02.021
M3 - Artículo
C2 - 19272394
AN - SCOPUS:67349167717
SN - 0022-5193
VL - 259
SP - 229
EP - 241
JO - Journal of Theoretical Biology
JF - Journal of Theoretical Biology
IS - 2
ER -