How the energy evaluation method used in the geometry optimization step affect the quality of the subsequent QSAR/QSPR models (original) (raw)

2009, Journal of Computer-Aided Molecular Design

The quantitative influence of the choice of energy evaluation method used in the geometry optimization step prior to the calculation of molecular descriptors in QSAR and QSPR models was investigated. A total of 11 energy evaluation methods on three molecular datasets (toxicological compounds, aromatic compounds and PPARc agonists) were studied. The methods employed were: MMFF94 s, MM3* with e r (relative dielectric constant) = 1, MM3* with e r = 80, AM1, PM3, HF/STO-3G, HF/6-31G, HF/6-31G(d,p), B3LYP/STO-3G, B3LYP/6-31G, and B3LYP/6-31G(d,p). The 3D-descriptors used in the QSAR/ QSPR models were calculated with commercially available molecular descriptor programs primarily directed toward pharmaceutical research. In order to evaluate the uncertainties involved in the QSAR/QSPR predictions bootstrapping was used to validate all models using 1,000 drawings for each data set. The scale free error-term, q 2 , was used to compare the relative quality of the models resulting from different optimization methods on the same set of molecules. Depending on the dataset, the average 0.632 bootstrap estimated q 2 varies from 0.55 to 0.57 for the toxicological compounds, from 0.58 to 0.62 for the aromatic compounds, and from 0.69 to 0.75 for the PPARc agonists. The B3LYP/6-31G(d,p) provided the best overall results, albeit the increase in q 2 was small in all cases. The results clearly indicate that the choice of the energy evaluation method has very limited impact. This study suggests that QSAR or QSPR studies might benefit from the choice of a rapid optimization method with little or no loss in model accuracy. Keywords QSAR Á QSPR Á Energy evaluation Á PLS regression Á Quantum mechanics Á Semi-empirical Á Molecular mechanics Abbreviations QSPR Quantitative structure property relationship QSAR Quantitative structure activity relationship MM3* Allinger's molecular mechanics AM1 Austin model 1 PM3 Parameterized model 3, HF, Hartree-Fock B3LYP The hybrid exchange-correlation functional based on work from Becke, Lee, Yang and Par PLS Partial least squares RMSD Root mean square distance MCMM Monte Carlo multiple minimum Electronic supplementary material The online version of this article (