Publication details

QSPR Designer - a program to design and evaluate QSPR models. Case study on pKa prediction

Authors

SKŘEHOTA Ondřej SVOBODOVÁ VAŘEKOVÁ Radka GEIDL Stanislav KUDERA Michal SEHNAL David IONESCU Crina-Maria KOČA Jaroslav

Year of publication 2010
Type Article in Proceedings
Conference 6th German Conference on Chemoinformatics
MU Faculty or unit

Faculty of Science

Citation
Web http://va.gdch.de/programm/prog_detail.asp?strVANr=5412
Field Informatics
Keywords QSPR, model design, descriptors, pKa, atomic charges, phenols
Description Nowadays, a large amount of experimental and predicted data about the 3D structure of organic molecules and biomolecules is available. Advanced computational methods and high performance computers allow us to obtain large sets of descriptors that can be used to estimate physicochemical properties. It is often of interest to study the correlations between descriptors and properties using multilinear regression and to design, parameterize, and test different QSPR (Quantitative Structure Property Relationship) models. We developed a modular and easily extensible program, called QSPR Designer, which can read or calculate structural properties of atoms and bonds, employ them as QSPR descriptors, and evaluate correlations between the descriptors and the examined physicochemical property of a molecule. Furthermore, the software allows us to effectively design and parameterize QSPR models, calculate physicochemical properties via the models, test the quality of the models, and provide graphs and tables summarizing the results. The performance of the software is demonstrated by a case study on the prediction of pKa. The pKa is of fundamental relevance for chemical, biological and pharmaceutical research, because many important physicochemical properties are pKa dependent. Unfortunately, pKa is also one of the most challenging properties to calculate [1]. Atomic charges have proven very successful descriptors for the prediction of pKa [2]. Charges can be calculated using a variety of methods (HF, MP2, functionals, etc.), population analyses (Mulliken, ESP, NPA, etc.) and basis sets. Consequently, the procedure of charge calculation strongly influences their correlation with pKa [3]. Using the QSPR Designer, we have successfully designed, evaluated, and compared 75 different QSPR models for the prediction of pKa from charges. Our best model predicted the pKa for 143 phenols with a correlation coefficient 0.969, RMSE (root mean square error) 0.416 and the average pKa error 0.329.

You are running an old browser version. We recommend updating your browser to its latest version.

More info