Publication details

QSPR Designer - a program to design and evaluate QSPR models. Case study on pKa prediction

Authors

SKŘEHOTA Ondřej, SVOBODOVÁ VAŘEKOVÁ Radka, GEIDL Stanislav, KUDERA Michal, SEHNAL David, IONESCU Crina-Maria, KOČA Jaroslav

Year of publication 2010
Type Article in proceedings
Conference 6th German Conference on Chemoinformatics
MU Faculty / Unit

Faculty of Science

Citation
www http://va.gdch.de/programm/prog_detail.asp?strVANr=5412
Field Informatics
Keywords QSPR, model design, descriptors, pKa, atomic charges, phenols
Description A large amount of experimental and predicted data on the 3D structure of organic molecules and biomolecules is now available. Advanced computational methods and high-performance computers make it possible to obtain large sets of descriptors that can be used to estimate physicochemical properties. It is often of interest to study the correlations between descriptors and properties using multilinear regression and to design, parameterize, and test different QSPR (Quantitative Structure-Property Relationship) models. We developed a modular and easily extensible program, called QSPR Designer, which can read or calculate structural properties of atoms and bonds, employ them as QSPR descriptors, and evaluate correlations between the descriptors and the examined physicochemical property of a molecule. Furthermore, the software makes it possible to design and parameterize QSPR models efficiently, calculate physicochemical properties via the models, test the quality of the models, and produce graphs and tables summarizing the results. The performance of the software is demonstrated by a case study on the prediction of pKa. The pKa is of fundamental relevance for chemical, biological, and pharmaceutical research, because many important physicochemical properties depend on pKa. Unfortunately, pKa is also one of the most challenging properties to calculate [1]. Atomic charges have proven to be very successful descriptors for the prediction of pKa [2]. Charges can be calculated using a variety of methods (HF, MP2, density functionals, etc.), population analyses (Mulliken, ESP, NPA, etc.), and basis sets, and the procedure used to calculate the charges strongly influences their correlation with pKa [3]. Using QSPR Designer, we designed, evaluated, and compared 75 different QSPR models for the prediction of pKa from charges.
Our best model predicted the pKa of 143 phenols with a correlation coefficient of 0.969, a root mean square error (RMSE) of 0.416, and an average pKa error of 0.329.
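
As a rough illustration of the kind of workflow the description outlines, the following Python sketch fits a multilinear regression of pKa on atomic-charge descriptors and then computes the three quality measures mentioned above (correlation coefficient, RMSE, and average absolute error). It is not the QSPR Designer implementation: the function names `fit_mlr`, `predict`, and `quality` are hypothetical, and the charge and pKa values are invented placeholders rather than data from the study.

```python
import math


def fit_mlr(X, y):
    """Ordinary least squares via the normal equations.

    X is a list of descriptor tuples, y a list of property values.
    Returns [intercept, coef_1, coef_2, ...].
    """
    rows = [[1.0] + [float(v) for v in r] for r in X]  # prepend intercept column
    n = len(rows[0])
    # Augmented matrix [A^T A | A^T y]
    M = [
        [sum(r[i] * r[j] for r in rows) for j in range(n)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(n)
    ]
    # Gauss-Jordan elimination with partial pivoting
    for c in range(n):
        p = max(range(c, n), key=lambda k: abs(M[k][c]))
        if abs(M[p][c]) < 1e-12:
            raise ValueError("descriptors are linearly dependent")
        M[c], M[p] = M[p], M[c]
        for k in range(n):
            if k != c:
                f = M[k][c] / M[c][c]
                M[k] = [a - f * b for a, b in zip(M[k], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def predict(coef, X):
    """Apply the fitted linear model to each descriptor tuple."""
    return [coef[0] + sum(c * x for c, x in zip(coef[1:], r)) for r in X]


def quality(y_true, y_pred):
    """Return (Pearson correlation, RMSE, mean absolute error)."""
    n = len(y_true)
    mt, mp = sum(y_true) / n, sum(y_pred) / n
    cov = sum((a - mt) * (b - mp) for a, b in zip(y_true, y_pred))
    var_t = sum((a - mt) ** 2 for a in y_true)
    var_p = sum((b - mp) ** 2 for b in y_pred)
    r = cov / math.sqrt(var_t * var_p)
    rmse = math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / n)
    mae = sum(abs(a - b) for a, b in zip(y_true, y_pred)) / n
    return r, rmse, mae


if __name__ == "__main__":
    # Invented placeholder data: (charge on hydroxyl H, charge on hydroxyl O)
    charges = [(0.40, -0.70), (0.42, -0.66), (0.45, -0.69),
               (0.43, -0.62), (0.47, -0.64), (0.41, -0.60)]
    pka = [10.0, 9.4, 8.9, 9.0, 8.1, 9.5]  # invented placeholder values

    coef = fit_mlr(charges, pka)
    r, rmse, mae = quality(pka, predict(coef, charges))
    print("coefficients:", coef)
    print(f"R = {r:.3f}, RMSE = {rmse:.3f}, mean |error| = {mae:.3f}")
```

Comparing several models, for example charges from different population analyses, would amount to repeating the fit with a different descriptor table and ranking the resulting quality measures. In practice the models should also be evaluated on molecules held out from the fit.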