LogP prediction performance with the SMD solvation model and the M06 density functional family for SAMPL6 blind prediction challenge molecules

J Comput Aided Mol Des. 2020 May;34(5):511-522. doi: 10.1007/s10822-020-00278-1. Epub 2020 Jan 14.

Abstract

This work presents a quantum mechanical model for predicting octanol-water partition coefficients of small protein-kinase inhibitor fragments as part of the SAMPL6 LogP Prediction Challenge. The model calculates solvation free energy differences using the M06-2X functional with SMD implicit solvation and the def2-SVP basis set. This model was identified as dqxk4 in the SAMPL6 Challenge and was the third highest performing model in the physical methods category with 0.49 log Root Mean Squared Error (RMSE) for predicting the 11 compounds in SAMPL6 blind prediction set. We also collaboratively investigated the use of empirical models to address model deficiencies for halogenated compounds at minimal additional computational cost. A mixed model consisting of the dqxk4 physical and hdpuj empirical models found improved performance at 0.34 log RMSE on the SAMPL6 dataset. This collaborative mixed model approach shows how empirical models can be leveraged to expediently improve performance in chemical spaces that are difficult for ab initio methods to simulate.

Keywords: Computational chemistry; DFT; Implicit solvation; LogP; SAMPL6.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Hydrogen-Ion Concentration
  • Molecular Structure
  • Solvents / chemistry*
  • Thermodynamics*
  • Water / chemistry*

Substances

  • Solvents
  • Water