Machine learning optimization of cross docking accuracy

Comput Biol Chem. 2016 Jun:62:133-44. doi: 10.1016/j.compbiolchem.2016.04.005. Epub 2016 May 4.

Abstract

Performance of small molecule automated docking programs has conceptually been divided into docking -, scoring -, ranking - and screening power, which focuses on the crystal pose prediction, affinity prediction, ligand ranking and database screening capabilities of the docking program, respectively. Benchmarks show that different docking programs can excel in individual benchmarks which suggests that the scoring function employed by the programs can be optimized for a particular task. Here the scoring function of Smina is re-optimized towards enhancing the docking power using a supervised machine learning approach and a manually curated database of ligands and cross docking receptor pairs. The optimization method does not need associated binding data for the receptor-ligand examples used in the data set and works with small train sets. The re-optimization of the weights for the scoring function results in a similar docking performance with regard to docking power towards a cross docking test set. A ligand decoy based benchmark indicates a better discrimination between poses with high and low RMSD. The reported parameters for Smina are compatible with Autodock Vina and represent ready-to-use alternative parameters for researchers who aim at pose prediction rather than affinity prediction.

Keywords: Autodock Vina; Cross docking; Docking power; Drug discovery; Machine learning optimization; Molecular docking; Scoring function; Smina.

MeSH terms

  • Drug Design
  • Ligands
  • Machine Learning*
  • Molecular Docking Simulation*

Substances

  • Ligands