Machine learning optimization of cross docking accuracy

Esben J Bjerrum

doi:10.1016/j.compbiolchem.2016.04.005

Machine learning optimization of cross docking accuracy

Comput Biol Chem. 2016 Jun:62:133-44. doi: 10.1016/j.compbiolchem.2016.04.005. Epub 2016 May 4.

Author

Esben J Bjerrum¹

Affiliation

¹ Wildcard Pharmaceutical Consulting, Frødings Alle 41, 2860 Søborg, Denmark. Electronic address: [email protected].

PMID: 27179709
DOI: 10.1016/j.compbiolchem.2016.04.005

Abstract

Performance of small molecule automated docking programs has conceptually been divided into docking -, scoring -, ranking - and screening power, which focuses on the crystal pose prediction, affinity prediction, ligand ranking and database screening capabilities of the docking program, respectively. Benchmarks show that different docking programs can excel in individual benchmarks which suggests that the scoring function employed by the programs can be optimized for a particular task. Here the scoring function of Smina is re-optimized towards enhancing the docking power using a supervised machine learning approach and a manually curated database of ligands and cross docking receptor pairs. The optimization method does not need associated binding data for the receptor-ligand examples used in the data set and works with small train sets. The re-optimization of the weights for the scoring function results in a similar docking performance with regard to docking power towards a cross docking test set. A ligand decoy based benchmark indicates a better discrimination between poses with high and low RMSD. The reported parameters for Smina are compatible with Autodock Vina and represent ready-to-use alternative parameters for researchers who aim at pose prediction rather than affinity prediction.

Keywords: Autodock Vina; Cross docking; Docking power; Drug discovery; Machine learning optimization; Molecular docking; Scoring function; Smina.

MeSH terms

Drug Design
Ligands
Machine Learning*
Molecular Docking Simulation*

Substances

Ligands