AutoShim: empirically corrected scoring functions for quantitative docking with a crystal structure and IC50 training data

J Chem Inf Model. 2008 Apr;48(4):861-72. doi: 10.1021/ci7004548. Epub 2008 Apr 2.

Abstract

It has been notoriously difficult to develop general all-purpose scoring functions for high-throughput docking that correlate with measured binding affinity. As a practical alternative, AutoShim uses the program Magnet to add point-pharmacophore like "shims" to the binding site of each protein target. The pharmacophore shims are weighted by partial least-squares (PLS) regression, adjusting the all-purpose scoring function to reproduce IC 50 data, much as the shims in an NMR magnet are weighted to optimize the field for a better spectrum. This dramatically improves the affinity predictions on 25% of the compounds held out at random. An iterative procedure chooses the best pose during the process of shim parametrization. This method reproducibly converges to a consistent solution, regardless of starting pose, in just 2-4 iterations, so these robust models do not overtrain. Sets of complex multifeature shims, generated by a recursive partitioning method, give the best activity predictions, but these are difficult to interpret when designing new compounds. Sets of simpler single-point pharmacophores still predict affinity reasonably well and clearly indicate the molecular interactions producing effective binding. The pharmacophore requirements are very reproducible, irrespective of the compound sets used for parametrization, lending confidence to the predictions and interpretations. The automated procedure does require a training set of experimental compounds but otherwise adds little extra time over conventional docking.

MeSH terms

  • Crystallography
  • Hydrogen Bonding
  • Molecular Structure*
  • Proteins / chemistry

Substances

  • Proteins