Substantial improvements in large-scale redocking and screening using the novel HYDE scoring function

Nadine Schneider; Sally Hindle; Gudrun Lange; Robert Klein; Jürgen Albrecht; Hans Briem; Kristin Beyer; Holger Claußen; Marcus Gastreich; Christian Lemmen; Matthias Rarey

doi:10.1007/s10822-011-9531-0

Substantial improvements in large-scale redocking and screening using the novel HYDE scoring function

J Comput Aided Mol Des. 2012 Jun;26(6):701-23. doi: 10.1007/s10822-011-9531-0. Epub 2011 Dec 27.

Authors

Nadine Schneider¹, Sally Hindle, Gudrun Lange, Robert Klein, Jürgen Albrecht, Hans Briem, Kristin Beyer, Holger Claußen, Marcus Gastreich, Christian Lemmen, Matthias Rarey

Affiliation

¹ Center for Bioinformatics, University of Hamburg, Bundesstr. 43, 20146, Hamburg, Germany.

PMID: 22203423
DOI: 10.1007/s10822-011-9531-0

Abstract

The HYDE scoring function consistently describes hydrogen bonding, the hydrophobic effect and desolvation. It relies on HYdration and DEsolvation terms which are calibrated using octanol/water partition coefficients of small molecules. We do not use affinity data for calibration, therefore HYDE is generally applicable to all protein targets. HYDE reflects the Gibbs free energy of binding while only considering the essential interactions of protein-ligand complexes. The greatest benefit of HYDE is that it yields a very intuitive atom-based score, which can be mapped onto the ligand and protein atoms. This allows the direct visualization of the score and consequently facilitates analysis of protein-ligand complexes during the lead optimization process. In this study, we validated our new scoring function by applying it in large-scale docking experiments. We could successfully predict the correct binding mode in 93% of complexes in redocking calculations on the Astex diverse set, while our performance in virtual screening experiments using the DUD dataset showed significant enrichment values with a mean AUC of 0.77 across all protein targets with little or no structural defects. As part of these studies, we also carried out a very detailed analysis of the data that revealed interesting pitfalls, which we highlight here and which should be addressed in future benchmark datasets.

MeSH terms

Algorithms*
Binding Sites
Hydrogen Bonding
Hydrophobic and Hydrophilic Interactions
Ligands
Models, Molecular
Protein Binding
Proteins / chemistry*
Thermodynamics*
Water / chemistry*

Substances

Ligands
Proteins
Water