Feature-map vectors: a new class of informative descriptors for computational drug discovery

J Comput Aided Mol Des. 2006 Dec;20(12):751-62. doi: 10.1007/s10822-006-9085-8. Epub 2007 Jan 5.

Abstract

In order to develop robust machine-learning or statistical models for predicting biological activity, descriptors that capture the essence of the protein-ligand interaction are required. In the absence of structural information from X-ray or NMR experiments, deriving informative descriptors can be difficult. We have developed feature-map vectors (FMVs), a new class of descriptors based on chemical features, to address this challenge. FMVs, which are derived from the conformational models of a few actives, are low dimensional, problem specific, and highly interpretable. By using shape-based alignments and scoring with chemical features, FMVs can combine information about a molecule's shape and the pharmacophores it can match. In five validation studies, bag classifiers built using FMVs have shown high enrichments for identifying actives for five diverse targets: CDK2, 5-HT(3), DHFR, thrombin, and ACE. The interpretability of these descriptors has been demonstrated for CDK2 and 5-HT(3), where the method automatically discovers the standard literature pharmacophore.

MeSH terms

  • Algorithms
  • Artificial Intelligence
  • Computer Simulation
  • Computer-Aided Design
  • Cyclin-Dependent Kinase 2 / chemistry
  • Cyclin-Dependent Kinase 2 / drug effects
  • Drug Design*
  • Humans
  • In Vitro Techniques
  • Ligands
  • Models, Molecular
  • Proteins / chemistry
  • Receptors, Serotonin, 5-HT3 / chemistry
  • Receptors, Serotonin, 5-HT3 / drug effects

Substances

  • Ligands
  • Proteins
  • Receptors, Serotonin, 5-HT3
  • CDK2 protein, human
  • Cyclin-Dependent Kinase 2