Evolving interpretable structure-activity relationships. 1. Reduced graph queries

J Chem Inf Model. 2008 Aug;48(8):1543-57. doi: 10.1021/ci8000502. Epub 2008 Jul 17.

Abstract

A new machine learning method is presented for extracting interpretable structure-activity relationships from screening data. The method is based on an evolutionary algorithm and reduced graphs and aims to evolve a reduced graph query (subgraph) that is present within the active compounds and absent from the inactives. The reduced graph representation enables heterogeneous compounds, such as those found in high-throughput screening data, to be captured in a single representation with the resulting query encoding structure-activity information in a form that is readily interpretable by a chemist. The application of the method is illustrated using data sets extracted from the well-known MDDR data set and GSK in-house screening data. Queries are evolved that are consistent with the known SARs, and they are also shown to be robust when applied to independent sets that were not used in training.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosomes / genetics
  • Combinatorial Chemistry Techniques / methods*
  • Humans
  • Phenotype
  • Receptor, Serotonin, 5-HT1A / metabolism
  • Serotonin 5-HT1 Receptor Agonists
  • Structure-Activity Relationship

Substances

  • Serotonin 5-HT1 Receptor Agonists
  • Receptor, Serotonin, 5-HT1A