Modeling of compound profiling experiments using support vector machines

Chem Biol Drug Des. 2014 Jul;84(1):75-85. doi: 10.1111/cbdd.12294. Epub 2014 Mar 13.

Abstract

Profiling of compounds against target families has become an important approach in pharmaceutical research for the identification of hits and analysis of selectivity and promiscuity patterns. We report on modeling of profiling experiments involving 429 potential inhibitors and a panel of 24 different kinases using support vector machine (SVM) techniques and naïve Bayesian classification. The experimental matrix contained many different activity profiles. SVM predictions achieved overall high accuracy due to consistently low false-positive and consistently high true-negative rates. However, predictions for promiscuous inhibitors were affected by false-negative rates. Combined target-based SVM classifiers reached or exceeded the performance of SVM profile prediction methods and were superior to Bayesian classification. The classifiers displayed different prediction characteristics including diverse combinations of false-positive and true-negative rates. Predicted and experimentally observed compound activity profiles were compared in detail, revealing activity patterns modeled with different accuracy.

Keywords: Bayesian classification; activity profile prediction; compound profiling; inhibitors; machine learning; protein kinases; support vector machines; target families.

MeSH terms

  • Animals
  • Bayes Theorem
  • Computer-Aided Design
  • Drug Discovery / methods*
  • Humans
  • Protein Kinase Inhibitors / chemistry*
  • Protein Kinase Inhibitors / pharmacology*
  • Protein Kinases / metabolism
  • Support Vector Machine*

Substances

  • Protein Kinase Inhibitors
  • Protein Kinases