A Genetic Algorithm Based Support Vector Machine Model for Blood-Brain Barrier Penetration Prediction

Biomed Res Int. 2015:2015:292683. doi: 10.1155/2015/292683. Epub 2015 Oct 4.

Abstract

Blood-brain barrier (BBB) is a highly complex physical barrier determining what substances are allowed to enter the brain. Support vector machine (SVM) is a kernel-based machine learning method that is widely used in QSAR study. For a successful SVM model, the kernel parameters for SVM and feature subset selection are the most important factors affecting prediction accuracy. In most studies, they are treated as two independent problems, but it has been proven that they could affect each other. We designed and implemented genetic algorithm (GA) to optimize kernel parameters and feature subset selection for SVM regression and applied it to the BBB penetration prediction. The results show that our GA/SVM model is more accurate than other currently available log BB models. Therefore, to optimize both SVM parameters and feature subset simultaneously with genetic algorithm is a better approach than other methods that treat the two problems separately. Analysis of our log BB model suggests that carboxylic acid group, polar surface area (PSA)/hydrogen-bonding ability, lipophilicity, and molecular charge play important role in BBB penetration. Among those properties relevant to BBB penetration, lipophilicity could enhance the BBB penetration while all the others are negatively correlated with BBB penetration.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Biological Transport, Active
  • Blood-Brain Barrier / drug effects*
  • Blood-Brain Barrier / physiology*
  • Databases, Factual
  • Drug Design
  • Humans
  • Hydrogen Bonding
  • Hydrogen-Ion Concentration
  • Models, Biological*
  • Permeability
  • Quantitative Structure-Activity Relationship
  • Support Vector Machine*