Identifying predictive features in drug response using machine learning: opportunities and challenges

Mathukumalli Vidyasagar

doi:10.1146/annurev-pharmtox-010814-124502

Identifying predictive features in drug response using machine learning: opportunities and challenges

Annu Rev Pharmacol Toxicol. 2015:55:15-34. doi: 10.1146/annurev-pharmtox-010814-124502. Epub 2014 Dec 12.

Author

Mathukumalli Vidyasagar¹

Affiliation

¹ Erik Jonsson School of Engineering and Computer Science, University of Texas at Dallas, Richardson, Texas 75080; email: [email protected].

PMID: 25423479
DOI: 10.1146/annurev-pharmtox-010814-124502

Abstract

This article reviews several techniques from machine learning that can be used to study the problem of identifying a small number of features, from among tens of thousands of measured features, that can accurately predict a drug response. Prediction problems are divided into two categories: sparse classification and sparse regression. In classification, the clinical parameter to be predicted is binary, whereas in regression, the parameter is a real number. Well-known methods for both classes of problems are briefly discussed. These include the SVM (support vector machine) for classification and various algorithms such as ridge regression, LASSO (least absolute shrinkage and selection operator), and EN (elastic net) for regression. In addition, several well-established methods that do not directly fall into machine learning theory are also reviewed, including neural networks, PAM (pattern analysis for microarrays), SAM (significance analysis for microarrays), GSEA (gene set enrichment analysis), and k-means clustering. Several references indicative of the application of these methods to cancer biology are discussed.

Keywords: EN algorithm; GSEA; LASSO; PAM; SAM; SVMs; cancer biology; k-means clustering; machine learning; neural networks; precision medicine; prediction in pharmacology; regression.

Publication types

Review

MeSH terms

Algorithms
Animals
Antineoplastic Agents / adverse effects
Antineoplastic Agents / pharmacokinetics
Antineoplastic Agents / therapeutic use*
Artificial Intelligence*
Cluster Analysis
Drug Discovery / methods*
Drug-Related Side Effects and Adverse Reactions / etiology
Drug-Related Side Effects and Adverse Reactions / prevention & control
Gene Expression Regulation, Neoplastic / drug effects
Gene Regulatory Networks / drug effects
Humans
Neural Networks, Computer
Patient Safety
Patient Selection
Pattern Recognition, Automated
Pharmacology / methods*
Precision Medicine / methods*
Risk Assessment
Risk Factors

Substances

Antineoplastic Agents