In Silico Prediction of Compounds Binding to Human Plasma Proteins by QSAR Models

Lixia Sun; Hongbin Yang; Jie Li; Tianduanyi Wang; Weihua Li; Guixia Liu; Yun Tang

doi:10.1002/cmdc.201700582

In Silico Prediction of Compounds Binding to Human Plasma Proteins by QSAR Models

ChemMedChem. 2018 Mar 20;13(6):572-581. doi: 10.1002/cmdc.201700582. Epub 2017 Nov 10.

Authors

Lixia Sun¹, Hongbin Yang¹, Jie Li¹, Tianduanyi Wang¹, Weihua Li¹, Guixia Liu¹, Yun Tang¹

Affiliation

¹ Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.

PMID: 29057587
DOI: 10.1002/cmdc.201700582

Abstract

Plasma protein binding (PPB) is a significant pharmacokinetic property of compounds in drug discovery and design. Due to the high cost and time-consuming nature of experimental assays, in silico approaches have been developed to assess the binding profiles of chemicals. However, because of unambiguity and the lack of uniform experimental data, most available predictive models are far from satisfactory. In this study, an elaborately curated training set containing 967 diverse pharmaceuticals with plasma-protein-bound fractions (f_b ) was used to construct quantitative structure-activity relationship (QSAR) models by six machine learning algorithms with 26 molecular descriptors. Furthermore, we combined all of the individual learners to yield consensus prediction, marginally improving the accuracy of the consensus model. The model performance was estimated by tenfold cross validation and three external validation sets comprising 242 pharmaceutical, 397 industrial, and 231 newly designed chemicals, respectively. The models showed excellent performance for the entire test set, with mean absolute error (MAE) ranging from 0.126 to 0.178, demonstrating that our models could be used by a chemist when drawing a molecular structure from scratch. Meanwhile, structural descriptors contributing significantly to the predictive power of the models were related to the binding mechanisms, and the trend in terms of their effects on PPB can serve as guidance for the structural modification of chemicals. The applicability domain was also defined to distinguish favorable predictions from unfavorable predictions.

Keywords: QSAR; consensus modeling; machine learning; pharmacokinetics; plasma protein binding.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Binding Sites
Blood Proteins / chemistry*
Computer Simulation*
Databases, Chemical
Humans
Machine Learning
Models, Molecular
Molecular Structure
Pharmaceutical Preparations / chemistry*
Protein Binding
Quantitative Structure-Activity Relationship*

Substances

Blood Proteins
Pharmaceutical Preparations