Transfer Learning for Drug Discovery

Chenjing Cai; Shiwei Wang; Youjun Xu; Weilin Zhang; Ke Tang; Qi Ouyang; Luhua Lai; Jianfeng Pei

doi:10.1021/acs.jmedchem.9b02147

Transfer Learning for Drug Discovery

J Med Chem. 2020 Aug 27;63(16):8683-8694. doi: 10.1021/acs.jmedchem.9b02147. Epub 2020 Jul 24.

Authors

Chenjing Cai¹, Shiwei Wang², Youjun Xu³, Weilin Zhang⁴, Ke Tang⁵, Qi Ouyang^{1

6}, Luhua Lai^{1

3}, Jianfeng Pei¹

Affiliations

¹ Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, P. R. China.
² PTN Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, P. R. China.
³ BNLMS and Peking-Tsinghua Center for Life Sciences at the College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, P. R. China.
⁴ Beijing Intelligent Pharma Technology Co., Ltd., Beijing 100083, P. R. China.
⁵ Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, P. R. China.
⁶ The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing 100871, P. R. China.

PMID: 32672961
DOI: 10.1021/acs.jmedchem.9b02147

Abstract

The data sets available to train models for in silico drug discovery efforts are often small. Indeed, the sparse availability of labeled data is a major barrier to artificial-intelligence-assisted drug discovery. One solution to this problem is to develop algorithms that can cope with relatively heterogeneous and scarce data. Transfer learning is a type of machine learning that can leverage existing, generalizable knowledge from other related tasks to enable learning of a separate task with a small set of data. Deep transfer learning is the most commonly used type of transfer learning in the field of drug discovery. This Perspective provides an overview of transfer learning and related applications to drug discovery to date. Furthermore, it provides outlooks on the future development of transfer learning for drug discovery.

Publication types

Research Support, Non-U.S. Gov't
Review

MeSH terms

Datasets as Topic
Deep Learning*
Drug Discovery / methods*