Efficient design of peptide-binding polymers using active learning approaches

Assima Rakhimbekova; Anton Lopukhov; Natalia Klyachko; Alexander Kabanov; Timur I Madzhidov; Alexander Tropsha

doi:10.1016/j.jconrel.2022.11.023

J Control Release. 2023 Jan:353:903-914. doi: 10.1016/j.jconrel.2022.11.023. Epub 2022 Dec 19.

Authors

Assima Rakhimbekova¹, Anton Lopukhov², Natalia Klyachko², Alexander Kabanov³, Timur I Madzhidov¹, Alexander Tropsha⁴

Affiliations

¹ A.M. Butlerov Institute of Chemistry, Kazan Federal University, Kazan 420008, Russia.
² Laboratory of Chemical Design of Bionanomaterials, Faculty of Chemistry, M.V. Lomonosov Moscow State University, Moscow, Russia.
³ Laboratory of Chemical Design of Bionanomaterials, Faculty of Chemistry, M.V. Lomonosov Moscow State University, Moscow, Russia; Center for Nanotechnology in Drug Delivery, Division of Pharmacoengineering and Molecular Pharmaceutics, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, NC, USA.
⁴ Laboratory for Molecular Modeling, Division of Chemical Biology and Medicinal Chemistry, UNC Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, NC 27599, USA. Electronic address: [email protected].

PMID: 36402234
DOI: 10.1016/j.jconrel.2022.11.023

Abstract

Active learning (AL) has recently become an active area of research in both industry and academia as an efficient approach for the rapid design and discovery of novel chemicals, materials, and polymers. Herein, we have assessed the applicability of AL for the discovery of polymeric micelle formulations for poorly soluble drugs. We were motivated by the key advantages of this approach, which make it a desirable strategy for the rational design of drug delivery systems: its ability to (i) employ relatively small datasets for model development, (ii) iterate between model development and model assessment using small external datasets that can be either generated in focused experimental studies or formed from subsets of the initial training data, and (iii) progressively evolve models towards increasingly reliable predictions and the identification of novel chemicals with the desired properties. In this study, we compared various AL protocols for their effectiveness in finding biologically active molecules using synthetic datasets. We investigated the dependency of AL performance on the size of the initial training set, the relative complexity of the task, and the choice of the initial training dataset. We found that AL techniques as applied to regression modeling offer no benefits over random search, while AL used for classification tasks performs better than models built on randomly selected training sets but remains far from perfect. Finally, the best performing AL approach was employed to discover and experimentally validate novel binding polymers in a case study of the asialoglycoprotein receptor (ASGPR).
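
The iterative loop described in points (i) to (iii) can be illustrated with a minimal Python sketch. It is not the authors' protocol: the k-nearest-neighbour scorer, the uncertainty-sampling rule, the synthetic two-feature pool, and all names (`knn_active_fraction`, `active_learning`, `oracle`) are assumptions chosen only to show how a small labeled set grows batch by batch, with random selection as the baseline for comparison.

```python
import random


def knn_active_fraction(x, labeled, k=3):
    """Fraction of actives among the k labeled points nearest to x (illustrative model)."""
    nearest = sorted(
        labeled,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(x, item[0])),
    )[:k]
    return sum(label for _, label in nearest) / len(nearest)


def active_learning(pool, oracle, n_init=5, batch=5, rounds=4,
                    strategy="uncertainty", seed=0):
    """Grow a labeled set from `pool`; `oracle` stands in for the experiment."""
    rng = random.Random(seed)
    unlabeled = list(range(len(pool)))
    rng.shuffle(unlabeled)
    # Small random initial training set.
    chosen = [unlabeled.pop() for _ in range(n_init)]
    labeled = [(pool[i], oracle(pool[i])) for i in chosen]
    for _ in range(rounds):
        if strategy == "uncertainty":
            # Query the candidates whose predicted active fraction is closest to 0.5.
            unlabeled.sort(
                key=lambda i: abs(knn_active_fraction(pool[i], labeled) - 0.5)
            )
        else:
            # Baseline: pick the next batch at random.
            rng.shuffle(unlabeled)
        picked, unlabeled = unlabeled[:batch], unlabeled[batch:]
        labeled += [(pool[i], oracle(pool[i])) for i in picked]
        chosen += picked
    return chosen, labeled


# Hypothetical synthetic task: a point is "active" when its two features sum above 1.2.
data_rng = random.Random(1)
pool = [(data_rng.random(), data_rng.random()) for _ in range(200)]
oracle = lambda x: int(x[0] + x[1] > 1.2)

al_chosen, al_labeled = active_learning(pool, oracle, strategy="uncertainty")
rnd_chosen, rnd_labeled = active_learning(pool, oracle, strategy="random")
print(len(al_chosen), sum(label for _, label in al_labeled))
print(len(rnd_chosen), sum(label for _, label in rnd_labeled))
```

Each call labels 25 candidates in total (5 initial plus 4 batches of 5). Swapping the sort key for the predicted active fraction itself would turn the same loop into a greedy, exploitation-oriented search, which is one of the design choices an AL protocol comparison would vary.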

Keywords: Active learning; Binders; Bioactivity; Molecular design; Polymer binders; Polymers.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Drug Delivery Systems
Micelles
Peptides
Polymers* / chemistry
Problem-Based Learning*

Substances

Polymers
Micelles
Peptides