Machine learning (ML) has increasingly been applied to predict properties of drugs. Particularly, metabolism can be predicted with ML methods, which can be exploited during drug discovery and development. The prediction of metabolism is a crucial bottleneck in the early identification of toxic metabolites or biotransformation pathways that can affect elimination of the drug and potentially hinder the development of future new drugs. Metabolism prediction can be addressed with the application of ML models trained on large and validated dataset, from early stages of lead optimization to latest stage of drug development. ML methods rely on molecular descriptors that allow to identify and learn chemical and molecular features to predict sites of metabolism (SoMs) or activity associated with mechanism of inhibition (e.g., CYP inhibition). The application of ML methods in the prediction of drug metabolism represents a powerful resource to be exploited during drug discovery and development. ML allows to improve in silico screening and safety assessments of drugs in advance, steering their path to marketing authorization. Prediction of biotransformation reactions and metabolites allows to shorten the time, save the cost, and reduce animal testing. In this context, ML methods represent a technique to fill data gaps and an opportunity to reduce animal testing, calling for the 3R principles within the Big Data era.
Keywords: (Q)SAR; 3Rs; Drug development; Drug discovery; Machine learning; Metabolism; SoM.
© 2025. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.