Application of transfer learning to predict drug-induced human in vivo gene expression changes using rat in vitro and in vivo data

Shauna D O'Donovan; Rachel Cavill; Florian Wimmenauer; Alexander Lukas; Tobias Stumm; Evgueni Smirnov; Michael Lenz; Gokhan Ertaylan; Danyel G J Jennen; Natal A W van Riel; Kurt Driessens; Ralf L M Peeters; Theo M C M de Kok

doi:10.1371/journal.pone.0292030

PLoS One. 2023 Nov 30;18(11):e0292030. doi: 10.1371/journal.pone.0292030. eCollection 2023.

Authors

Shauna D O'Donovan¹ ² ³, Rachel Cavill⁴, Florian Wimmenauer⁴, Alexander Lukas⁴, Tobias Stumm⁴, Evgueni Smirnov⁴, Michael Lenz¹ ⁵ ⁶, Gokhan Ertaylan¹ ⁷, Danyel G J Jennen⁸, Natal A W van Riel¹ ² ³, Kurt Driessens⁴, Ralf L M Peeters¹ ⁴, Theo M C M de Kok¹ ⁸

Affiliations

¹ Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, The Netherlands.
² Dept. of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
³ Eindhoven Artificial Intelligence Systems Institute (EAISI), Eindhoven University of Technology, Eindhoven, The Netherlands.
⁴ Dept. of Advanced Computing Sciences, Maastricht University, Maastricht, The Netherlands.
⁵ Institute of Organismic and Molecular Evolution, Johannes Gutenberg University Mainz, Mainz, Germany.
⁶ Preventive Cardiology and Preventative Medicine - Center for Cardiology, University Medical Center of the Johannes Gutenberg University Mainz, Mainz, Germany.
⁷ Sustainable Health, Flemish Institute for Technological Research (VITO), Mol, Belgium.
⁸ Dept. of Toxicogenomics, GROW School for Oncology and Reproduction, Maastricht University, Maastricht, The Netherlands.

Abstract

The liver is the primary site for the metabolism and detoxification of many compounds, including pharmaceuticals. Consequently, it is also the primary location for many adverse reactions. As the liver is not readily accessible for sampling in humans, rodent or cell line models are often used to evaluate potential toxic effects of a novel compound or candidate drug. However, relating the results of animal and in vitro studies to relevant clinical outcomes for the human in vivo situation still proves challenging. In this study, we incorporate principles of transfer learning within a deep artificial neural network, allowing us to leverage the relative abundance of rat in vitro and in vivo exposure data from the Open TG-GATEs data set to train a model that predicts the expected pattern of human in vivo gene expression following an exposure, given measured human in vitro gene expression. We show that domain adaptation has been successfully achieved, with the rat and human in vitro data no longer being separable in the common latent space generated by the network. The network produces physiologically plausible predictions of the human in vivo gene expression pattern following exposure to a previously unseen compound. Moreover, we show that integrating the human in vitro data into the training of the domain adaptation network significantly improves the temporal accuracy of the predicted rat in vivo gene expression pattern following exposure to a previously unseen compound. In this way, we demonstrate the improvements in prediction accuracy that can be achieved by combining data from distinct domains.
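
The abstract's claim that rat and human in vitro data are "no longer separable" in the latent space can be illustrated with a simple check. The abstract does not state how the authors assessed separability, so the following is only a minimal sketch under stated assumptions: latent vectors are plain lists of floats, the classifier is a nearest-centroid rule scored by leave-one-out accuracy, and the data are synthetic Gaussian draws. The names `domain_separability` and `draw` are hypothetical and do not come from the paper.

```python
import random


def centroid(points):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(points)
    return [sum(p[i] for p in points) / n for i in range(len(points[0]))]


def sq_dist(p, q):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def domain_separability(rat, human):
    """Leave-one-out accuracy of a nearest-centroid domain classifier.

    Accuracy near 1.0 means the two domains are easy to tell apart;
    accuracy that is not clearly above 0.5 suggests they overlap.
    """
    samples = [(p, 0) for p in rat] + [(p, 1) for p in human]
    correct = 0
    for i, (point, label) in enumerate(samples):
        # Hold out the current sample, then fit one centroid per domain.
        rest = samples[:i] + samples[i + 1:]
        c_rat = centroid([p for p, lab in rest if lab == 0])
        c_human = centroid([p for p, lab in rest if lab == 1])
        predicted = 0 if sq_dist(point, c_rat) <= sq_dist(point, c_human) else 1
        correct += predicted == label
    return correct / len(samples)


# Synthetic stand-ins for latent vectors (assumption: not real TG-GATEs data).
rng = random.Random(0)


def draw(mean, n=200, dim=8):
    """Draw n Gaussian vectors of length dim with unit variance."""
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(n)]


separable = domain_separability(draw(0.0), draw(3.0))  # shifted domains
aligned = domain_separability(draw(0.0), draw(0.0))    # overlapping domains
print(f"separable: {separable:.2f}, aligned: {aligned:.2f}")
```

In this sketch, a well-adapted latent space would behave like the `aligned` case, where the domain classifier does no better than guessing. A stronger classifier or a held-out test split could be substituted without changing the idea.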

Copyright: © 2023 O’Donovan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

MeSH terms

Animals
Gene Expression
Humans
Learning
Liver*
Machine Learning
Neural Networks, Computer*
Rats

Grants and funding

The research in this paper was supported by the Dutch Province of Limburg. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.