Expression forecasting methods use machine learning models to predict how a cell will alter its transcriptome upon perturbation. Such methods are enticing because they promise to answer pressing questions in fields ranging from developmental genetics to cell fate engineering, and because they offer a fast, cheap, and accessible complement to the corresponding experiments. However, the absolute and relative accuracy of these methods is poorly characterized, limiting their informed use, their improvement, and the interpretation of their predictions. To address these issues, we created a benchmarking platform that combines a panel of 11 large-scale perturbation datasets with an expression forecasting software engine that implements or interfaces with a wide variety of methods. We used our platform to systematically assess methods, parameters, and sources of auxiliary data, finding that performance depends strongly on the choice of metric; in particular, under simple metrics such as mean squared error, expression forecasting methods rarely outperform simple baselines. Our platform will serve as a resource for improving methods and for identifying contexts in which expression forecasting can succeed.
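The comparison against simple baselines can be made concrete with a small sketch. The snippet below is a minimal illustration with synthetic data, not part of the platform described here: it scores a hypothetical model forecast against a "no-change" baseline, which simply predicts the unperturbed control profile for every perturbation, under mean squared error. All array names and the noise scales are assumptions for illustration only.

```python
import numpy as np

def mse(predicted, observed):
    """Mean squared error across genes, averaged over perturbations."""
    return np.mean((predicted - observed) ** 2)

rng = np.random.default_rng(0)
n_perturbations, n_genes = 50, 2000

# Hypothetical data: an unperturbed control profile, plus observed
# post-perturbation profiles that deviate only slightly from it.
control = rng.normal(size=n_genes)
observed = control + rng.normal(scale=0.1, size=(n_perturbations, n_genes))

# "No-change" baseline: predict the control profile for every perturbation.
baseline_prediction = np.tile(control, (n_perturbations, 1))

# Stand-in for a learned forecast; in practice this would come from a
# trained expression forecasting model.
model_prediction = baseline_prediction + rng.normal(
    scale=0.2, size=(n_perturbations, n_genes)
)

print("baseline MSE:", mse(baseline_prediction, observed))
print("model MSE:   ", mse(model_prediction, observed))
```

In this synthetic setting the baseline wins because true perturbation effects are small relative to the model's error, which mirrors how an uninformative predictor can score well under MSE when most genes change little.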