Bayesian Gene Set Benchmark Dose Estimation for "omic" responses

Bioinformatics. 2025 Jan 9:btaf008. doi: 10.1093/bioinformatics/btaf008. Online ahead of print.

Abstract

Motivation: Estimating a toxic reference point using tools like the benchmark dose (BMD) is a critical step in setting policy to regulate pollution and ensure safe environments. Toxicity can be measured for different endpoints, including changes in gene expression and histopathology for various tissues, and is typically explored one gene or tissue at a time in a univariate setting that ignores correlation. In this work, we develop a multivariate estimation procedure to estimate the BMD for specified gene sets. Our approach extends the foundational univariate approach by accounting for correlation in a statistically principled way.

Results: We illustrate the method using data from a 5-day rat study and Hallmark gene sets and compare to existing BMD results computed by the EPA for both gene sets and apical histopathology endpoints. In contrast to previous ad-hoc methods, our principled approach provides the needed extension to bring the foundational univariate method into the multivariate world of transcriptomics. In addition to use in a regulatory setting, our method can provide hypothesis generation when gene sets correspond to mechanistic pathways.

Availability and implementation: BS-BMD is implemented in R and C ++ and available at https://github.com/NIEHS/BS-BMD.

Supplementary information: Supplementary data are available at Journal Name.