Predicting subgroup treatment effects for a new study: Motivations, results and learnings from running a data challenge in a pharmaceutical corporation

Björn Bornkamp; Silvia Zaoli; Michela Azzarito; Ruvie Martin; Carsten Philipp Müller; Conor Moloney; Giulia Capestro; David Ohlssen; Mark Baillie

doi:10.1002/pst.2368

Predicting subgroup treatment effects for a new study: Motivations, results and learnings from running a data challenge in a pharmaceutical corporation

Pharm Stat. 2024 Jul-Aug;23(4):495-510. doi: 10.1002/pst.2368. Epub 2024 Feb 7.

Authors

Björn Bornkamp¹, Silvia Zaoli¹, Michela Azzarito¹, Ruvie Martin², Carsten Philipp Müller³, Conor Moloney⁴, Giulia Capestro¹, David Ohlssen², Mark Baillie¹

Affiliations

¹ Global Drug Development, Novartis Pharma AG, Basel, Switzerland.
² Global Drug Development, Novartis Pharmaceuticals Corporation, East Hanover, New Jersey, USA.
³ Data Digital, Beyond Conception GmbH, Altendorf, Switzerland.
⁴ Global Drug Development, Novartis Pharma AG, Dublin, Ireland.

PMID: 38326967
DOI: 10.1002/pst.2368

Abstract

We present the motivation, experience, and learnings from a data challenge conducted at a large pharmaceutical corporation on the topic of subgroup identification. The data challenge aimed at exploring approaches to subgroup identification for future clinical trials. To mimic a realistic setting, participants had access to 4 Phase III clinical trials to derive a subgroup and predict its treatment effect on a future study not accessible to challenge participants. A total of 30 teams registered for the challenge with around 100 participants, primarily from Biostatistics organization. We outline the motivation for running the challenge, the challenge rules, and logistics. Finally, we present the results of the challenge, the participant feedback as well as the learnings. We also present our view on the implications of the results on exploratory analyses related to treatment effect heterogeneity.

Keywords: common task framework; data science; machine learning; subgroup analysis; subgroup identification.

MeSH terms

Biostatistics / methods
Clinical Trials, Phase III as Topic* / methods
Data Interpretation, Statistical
Drug Industry
Humans
Motivation*
Research Design
Treatment Outcome