Predicting subgroup treatment effects for a new study: Motivations, results and learnings from running a data challenge in a pharmaceutical corporation

Pharm Stat. 2024 Jul-Aug;23(4):495-510. doi: 10.1002/pst.2368. Epub 2024 Feb 7.

Abstract

We present the motivation, experience, and learnings from a data challenge conducted at a large pharmaceutical corporation on the topic of subgroup identification. The data challenge aimed at exploring approaches to subgroup identification for future clinical trials. To mimic a realistic setting, participants had access to 4 Phase III clinical trials to derive a subgroup and predict its treatment effect on a future study not accessible to challenge participants. A total of 30 teams registered for the challenge with around 100 participants, primarily from Biostatistics organization. We outline the motivation for running the challenge, the challenge rules, and logistics. Finally, we present the results of the challenge, the participant feedback as well as the learnings. We also present our view on the implications of the results on exploratory analyses related to treatment effect heterogeneity.

Keywords: common task framework; data science; machine learning; subgroup analysis; subgroup identification.

MeSH terms

  • Biostatistics / methods
  • Clinical Trials, Phase III as Topic* / methods
  • Data Interpretation, Statistical
  • Drug Industry
  • Humans
  • Motivation*
  • Research Design
  • Treatment Outcome