ConvSCCS: convolutional self-controlled case series model for lagged adverse event detection

Biostatistics. 2020 Oct 1;21(4):758-774. doi: 10.1093/biostatistics/kxz003.

Abstract

With the increased availability of large electronic health records databases comes the chance of enhancing health risks screening. Most post-marketing detection of adverse drug reaction (ADR) relies on physicians' spontaneous reports, leading to under-reporting. To take up this challenge, we develop a scalable model to estimate the effect of multiple longitudinal features (drug exposures) on a rare longitudinal outcome. Our procedure is based on a conditional Poisson regression model also known as self-controlled case series (SCCS). To overcome the need of precise risk periods specification, we model the intensity of outcomes using a convolution between exposures and step functions, which are penalized using a combination of group-Lasso and total-variation. Up to our knowledge, this is the first SCCS model with flexible intensity able to handle multiple longitudinal features in a single model. We show that this approach improves the state-of-the-art in terms of mean absolute error and computation time for the estimation of relative risks on simulated data. We apply this method on an ADR detection problem, using a cohort of diabetic patients extracted from the large French national health insurance database (SNIIRAM), a claims database containing medical reimbursements of more than 53 million people. This work has been done in the context of a research partnership between Ecole Polytechnique and CNAMTS (in charge of SNIIRAM).

Keywords: Conditional Poisson model; Penalization; Risk screening; Scalability; Self-controlled case series; Total variation.

MeSH terms

  • Cohort Studies
  • Databases, Factual
  • Drug-Related Side Effects and Adverse Reactions* / epidemiology
  • Electronic Health Records
  • Humans
  • Research Design