Pairwise Accelerated Failure Time Regression Models for Infectious Disease Transmission in Close-Contact Groups With External Sources of Infection

Yushuf Sharker; Zaynab Diallo; Wasiur R KhudaBukhsh; Eben Kenah

doi:10.1002/sim.10226

Pairwise Accelerated Failure Time Regression Models for Infectious Disease Transmission in Close-Contact Groups With External Sources of Infection

Stat Med. 2024 Oct 3. doi: 10.1002/sim.10226. Online ahead of print.

Authors

Yushuf Sharker¹, Zaynab Diallo², Wasiur R KhudaBukhsh³, Eben Kenah²

Affiliations

¹ Data Sciences Institute, Takeda Pharmaceuticals USA, Cambridge, Massachusetts, USA.
² Biostatistics Division, College of Public Health, The Ohio State University, Columbus, Ohio, USA.
³ School of Mathematical Sciences, University of Nottingham, Nottingham, UK.

PMID: 39362790
DOI: 10.1002/sim.10226

Abstract

Many important questions in infectious disease epidemiology involve associations between covariates (e.g., age or vaccination status) and infectiousness or susceptibility. Because disease transmission produces dependent outcomes, these questions are difficult or impossible to address using standard regression models from biostatistics. Pairwise survival analysis handles dependent outcomes by calculating likelihoods in terms of contact interval distributions in ordered pairs of individuals. The contact interval in the ordered pair $i j$ is the time from the onset of infectiousness in $i$ to infectious contact from $i$ to $j$ , where an infectious contact is sufficient to infect $j$ if they are susceptible. Here, we introduce a pairwise accelerated failure time regression model for infectious disease transmission that allows the rate parameter of the contact interval distribution to depend on individual-level infectiousness covariates for $i$ , individual-level susceptibility covariates for $j$ , and pair-level covariates (e.g., type of relationship). This model can simultaneously handle internal infections (caused by transmission between individuals under observation) and external infections (caused by environmental or community sources of infection). We show that this model produces consistent and asymptotically normal parameter estimates. In a simulation study, we evaluate bias and confidence interval coverage probabilities, explore the role of epidemiologic study design, and investigate the effects of model misspecification. We use this regression model to analyze household data from Los Angeles County during the 2009 influenza A (H1N1) pandemic, where we find that the ability to account for external sources of infection increases the statistical power to estimate the effect of antiviral prophylaxis.

Keywords: accelerated failure time model; infectious disease epidemiology; secondary attack risk; survival analysis.

Abstract

Grants and funding