Adjusting for unmeasured confounding due to either of two crossed factors with a logistic regression model

Stat Med. 2016 Aug 15;35(18):3179-88. doi: 10.1002/sim.6916. Epub 2016 Feb 19.

Abstract

Motivated by an investigation of the effect of surface water temperature on the presence of Vibrio cholerae in water samples collected from different fixed surface water monitoring sites in Haiti in different months, we investigated methods to adjust for unmeasured confounding due to either of the two crossed factors site and month. In the process, we extended previous methods that adjust for unmeasured confounding due to one nesting factor (such as site, which nests the water samples from different months) to the case of two crossed factors. First, we developed a conditional pseudolikelihood estimator that eliminates fixed effects for the levels of each of the crossed factors from the estimating equation. Using the theory of U-Statistics for independent but non-identically distributed vectors, we show that our estimator is consistent and asymptotically normal, but that its variance depends on the nuisance parameters and thus cannot be easily estimated. Consequently, we apply our estimator in conjunction with a permutation test, and we investigate use of the pigeonhole bootstrap and the jackknife for constructing confidence intervals. We also incorporate our estimator into a diagnostic test for a logistic mixed model with crossed random effects and no unmeasured confounding. For comparison, we investigate between-within models extended to two crossed factors. These generalized linear mixed models include covariate means for each level of each factor in order to adjust for the unmeasured confounding. We conduct simulation studies, and we apply the methods to the Haitian data. Copyright © 2016 John Wiley & Sons, Ltd.

Keywords: between-within model; composite likelihood; conditional likelihood; confounding; logistic regression; pseudolikelihood.

MeSH terms

  • Computer Simulation
  • Confounding Factors, Epidemiologic
  • Data Interpretation, Statistical*
  • Haiti
  • Linear Models
  • Logistic Models*
  • Vibrio cholerae / isolation & purification
  • Water Supply