Conditional generalized estimating equations for the analysis of clustered and longitudinal data

Sylvie Goetgeluk; Stijn Vansteelandt

doi:10.1111/j.1541-0420.2007.00944.x

Conditional generalized estimating equations for the analysis of clustered and longitudinal data

Biometrics. 2008 Sep;64(3):772-780. doi: 10.1111/j.1541-0420.2007.00944.x. Epub 2007 Nov 19.

Authors

Sylvie Goetgeluk¹, Stijn Vansteelandt¹

Affiliation

¹ Department of Applied Mathematics and Computer Sciences, Ghent University, Krijgslaan 281 S9, 9000 Ghent, Belgium.

PMID: 18047524
DOI: 10.1111/j.1541-0420.2007.00944.x

Abstract

A common and important problem in clustered sampling designs is that the effect of within-cluster exposures (i.e., exposures that vary within clusters) on outcome may be confounded by both measured and unmeasured cluster-level factors (i.e., measurements that do not vary within clusters). When some of these are ill/not accounted for, estimation of this effect through population-averaged models or random-effects models may introduce bias. We accommodate this by developing a general theory for the analysis of clustered data, which enables consistent and asymptotically normal estimation of the effects of within-cluster exposures in the presence of cluster-level confounders. Semiparametric efficient estimators are obtained by solving so-called conditional generalized estimating equations. We compare this approach with a popular proposal by Neuhaus and Kalbfleisch (1998, Biometrics 54, 638-645) who separate the exposure effect into a within- and a between-cluster component within a random intercept model. We find that the latter approach yields consistent and efficient estimators when the model is linear, but is less flexible in terms of model specification. Under nonlinear models, this approach may yield inconsistent and inefficient estimators, though with little bias in most practical settings.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Antidepressive Agents, Tricyclic / therapeutic use
Biometry / methods*
Clinical Trials as Topic / statistics & numerical data
Cluster Analysis*
Data Interpretation, Statistical
Depression / drug therapy
Humans
Imipramine / therapeutic use
Longitudinal Studies
Models, Statistical*

Substances

Antidepressive Agents, Tricyclic
Imipramine