A joint modeling approach to data with informative cluster size: robustness to the cluster size model

Stat Med. 2011 Jul 10;30(15):1825-36. doi: 10.1002/sim.4239. Epub 2011 Apr 15.

Abstract

In many biomedical and epidemiological studies, data are often clustered due to longitudinal follow up or repeated sampling. While in some clustered data the cluster size is pre-determined, in others it may be correlated with the outcome of subunits, resulting in informative cluster size. When the cluster size is informative, standard statistical procedures that ignore cluster size may produce biased estimates. One attractive framework for modeling data with informative cluster size is the joint modeling approach in which a common set of random effects are shared by both the outcome and cluster size models. In addition to making distributional assumptions on the shared random effects, the joint modeling approach needs to specify the cluster size model. Questions arise as to whether the joint modeling approach is robust to misspecification of the cluster size model. In this paper, we studied both asymptotic and finite-sample characteristics of the maximum likelihood estimators in joint models when the cluster size model is misspecified. We found that using an incorrect distribution for the cluster size may induce small to moderate biases, while using a misspecified functional form for the shared random parameter in the cluster size model results in nearly unbiased estimation of outcome model parameters. We also found that there is little efficiency loss under this model misspecification. A developmental toxicity study was used to motivate the research and to demonstrate the findings.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural

MeSH terms

  • Abnormalities, Drug-Induced / veterinary
  • Animals
  • Bias
  • Biomedical Research / methods*
  • Biomedical Research / statistics & numerical data
  • Cluster Analysis
  • Epidemiologic Research Design
  • Ethylene Glycol / administration & dosage
  • Ethylene Glycol / toxicity*
  • Female
  • Follow-Up Studies
  • Linear Models
  • Mice
  • Organogenesis / drug effects*
  • Pregnancy
  • Pregnancy Outcome / veterinary
  • Sample Size

Substances

  • Ethylene Glycol