Bayesian modeling of the covariance structure for irregular longitudinal data using the partial autocorrelation function

Stat Med. 2015 May 30;34(12):2004-18. doi: 10.1002/sim.6465. Epub 2015 Mar 12.

Abstract

In long-term follow-up studies, irregular longitudinal data are observed when individuals are assessed repeatedly over time but at time points that are irregularly spaced and not common across individuals. Modeling the covariance structure for this type of data is challenging, as it requires specification of a covariance function that is positive definite. Moreover, in certain settings, careful modeling of the covariance structure for irregular longitudinal data can be crucial to ensure that no bias arises in the mean structure. Two common settings where this occurs are studies with 'outcome-dependent follow-up' and studies with 'ignorable missing data'. 'Outcome-dependent follow-up' occurs when individuals with a history of poor health outcomes have more follow-up measurements and shorter intervals between repeated measurements. When the follow-up time process depends only on previous outcomes, likelihood-based methods can still provide consistent estimates of the regression parameters without any model for the follow-up time process, provided that both the mean and covariance structures of the irregular longitudinal data are correctly specified. For 'ignorable missing data', the missing data mechanism does not need to be specified, but valid likelihood-based inference requires correct specification of the covariance structure. In both cases, flexible modeling approaches for the covariance structure are essential. In this paper, we develop a flexible approach to modeling the covariance structure for irregular continuous longitudinal data using the partial autocorrelation function and the variance function. In particular, we propose semiparametric non-stationary partial autocorrelation function models, which, unlike models for the autocorrelation function, do not suffer from complex positive definiteness restrictions. We describe a Bayesian approach, discuss computational issues, and apply the proposed methods to CD4 count data from a pediatric AIDS clinical trial.

Keywords: missing data; non-stationary covariance function; outcome-dependent follow-up; penalized splines; semiparametric covariance function.
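
The abstract's point that partial autocorrelations avoid positive definiteness restrictions can be illustrated with a small numerical sketch. The code below is not the authors' implementation; it assumes the standard recursion that maps partial autocorrelations (each conditioning on the intermediate measurements) to ordinary correlations, and the function name pacf_to_corr and the random demo values are hypothetical. Any values strictly between -1 and 1 in the upper triangle should yield a valid correlation matrix.

```python
import numpy as np


def pacf_to_corr(pacf):
    """Map partial autocorrelations to a correlation matrix.

    pacf[j, k] (for j < k) is taken to be the partial correlation between
    measurements j and k given the measurements strictly between them.
    Only the upper triangle is read. This is an illustrative sketch, not
    the model fitted in the paper.
    """
    p = pacf.shape[0]
    R = np.eye(p)
    # Fill the matrix one lag at a time, so that every correlation needed
    # for the conditioning set is already available.
    for lag in range(1, p):
        for j in range(p - lag):
            k = j + lag
            if lag == 1:
                # No intermediate measurements: partial equals marginal.
                rho = pacf[j, k]
            else:
                mid = slice(j + 1, k)
                R2 = R[mid, mid]   # correlations among intermediate measurements
                r1 = R[j, mid]     # correlations of j with the intermediates
                r3 = R[k, mid]     # correlations of k with the intermediates
                a = np.linalg.solve(R2, r1)
                b = np.linalg.solve(R2, r3)
                scale = np.sqrt((1.0 - r1 @ a) * (1.0 - r3 @ b))
                rho = r1 @ b + pacf[j, k] * scale
            R[j, k] = R[k, j] = rho
    return R


# Hypothetical demo: unconstrained values in (-1, 1) for five measurements.
rng = np.random.default_rng(1)
demo_pacf = np.triu(rng.uniform(-0.8, 0.8, size=(5, 5)), k=1)
demo_corr = pacf_to_corr(demo_pacf)
print(np.linalg.eigvalsh(demo_corr).min() > 0)
```

Because each partial autocorrelation can vary freely in (-1, 1), a prior or a smooth function of time can be placed on these quantities directly, whereas the ordinary correlations would have to satisfy joint constraints to keep the matrix positive definite.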

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Analysis of Variance
  • Anti-HIV Agents / administration & dosage
  • Bayes Theorem*
  • Bias
  • CD4 Lymphocyte Count
  • Child
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Dose-Response Relationship, Drug
  • HIV Infections / drug therapy
  • HIV Infections / immunology
  • Humans
  • Likelihood Functions
  • Linear Models
  • Longitudinal Studies*
  • Randomized Controlled Trials as Topic / statistics & numerical data
  • Research Design*
  • Zidovudine / administration & dosage

Substances

  • Anti-HIV Agents
  • Zidovudine