Variable selection for recurrent event data via nonconcave penalized estimating function

Lifetime Data Anal. 2009 Jun;15(2):197-215. doi: 10.1007/s10985-008-9104-2. Epub 2008 Nov 26.

Abstract

Variable selection is an important issue in all regression analysis and in this paper, we discuss this in the context of regression analysis of recurrent event data. Recurrent event data often occur in long-term studies in which individuals may experience the events of interest more than once and their analysis has recently attracted a great deal of attention (Andersen et al., Statistical models based on counting processes, 1993; Cook and Lawless, Biometrics 52:1311-1323, 1996, The analysis of recurrent event data, 2007; Cook et al., Biometrics 52:557-571, 1996; Lawless and Nadeau, Technometrics 37:158-168, 1995; Lin et al., J R Stat Soc B 69:711-730, 2000). However, it seems that there are no established approaches to the variable selection with respect to recurrent event data. For the problem, we adopt the idea behind the nonconcave penalized likelihood approach proposed in Fan and Li (J Am Stat Assoc 96:1348-1360, 2001) and develop a nonconcave penalized estimating function approach. The proposed approach selects variables and estimates regression coefficients simultaneously and an algorithm is presented for this process. We show that the proposed approach performs as well as the oracle procedure in that it yields the estimates as if the correct submodel was known. Simulation studies are conducted for assessing the performance of the proposed approach and suggest that it works well for practical situations. The proposed methodology is illustrated by using the data from a chronic granulomatous disease study.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biometry
  • Granulomatous Disease, Chronic / complications
  • Granulomatous Disease, Chronic / drug therapy
  • Humans
  • Infections / etiology
  • Interferon-gamma / therapeutic use
  • Likelihood Functions
  • Models, Statistical
  • Poisson Distribution
  • Recombinant Proteins
  • Recurrence
  • Regression Analysis*

Substances

  • Recombinant Proteins
  • Interferon-gamma