Generalized additive modeling with implicit variable selection by likelihood-based boosting

Biometrics. 2006 Dec;62(4):961-71. doi: 10.1111/j.1541-0420.2006.00578.x.

Abstract

In statistical data analysis, the use of generalized additive models is restricted in practice to a small number of explanatory variables and is complicated by the need to select smoothing parameters. Generalized additive model boosting circumvents these problems by means of stagewise fitting of weak learners. A fitting procedure is derived that works for all simple exponential family distributions, including binomial, Poisson, and normal response variables. The procedure combines the selection of variables with the determination of the appropriate amount of smoothing. Penalized regression splines and the newly introduced penalized stumps are considered as weak learners. Estimates of standard deviations and stopping criteria, which are notoriously difficult to obtain in iterative procedures, are based on an approximate hat matrix. The method is shown to be a strong competitor to common procedures for fitting generalized additive models. It performs particularly well in high-dimensional settings with many nuisance predictor variables.
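The stagewise idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and it rests on several simplifying assumptions: the response is normal with identity link (where a single penalized Fisher scoring step reduces to penalized least squares on the current residuals), the weak learner is a ridge-penalized piecewise-linear basis rather than the penalized regression splines or penalized stumps of the paper, and the stopping criterion is an AIC-type quantity computed from the trace of the hat matrix. The names hinge_basis and componentwise_boost, the knot count, and the penalty value are all hypothetical choices made for illustration.

```python
import numpy as np

def hinge_basis(x, n_knots=8):
    """Centered piecewise-linear basis for one covariate (illustrative weak-learner basis)."""
    knots = np.quantile(x, np.linspace(0, 1, n_knots + 2)[1:-1])
    B = np.column_stack([x] + [np.maximum(x - k, 0.0) for k in knots])
    return B - B.mean(axis=0)

def componentwise_boost(X, y, n_steps=50, penalty=50.0):
    """Stagewise fit: each step updates only the single best covariate."""
    n, p = X.shape
    # One heavily penalized smoother matrix per covariate (the weak learners).
    smoothers = []
    for j in range(p):
        B = hinge_basis(X[:, j])
        ridge = penalty * np.eye(B.shape[1])
        smoothers.append(B @ np.linalg.solve(B.T @ B + ridge, B.T))

    fit = np.full(n, y.mean())
    H = np.full((n, n), 1.0 / n)   # hat matrix of the intercept-only fit
    selected, aic = [], []
    for _ in range(n_steps):
        resid = y - fit
        # Fit every candidate learner to the residuals, keep the best one.
        updates = [S @ resid for S in smoothers]
        rss = [np.sum((resid - u) ** 2) for u in updates]
        j = int(np.argmin(rss))
        fit = fit + updates[j]
        # Hat matrix recursion: fit_m = [H + S_j (I - H)] y
        H = H + smoothers[j] @ (np.eye(n) - H)
        selected.append(j)
        aic.append(n * np.log(rss[j] / n) + 2.0 * np.trace(H))
    return fit, selected, np.array(aic)

# Simulated example: 2 informative covariates, 8 nuisance covariates.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 10))
y = np.sin(3.0 * X[:, 0]) + 2.0 * X[:, 1] ** 2 + rng.normal(0.0, 0.2, size=200)

fit, selected, aic = componentwise_boost(X, y)
best_step = int(np.argmin(aic)) + 1   # suggested number of boosting steps
```

Covariates that are never selected before the chosen stopping step drop out of the model, which is how variable selection happens implicitly. The penalty controls how weak each learner is, so the amount of smoothing per covariate is determined by how often that covariate is selected rather than by a separately tuned smoothing parameter.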

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biometry
  • Data Interpretation, Statistical
  • Humans
  • Likelihood Functions
  • Linear Models
  • Mental Disorders / therapy
  • Models, Statistical*
  • Patient Readmission / statistics & numerical data
  • Poisson Distribution
  • Regression Analysis