Generalized additive modeling with implicit variable selection by likelihood-based boosting

Gerhard Tutz; Harald Binder

doi:10.1111/j.1541-0420.2006.00578.x

Generalized additive modeling with implicit variable selection by likelihood-based boosting

Biometrics. 2006 Dec;62(4):961-71. doi: 10.1111/j.1541-0420.2006.00578.x.

Authors

Gerhard Tutz¹, Harald Binder

Affiliation

¹ Institut für Statistik, Ludwig-Maximilians-Universität München Akademiestr. 1, D-80799 München, Germany. [email protected]

PMID: 17156269
DOI: 10.1111/j.1541-0420.2006.00578.x

Abstract

The use of generalized additive models in statistical data analysis suffers from the restriction to few explanatory variables and the problems of selection of smoothing parameters. Generalized additive model boosting circumvents these problems by means of stagewise fitting of weak learners. A fitting procedure is derived which works for all simple exponential family distributions, including binomial, Poisson, and normal response variables. The procedure combines the selection of variables and the determination of the appropriate amount of smoothing. Penalized regression splines and the newly introduced penalized stumps are considered as weak learners. Estimates of standard deviations and stopping criteria, which are notorious problems in iterative procedures, are based on an approximate hat matrix. The method is shown to be a strong competitor to common procedures for the fitting of generalized additive models. In particular, in high-dimensional settings with many nuisance predictor variables it performs very well.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Biometry
Data Interpretation, Statistical
Humans
Likelihood Functions
Linear Models
Mental Disorders / therapy
Models, Statistical*
Patient Readmission / statistics & numerical data
Poisson Distribution
Regression Analysis