Estimation and Accuracy after Model Selection

J Am Stat Assoc. 2014 Jul 1;109(507):991-1007. doi: 10.1080/01621459.2013.823775.

Abstract

Classical statistical theory ignores model selection in assessing estimation accuracy. Here we consider bootstrap methods for computing standard errors and confidence intervals that take model selection into account. The methodology involves bagging, also known as bootstrap smoothing, to tame the erratic discontinuities of selection-based estimators. A useful new formula for the accuracy of bagging then provides standard errors for the smoothed estimators. Two examples, nonparametric and parametric, are carried through in detail: a regression model where the choice of degree (linear, quadratic, cubic, …) is determined by the Cp criterion, and a Lasso-based estimation problem.

Keywords: ABC intervals; Cp; Lasso; bagging; bootstrap smoothing; importance sampling; model averaging.