Estimation and Accuracy after Model Selection

Bradley Efron

doi:10.1080/01621459.2013.823775

J Am Stat Assoc. 2014 Jul 1;109(507):991-1007. doi: 10.1080/01621459.2013.823775.

Author

Bradley Efron¹

Affiliation

¹ Stanford University.

Abstract

Classical statistical theory ignores model selection in assessing estimation accuracy. Here we consider bootstrap methods for computing standard errors and confidence intervals that take model selection into account. The methodology involves bagging, also known as bootstrap smoothing, to tame the erratic discontinuities of selection-based estimators. A useful new formula for the accuracy of bagging then provides standard errors for the smoothed estimators. Two examples, nonparametric and parametric, are carried through in detail: a regression model where the choice of degree (linear, quadratic, cubic, …) is determined by the C_p criterion, and a Lasso-based estimation problem.
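As a rough illustration of the nonparametric regression example, the sketch below bags a C_p-selected polynomial fit and attaches a standard error to the smoothed estimate. The abstract does not state the accuracy formula, so the sketch assumes the nonparametric delta-method form: the square root of the summed squared bootstrap covariances between each observation's resampling count and the replicated estimate. The function names, the simulated data, the choice of B, and the use of the largest model's residual variance inside C_p are all illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def cp_select_fit(x, y, max_degree, sigma2):
    """Fit polynomials of degree 1..max_degree; return the Cp-minimizing fit."""
    best = None
    for d in range(1, max_degree + 1):
        X = np.vander(x, d + 1, increasing=True)      # columns 1, x, ..., x^d
        beta, *_ = np.linalg.lstsq(X, y, rcond=None)
        rss = float(np.sum((y - X @ beta) ** 2))
        cp = rss + 2.0 * sigma2 * (d + 1)             # Cp penalty on parameter count
        if best is None or cp < best[0]:
            best = (cp, d, beta)
    return best[1], best[2]

def smoothed_estimate(x, y, x0, max_degree=4, B=200, seed=0):
    """Bagged (bootstrap-smoothed) prediction at x0 with two standard errors."""
    rng = np.random.default_rng(seed)
    n = len(y)
    # Assumption: sigma^2 for Cp is estimated once, from the largest model.
    X_full = np.vander(x, max_degree + 1, increasing=True)
    beta_full, *_ = np.linalg.lstsq(X_full, y, rcond=None)
    sigma2 = float(np.sum((y - X_full @ beta_full) ** 2)) / (n - max_degree - 1)

    t = np.empty(B)            # selection-based estimate on each bootstrap sample
    counts = np.empty((B, n))  # how often each observation appears in each sample
    for i in range(B):
        idx = rng.integers(0, n, size=n)
        counts[i] = np.bincount(idx, minlength=n)
        _, beta = cp_select_fit(x[idx], y[idx], max_degree, sigma2)
        t[i] = np.polyval(beta[::-1], x0)  # polyval expects highest power first

    smooth = t.mean()                      # bagged estimate
    # Covariance between resampling counts and replicates, one per observation.
    cov = ((counts - counts.mean(axis=0)) * (t - t.mean())[:, None]).mean(axis=0)
    se_smooth = float(np.sqrt(np.sum(cov ** 2)))  # assumed delta-method formula
    se_boot = float(t.std(ddof=1))                # usual bootstrap SE of unsmoothed t
    return float(smooth), se_smooth, se_boot

# Hypothetical simulated data: a linear truth with Gaussian noise.
rng_demo = np.random.default_rng(1)
x_demo = np.linspace(0.0, 1.0, 40)
y_demo = 1.0 + 2.0 * x_demo + rng_demo.normal(scale=0.5, size=x_demo.size)
smooth, se_smooth, se_boot = smoothed_estimate(x_demo, y_demo, x0=0.5)
```

The resampling counts are stored alongside each replicate so that the same B bootstrap samples serve both the smoothing and the standard-error calculation, with no second level of resampling.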

Keywords: ABC intervals; C_p; Lasso; bagging; bootstrap smoothing; importance sampling; model averaging.
