184
Views
36
CrossRef citations to date
0
Altmetric
Primary Article

Tuning Variable Selection Procedures by Adding Noise

, &
Pages 165-175 | Published online: 01 Jan 2012
 

Abstract

Many variable selection methods for linear regression depend critically on tuning parameters that control the performance of the method, for example, “entry” and “stay” significance levels in forward and backward selection. However, most methods do not adapt the tuning parameters to particular datasets. We propose a general strategy for adapting variable selection tuning parameters that effectively estimates the tuning parameters so that the selection method avoids overfitting and underfitting. The strategy is based on the principle that overfitting and underfitting can be directly observed in estimates of the error variance after adding controlled amounts of additional independent noise to the response variable, then running a variable selection method. It is related to the simulation technique SIMEX found in the measurement error literature. We focus on forward selection because of its simplicity and ability to handle large numbers of explanatory variables. Monte Carlo studies show that the new method compares favorably with established methods.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.