Facultade de Fisioterapia

FWDselect: An R Package for Variable Selection in Regression Models

Sestelo, Marta; Martínez Villanueva, Nora; Meira Machado, L.; Roca Pardiñas, Javier
Abstract:
In multiple regression models, when there are a large number (p) of explanatory variables which may or may not be relevant for predicting the response, it is useful to be able to reduce the model. To this end, it is necessary to determine the best subset of q (q ≤ p) predictors which will establish the model with the best prediction capacity. FWDselect package introduces a new forward stepwisebased selection procedure to select the best model in different regression frameworks (parametric or nonparametric). The developed methodology, which can be equally applied to linear models, generalized linear models or generalized additive models, aims to introduce solutions to the following two topics: i) selection of the best combination of q variables by using a step-by-step method; and, perhaps, most importantly, ii) search for the number of covariates to be included in the model based on bootstrap resampling techniques. The software is illustrated using real and simulated data.
Year:
2016
Type of Publication:
Article
Journal:
The R Journal
Volume:
8
Number:
1
Pages:
132-148
Month:
August
Hits: 1925