Facultade de Fisioterapia

Model checks for nonparametric regression with missing data: a comparative study

Cotos Yáñez, Tomás R.; Pérez González, Ana; González Manteiga, W.
Abstract:
This paper analyses the behaviour of the goodness-of-fit tests for regression models. To this end, it uses statistics based on an estimation of the integrated regression function with missing observations either in the response variable or in some of the covariates. It proposes several versions of one empirical process, constructed from a previous estimation, that uses only the complete observations or replaces the missing observations with imputed values. In the case of missing covariates, a link model is used to fill the missing observations with other complete covariates. In all the situations, Bootstrap methodology is used to calibrate the distribution of the test statistics. A broad simulation study compares the different procedures based on empirical regression methodology, with smoothed tests previously studied in the literature. The comparison reflects the effect of the correlation between the covariates in the tests based on the imputed sample for missing covariates. In addition, the paper proposes a computational binning strategy to evaluate the tests based on an empirical process for large data sets. Finally, two applications to real data illustrate the performance of the tests.
Year:
2016
Type of Publication:
Article
Keywords:
Missing data in regression models; goodness-of-fit test; empirical process
Journal:
Journal of Statistical Computation and Simulation
Volume:
86
Number:
16
Pages:
3188-3204
Month:
November
DOI:
https://doi.org/10.1080/00949655.2016.1156114
Hits: 1786