6533b7d6fe1ef96bd1265bd0
RESEARCH PRODUCT
Regression with imputed covariates: A generalized missing-indicator approach
Franco PeracchiValentino DardanoniSalvatore Modicasubject
Economics and EconometricsApplied MathematicsRegression analysisMissing dataRegressionSet (abstract data type)Reduction (complexity)Economic dataBias of an estimatorStatisticsCovariateMissing covariates ImputationsBias precision trade-off Model reduction Model averaging BMI and incomeEconometricsStatistics::MethodologyC12C13C19Missing covariatesImputationsBias-precision trade-offModel reductionModel averagingBMI and incomeMathematicsdescription
A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper, we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then be tackled by either model reduction procedures or model averaging methods. We illustrate our approach by considering the problem of estimating the relation between income and the body mass index (BMI) using survey data affected by item non-response, where the missing values on the main covariates are filled in by imputations.
year | journal | country | edition | language |
---|---|---|---|---|
2011-06-01 |