5. How do we choose best method to impute missing value for a data?
Cold deck imputationA systematically chosen value from an individual who has similar values on other variables.This is similar to Hot Deck in most ways, but removes the random variation. So for example, you may always choose the third individual in the same experimental condition and block.Regression imputationThe predicted value obtained by regressing the missing variable on other variables.So instead of just taking the mean, you’re taking the predicted value, based on other variables. This preserves relationships among variables involved in the imputation model, but not variability around predicted values.Stochastic regression imputationThe predicted value from a regression plus a random residual value.This has all the advantages of regression imputation but adds in the advantages ofthe random component.Most multiple imputation is based off of some form of stochastic regression imputation.Interpolation and extrapolationAn estimated value from other observations from the same individual. It usually only works in longitudinal data.Use caution, though. Interpolation, for example, might make more sense for a variable like height in children–one that can’t go back down over time. Extrapolation means you’re estimating beyond the actual range of the data and that requires making more assumptions that you should.
You've reached the end of your free preview.
Want to read all 11 pages?