When using multiple imputation by chained equations to impute several MAR variables, some sources suggest the following steps:
The problem is that mi impute chained does not do steps 1 and 2!
As a result, many values cannot be imputed because of missingness in the independent / auxiliary variables.
Is there a way to get the STATA mi package or a 3rd party package to do step 1 and 2 as part of the mi workflow??
- Use mean value imputation for all missing variables as placeholders
- Set the placeholder back to missing for one variable to be imputed ("var")
- Use regression imputation for "var" (benefitting from complete case data thanks to step 1)
- Repeat steps 2-3 for each variable you want to impute
- Repeat steps 2-4 for a given number of cycles, updating the imputations each cycle, resulting in one imputed dataset
- Repeat steps 1-5 for a given number of imputations
The problem is that mi impute chained does not do steps 1 and 2!
As a result, many values cannot be imputed because of missingness in the independent / auxiliary variables.
Is there a way to get the STATA mi package or a 3rd party package to do step 1 and 2 as part of the mi workflow??
Comment