Hello,
I am imputing missing categorical data using logit for binary and mlogit for categorical variables.
After running my imputation model, the imputed values are all integers (categories). Ex. if I imputed the variable female (0=male, 1=female), all imputed values are either 0 or 1. This seems strange to me, as I would have expected a value between 0 and 1 as the outcome of a logistic regression.
This occurs both when using mi chained and when using the ice package: example below of imputing female with both methods.
mi impute chained (logit) female = i.hincfel i.emp6 i.marstat i.educ3 agea, add(3) double rseed(10) force noisily
ice female m.hincfel m.emp6 m.marstat o.educ3 agea, gen(miss) m(2)
Having thoroughly searched the mi and ice documentation, I can't find information on how integers / categories are created from logistic or multinomial logistic imputation. Ex. Does a predicted probability of and above generate a 1, while all predictions below 0.50 yield a 0? How does this apply to multinomial logistic regression?
If anyone can point me to documentation explaining this process, I would be very grateful.
I am imputing missing categorical data using logit for binary and mlogit for categorical variables.
After running my imputation model, the imputed values are all integers (categories). Ex. if I imputed the variable female (0=male, 1=female), all imputed values are either 0 or 1. This seems strange to me, as I would have expected a value between 0 and 1 as the outcome of a logistic regression.
This occurs both when using mi chained and when using the ice package: example below of imputing female with both methods.
mi impute chained (logit) female = i.hincfel i.emp6 i.marstat i.educ3 agea, add(3) double rseed(10) force noisily
ice female m.hincfel m.emp6 m.marstat o.educ3 agea, gen(miss) m(2)
Having thoroughly searched the mi and ice documentation, I can't find information on how integers / categories are created from logistic or multinomial logistic imputation. Ex. Does a predicted probability of and above generate a 1, while all predictions below 0.50 yield a 0? How does this apply to multinomial logistic regression?
If anyone can point me to documentation explaining this process, I would be very grateful.
Comment