I have a question about using mi imputation regress for multiple imputation. I am using survey data, and the outcome variable that I want to use for subsequent analysis contains missing data (c.10% of the sample). As the outcome variable is univariate and continuous, as advised by the Stata "mi impute" help page, I am using a linear regression model as an imputation model to obtain m=20 imputed values of my outcome variable (hence "mi impute regress").
However, although the code runs properly and I obtain m=20 vectors or completed data for my outcome variable, I don't fully understand how this works: if the imputed outcome variables stem from a linear regression based on other observed variables, how is it possible to obtain m=20 different vectors? If the imputed values are predicted values from a linear regression, I should only be able to get a single imputed vector for my outcome variable, as the prediction from a linear regression canot give me several different outputs.
Thank you for your help.
However, although the code runs properly and I obtain m=20 vectors or completed data for my outcome variable, I don't fully understand how this works: if the imputed outcome variables stem from a linear regression based on other observed variables, how is it possible to obtain m=20 different vectors? If the imputed values are predicted values from a linear regression, I should only be able to get a single imputed vector for my outcome variable, as the prediction from a linear regression canot give me several different outputs.
Thank you for your help.
Comment