Hi Stata Users,
First, let me apologize in case this may not be a Stata question in a strict manner. However, I believe I can still benefit from the vast knowledge of members of this group.
I am performing multiple imputation using the code below
and attached dataset .
I then perform some robustness checks and find whereas the mean of the imputed distribution is accurate (we know this since we have the population estimate!), Kolmogorov-Smirnov test of equality of distributions of original and imputed variables shows they are different. Visual exploration by use of kdensity function shows the same.

I notice that whereas the mean of the distributions are similar (dotted green line superimposed on the continuous red line), standard deviation seems a bit different and am wondering whether there’s a way to try and address this.
Thanks in advance!
First, let me apologize in case this may not be a Stata question in a strict manner. However, I believe I can still benefit from the vast knowledge of members of this group.
I am performing multiple imputation using the code below
Code:
mi set wide mi register imputed pr_attend mi impute chained (regress) pr_attend, add(20) by(age) mi estimate: regress pr_attend hhsize hh_head_no_educ clust_literacy num_children hh_member_formal_empl dep_ratio hh_orphan i.hv024
I then perform some robustness checks and find whereas the mean of the imputed distribution is accurate (we know this since we have the population estimate!), Kolmogorov-Smirnov test of equality of distributions of original and imputed variables shows they are different. Visual exploration by use of kdensity function shows the same.
I notice that whereas the mean of the distributions are similar (dotted green line superimposed on the continuous red line), standard deviation seems a bit different and am wondering whether there’s a way to try and address this.
Thanks in advance!
Comment