Hi there,
I have a dataset of higher ed institutions that contains an avg graduate rate and an avg transfer student graduation rate for each institution. In some cases, there is no information about avg transfer graduation rates. As such, I am imputing transfer grad rates (yvar) from the avg grad rates (xvar), as there is a strong correlation between them. I'm using the mipolate command to do this and am testing out a couple different options (linear, idw). I'd like to generate imputed transfer grad rate values based on multiple known avg grad rates, not just the one nearest or exact matching neighbor. These mipolate options seem to do that except for when an institution with a missing yvar has an exact match on xvar with another institution. When there's an exact match on xvar, I just get the same exact value from the institution that does not have the missing yvar. Does anyone know how I can do this imputation and still generate yvars based on the values of multiple xvars even when two institutions have the exact same xvar value?
Code used for imputation:
mipolate gradrate_8yr_transferstudents avg_grad_rate, gen(imp_tgr) idw
Example data below, after doing imputation. Two institutions here are missing on the yvar (gradrate_8yr_transferstudents). The first one (inst 7) received an imputed value (imp_tgr) based on multiple values of avg_grad_date in the dataset, using the idw option. Inst 10, though, has the exact same xvar value (avg_grad_rate) as inst 11, so it receives an imputed value only based on the transfer grad rate of inst 11. Ideally, I'd like for inst 10 to receive an imputed value based on multiple other values just like int 7. Is there any way to do this?
* Example generated by -dataex-. For more info, type help dataex
clear
ihe_name (gradrate_8yr_transferstudents avg_grad_rate imp_tgr)
Inst 1 .1597 7 .1597
Inst 2 .1146 8.125 .1146
Inst 3 .1924 9.222222328186035 .1924
Inst 4 .1152 9.55555534362793 .1152
Inst 5 .2743 10.222222328186035 .2743
Inst 6 .1944 10.88888931274414 .1944
Inst 7 . 11.333333015441895 .25058369178133116
Inst 8 .2806 14.44444465637207 .2806
inst 9 .3662 15.11111068725586 .3662
inst 10 . 15.222222328186035 .2475
Inst 11 .2475 15.222222328186035 .2475
Thanks for your help!
I have a dataset of higher ed institutions that contains an avg graduate rate and an avg transfer student graduation rate for each institution. In some cases, there is no information about avg transfer graduation rates. As such, I am imputing transfer grad rates (yvar) from the avg grad rates (xvar), as there is a strong correlation between them. I'm using the mipolate command to do this and am testing out a couple different options (linear, idw). I'd like to generate imputed transfer grad rate values based on multiple known avg grad rates, not just the one nearest or exact matching neighbor. These mipolate options seem to do that except for when an institution with a missing yvar has an exact match on xvar with another institution. When there's an exact match on xvar, I just get the same exact value from the institution that does not have the missing yvar. Does anyone know how I can do this imputation and still generate yvars based on the values of multiple xvars even when two institutions have the exact same xvar value?
Code used for imputation:
mipolate gradrate_8yr_transferstudents avg_grad_rate, gen(imp_tgr) idw
Example data below, after doing imputation. Two institutions here are missing on the yvar (gradrate_8yr_transferstudents). The first one (inst 7) received an imputed value (imp_tgr) based on multiple values of avg_grad_date in the dataset, using the idw option. Inst 10, though, has the exact same xvar value (avg_grad_rate) as inst 11, so it receives an imputed value only based on the transfer grad rate of inst 11. Ideally, I'd like for inst 10 to receive an imputed value based on multiple other values just like int 7. Is there any way to do this?
* Example generated by -dataex-. For more info, type help dataex
clear
ihe_name (gradrate_8yr_transferstudents avg_grad_rate imp_tgr)
Inst 1 .1597 7 .1597
Inst 2 .1146 8.125 .1146
Inst 3 .1924 9.222222328186035 .1924
Inst 4 .1152 9.55555534362793 .1152
Inst 5 .2743 10.222222328186035 .2743
Inst 6 .1944 10.88888931274414 .1944
Inst 7 . 11.333333015441895 .25058369178133116
Inst 8 .2806 14.44444465637207 .2806
inst 9 .3662 15.11111068725586 .3662
inst 10 . 15.222222328186035 .2475
Inst 11 .2475 15.222222328186035 .2475
Thanks for your help!
Comment