Hi All
I have some general inquiries about data management post multiple imputation:
I have a longitudinal dataset with imputed data (data was imputed in the wide format). Post imputation, I converted the dataset to mlong (so data stays in wide format but imputations are in rows).
Here comes my confusion:
I was under the impression that data management in the mlong format (such recoding or generating new variables based on existing imputed variables) wouldn't require the mi: passive. But I think this is wrong?
For example, I have a categorical variable childhood social class, that I would like to recode to have fewer categories:
I don't know how to correctly recode a categorical variable like the one above, as mi est: does not support recode. And the above only recodes for those where _mi_m==0 (In hindsight I should have done this before imputing).
For a continuos variable like BMI, I require quadratic BMI for my models:
mi passive: gen bmi3 = bmi3*bmi3
But the following:
works only for _mi_m==0 and this would effect the N in any regression model even when run with mi est:
will not include the total N...
What are the best rules for data management post imputation?
Many Thanks
/Amal
I have some general inquiries about data management post multiple imputation:
I have a longitudinal dataset with imputed data (data was imputed in the wide format). Post imputation, I converted the dataset to mlong (so data stays in wide format but imputations are in rows).
Here comes my confusion:
I was under the impression that data management in the mlong format (such recoding or generating new variables based on existing imputed variables) wouldn't require the mi: passive. But I think this is wrong?
For example, I have a categorical variable childhood social class, that I would like to recode to have fewer categories:
Code:
tab childses tab childses, nolab recode childses (1 2 = 1) (3 = 2) (4 = 3) (5 = 4), gen(childsesx) tab childsesx label define childsesx 1 "I&II Professional/Managerial" 2 "IV&V partly/unskilled" 3 "III non-manual" 4 "III manual" label values childsesx childsesx tab childsesx
For a continuos variable like BMI, I require quadratic BMI for my models:
mi passive: gen bmi3 = bmi3*bmi3
But the following:
Code:
gen bmi3 = bmi3*bmi3
Code:
mi est: bmi i.childsesx
What are the best rules for data management post imputation?
Many Thanks
/Amal
Comment