Dear Statalisters,
I apologize in advance because this might be a bit of a simple question, but I just can't seem to figure it out, so I hope someone is able to help. I am planning to do an ordered probit regression with a number of (more or less objective) health indicators as independent variables and subjective health status as dependent variable. However, since my dataset contains men and women between 50 and 65 years of age from 4 different countries, I want to standardize these objective health indicators by age and sex, yet I am not sure how to do this exactly.
I thought about using -egen newvar = std(oldvar)- but this option cannot be combined with 'by' (nor with the bysort prefix), which means that it does not specifically standardize oldvar for age nor sex. The alternative would be -dstdize- or -idstdize- I think, but I am not sure whether this would be the appropriate command in my case. If I understand correctly, -dstdize- is typically used to calculate standardized mortality rates, but it does not generate a new variable. However, what I am trying to do is convert the current health variables into new variables that are standardized for age and sex, use these in an oprobit regression with subjective health as Y (incl. country dummies), and finally, generate a adjusted health index (via oprobit post-estimation).
Does anyone have an idea whether -dstdize- would be a suitable command, and if it can be used to generate new (standardized) variables? If not: any ideas on a different approach?
Thanks in advance.
I apologize in advance because this might be a bit of a simple question, but I just can't seem to figure it out, so I hope someone is able to help. I am planning to do an ordered probit regression with a number of (more or less objective) health indicators as independent variables and subjective health status as dependent variable. However, since my dataset contains men and women between 50 and 65 years of age from 4 different countries, I want to standardize these objective health indicators by age and sex, yet I am not sure how to do this exactly.
I thought about using -egen newvar = std(oldvar)- but this option cannot be combined with 'by' (nor with the bysort prefix), which means that it does not specifically standardize oldvar for age nor sex. The alternative would be -dstdize- or -idstdize- I think, but I am not sure whether this would be the appropriate command in my case. If I understand correctly, -dstdize- is typically used to calculate standardized mortality rates, but it does not generate a new variable. However, what I am trying to do is convert the current health variables into new variables that are standardized for age and sex, use these in an oprobit regression with subjective health as Y (incl. country dummies), and finally, generate a adjusted health index (via oprobit post-estimation).
Does anyone have an idea whether -dstdize- would be a suitable command, and if it can be used to generate new (standardized) variables? If not: any ideas on a different approach?
Thanks in advance.
Comment