Hello everyone!
We're researching into the mental health trajectories of immigrants and natives in Germany over time. For that purpose, we're using unbalanced panel data. We're estimating the within-person-changes with two fixed-effects models for immigrants and natives each, with mental health (mh) as outcome and time dummies (2020, 2018, 2016 etc. – survey every two years) as exposure, controlling for age. We want to visualize the results using coefplot.
Our main question is: As to which degree does our model
xtreg mh i.time_dummies c.age#c.age if immigrant==1, fe vce(robust)
consider values of those who didn't participate every time? We have quite large standard errors, which was somehow expected, especially for the immigrant group. But those standard errors are rising quickly (for all time points) when we include more years in our time dummy variable. We struggle a bit to find an explanation for that. At the same time, the regression output tells us that all immigrant observations are included ("Number of obs").
How does Stata work with unbalanced data here?
We would be very happy to find out.
Thanks in advance,
Henning
We're researching into the mental health trajectories of immigrants and natives in Germany over time. For that purpose, we're using unbalanced panel data. We're estimating the within-person-changes with two fixed-effects models for immigrants and natives each, with mental health (mh) as outcome and time dummies (2020, 2018, 2016 etc. – survey every two years) as exposure, controlling for age. We want to visualize the results using coefplot.
Our main question is: As to which degree does our model
xtreg mh i.time_dummies c.age#c.age if immigrant==1, fe vce(robust)
consider values of those who didn't participate every time? We have quite large standard errors, which was somehow expected, especially for the immigrant group. But those standard errors are rising quickly (for all time points) when we include more years in our time dummy variable. We struggle a bit to find an explanation for that. At the same time, the regression output tells us that all immigrant observations are included ("Number of obs").
How does Stata work with unbalanced data here?
We would be very happy to find out.
Thanks in advance,
Henning
Comment