I estimate two models of the impact of conflict on education:
- The first model is based on a single post-conflict household survey and compares a young (affected) cohort with an old (unaffected) cohort.
- The second model is based on pre- and post-conflict household surveys (repeated cross-sections) and compares a young cohort before the conflict (unaffected) with a young cohort after the conflict (affected).
For Model 1, I start with the following regression:
Code:
regress $dependent young conflict did $controls_ind $controls_hh
Then I add area and age fixed effects. As expected, Stata drops 'young' and 'conflict' due to multicollinearity:
Code:
regress $dependent did $controls_ind $controls_hh i.area i.age
For Model 2, I start with the following regression:
Code:
regress $dependent post conflict did $controls_ind $controls_hh
I again add area and age fixed effects. Stata drops 'conflict', but not 'post':
Code:
regress $dependent did post $controls_ind $controls_hh i.area i.age
My questions are:
- Why does Stata not drop 'post' in the fourth equation? I would have expected it to be dropped, similar to 'cohort' in my third equation.
- Should I keep it in the model or remove it?
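For reference, here is a sketch of the two fixed-effects specifications in equation form. The notation is my own shorthand and rests on assumptions not stated in the commands above: that 'did' is the interaction of the cohort or period dummy with 'conflict', that 'conflict' varies only across areas, and that 'young' is determined by age. The symbols $\alpha_a$ and $\lambda_g$ stand for the area and age fixed effects from i.area and i.age, and $X$ collects $controls_ind and $controls_hh.

```latex
% Individual i, area a, age g, survey wave t (notation assumed for illustration)
\begin{align}
% Model 1: single post-conflict survey, young vs. old cohort
y_{iag}  &= \beta\,\mathit{did}_{ag} + X_{iag}'\gamma + \alpha_a + \lambda_g + \varepsilon_{iag},
  & \mathit{did}_{ag} &= \mathit{young}_g \times \mathit{conflict}_a \\
% Model 2: pre- and post-conflict surveys, young cohort before vs. after
y_{iagt} &= \delta\,\mathit{did}_{at} + \theta\,\mathit{post}_t + X_{iagt}'\gamma + \alpha_a + \lambda_g + \varepsilon_{iagt},
  & \mathit{did}_{at} &= \mathit{post}_t \times \mathit{conflict}_a
\end{align}
```

Written this way, the second question is whether the $\theta\,\mathit{post}_t$ term belongs in Model 2 once $\alpha_a$ and $\lambda_g$ are included.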