Dear Statalist,
Thank you for taking the time to read this post. My question pertains to whether or not to control for year dummies (i.year) in my unbalanced panel data. Any guidance would be greatly appreciated.
I am working with unbalanced panel data and using FE Poisson/PPMLHDFE to explore my research question. My dataset has 110 cross-sectional units (N = 110) and spans 40 years (T = 40). My hypothesis posits that the U.S. changes its foreign policies during specific years. To account for this, I created a dummy variable, "ab_signal," representing these specific years and have interacted it with another identity variable, ally.
Every unit receives a "1" for "ab_signal" during the years 1955-1961 and 1971-1978, while the entire data range is from 1950-1989. Given that "ab_signal" is intrinsically linked to year effects, I am wondering if I am still allowed/required to control for year effects in my model. My major concern is about the collinearity problem as you will see in the below figure.
If I should not control the year effect, is this model still robust or reliable? I would appreciate any insights or advice on how to approach this question. Thank you again.
Below is my current code:
Should I add year effect into the current model?
After I control year-effect, ab_signal is omitted. As below:

Thank you for taking the time to read this post. My question pertains to whether or not to control for year dummies (i.year) in my unbalanced panel data. Any guidance would be greatly appreciated.
I am working with unbalanced panel data and using FE Poisson/PPMLHDFE to explore my research question. My dataset has 110 cross-sectional units (N = 110) and spans 40 years (T = 40). My hypothesis posits that the U.S. changes its foreign policies during specific years. To account for this, I created a dummy variable, "ab_signal," representing these specific years and have interacted it with another identity variable, ally.
Every unit receives a "1" for "ab_signal" during the years 1955-1961 and 1971-1978, while the entire data range is from 1950-1989. Given that "ab_signal" is intrinsically linked to year effects, I am wondering if I am still allowed/required to control for year effects in my model. My major concern is about the collinearity problem as you will see in the below figure.
If I should not control the year effect, is this model still robust or reliable? I would appreciate any insights or advice on how to approach this question. Thank you again.
Below is my current code:
Code:
ppmlhdfe milaid_21 i.ab_signal_3##i.ally X1it X2it X3it X4it X5it absorb(unit_id) vce(cluster unit id)
Code:
ppmlhdfe milaid_21 i.ab_signal_3##i.ally X1it X2it X3it X4it X5it absorb(unit_id year) vce(cluster unit id)
Comment