Hello,
Say I'm estimating the impact of a program using an OLS difference-in-difference model, like so:
Is there an issue with controlling for something that is highly correlated with the DiD term (1.treatment*1.after)? My internal monologue is:
(1) You need to control for the correlated variable because without it, you're attributing the effect entirely to the treatment when it could've also been driven by the correlated control variable. BUT
(2) I've learned that the standard errors for highly correlated variables are untrustworthy.
Is this a situation that might be difficult to disentangle? Or am I overthinking this?
Thanks!
Say I'm estimating the impact of a program using an OLS difference-in-difference model, like so:
Code:
regress outcome i.treatment##i.intervention $controls
(1) You need to control for the correlated variable because without it, you're attributing the effect entirely to the treatment when it could've also been driven by the correlated control variable. BUT
(2) I've learned that the standard errors for highly correlated variables are untrustworthy.
Is this a situation that might be difficult to disentangle? Or am I overthinking this?
Thanks!
Comment