Dear Statalist community,
I want to execute a Diff-in-diff estimation.
My data is on the state-level and comprises the years 2005-2013.
State-year-level observations;
state:= 1,..,16;
year:= 2005,..,2013.
My treatment:
2 of 16 states implemented a policy:
state_1 in 2009, state_2 in 2010.
I expect this policy change (which is in place starting from the time of implementation) to affect the outcome variable not in the year of implementation, but with a two-year lag, three-year lag, .., .
1. Question)
Supposed the treatment-effect is treated as equal for both states, I model:
Is this specification correct?
If I excluded the terms "l3.D_ij l4.D_ij":
Would Stata assume in 2012/2013 that the outcome of state_1 would return to its state- and year-fixed effect level and expect no treatment effect?
2. Question)
How could I include state-specific time trends instead of assuming that they are independent of the state?
Is this specification correct?
3. Question)
Some of my covariates are distorted after 2010, some after 2011.
Is it possible to include them even if they only cover the pre-treatment period (like in the Synthetic Control Method approach)?
4. Question)
Public Health Insurance (share): Variable of treatment intensity OR important covariate?
The treatment only affects individuals that are publicly insured.
I do not expect the share of public HI to directly affect the outcome.
But the share is expected to affect the treatment effect.
Some papers interact the share of individuals with public HI with the treatment indicator.
But, one author states that "controlling for .. health insurance coverage (as a predictor of a closely related outcome) is important .. given that uninsured individuals are not directly affected by the mandates".
Is it enough to interact share of publicly HI with my treatment indicator or should I even include it as a covariate?
I think: If the share is exceptionally high in the two treated states, the treatment effect could be inflated.
How would I include the interaction term in my specification?
Thank you very much for your support/help in advance!
Kind regards,
Mischa
I want to execute a Diff-in-diff estimation.
My data is on the state-level and comprises the years 2005-2013.
State-year-level observations;
state:= 1,..,16;
year:= 2005,..,2013.
My treatment:
2 of 16 states implemented a policy:
state_1 in 2009, state_2 in 2010.
I expect this policy change (which is in place starting from the time of implementation) to affect the outcome variable not in the year of implementation, but with a two-year lag, three-year lag, .., .
1. Question)
Supposed the treatment-effect is treated as equal for both states, I model:
Code:
gen D=0 replace D=1 if state==1 & year==2009 replace D=1 if state==2 & year==2010 reg outcome_ij i.state i.year l2.D_ij l3.D_ij l4.D_ij, vce(cluster state)
If I excluded the terms "l3.D_ij l4.D_ij":
Would Stata assume in 2012/2013 that the outcome of state_1 would return to its state- and year-fixed effect level and expect no treatment effect?
2. Question)
How could I include state-specific time trends instead of assuming that they are independent of the state?
Code:
reg outcome_ij i.state i.year state#year l2.D_ij l3.D_ij l4.D_ij, vce(cluster state)
3. Question)
Some of my covariates are distorted after 2010, some after 2011.
Is it possible to include them even if they only cover the pre-treatment period (like in the Synthetic Control Method approach)?
4. Question)
Public Health Insurance (share): Variable of treatment intensity OR important covariate?
The treatment only affects individuals that are publicly insured.
I do not expect the share of public HI to directly affect the outcome.
But the share is expected to affect the treatment effect.
Some papers interact the share of individuals with public HI with the treatment indicator.
But, one author states that "controlling for .. health insurance coverage (as a predictor of a closely related outcome) is important .. given that uninsured individuals are not directly affected by the mandates".
Is it enough to interact share of publicly HI with my treatment indicator or should I even include it as a covariate?
I think: If the share is exceptionally high in the two treated states, the treatment effect could be inflated.
How would I include the interaction term in my specification?
Thank you very much for your support/help in advance!
Kind regards,
Mischa

Comment