Dear Members,
I'm doing a pooled OLS estimation using state-level data from two different years to see how smoking prevalence changed from the earlier year to the later year. There was a tax policy event in between. Because the tax policy was applied to all states, I don't have a control group of states to do a standard DiD. I like to see how the tax reform changed the smoking prevalence in states with below average tax during pre-reform time compared to states with above average tax. So I created a dummy (LTstate) taking value 1 for all states with below-average tax rates before the tax reform and 0 for the high-tax states. I also have a dummy for the tax reform which takes value 1 for the later year and 0 for the former year. However, the following specifications give me slightly different coefficient estimates.
In both cases, the coefficient on tax is identical. However, the estimate for LTstate differs in both models. For the interaction term model 1 returns estimates for tax#LTstate 1 1 while model 2 returns tax#LTstate 0 1. In both models, however, the coefficient has identical size but opposite signs.
My understanding is that the coefficient on tax should give the overall effect for all states (or is it capturing only the effect for high tax states because of the presence of interaction term). Coefficient on LTstate should provide the difference in smoking prevalence between low-tax and high-tax states before the tax policy I thought. But I am not sure why both models return alternative coefficient estimates for that dummy. Can someone guide me to pick the correct specification here that will help me estimate the effects separately for low-tax and high-tax state as well as a combined overall effect?
Thanking you in advance.
Rijo.
I'm doing a pooled OLS estimation using state-level data from two different years to see how smoking prevalence changed from the earlier year to the later year. There was a tax policy event in between. Because the tax policy was applied to all states, I don't have a control group of states to do a standard DiD. I like to see how the tax reform changed the smoking prevalence in states with below average tax during pre-reform time compared to states with above average tax. So I created a dummy (LTstate) taking value 1 for all states with below-average tax rates before the tax reform and 0 for the high-tax states. I also have a dummy for the tax reform which takes value 1 for the later year and 0 for the former year. However, the following specifications give me slightly different coefficient estimates.
Code:
- regress prevalence tax##LTstate [controls], vce(cluster state)
- regress prevalence tax LTstate tax#LTstate [controls], vce(cluster state)
My understanding is that the coefficient on tax should give the overall effect for all states (or is it capturing only the effect for high tax states because of the presence of interaction term). Coefficient on LTstate should provide the difference in smoking prevalence between low-tax and high-tax states before the tax policy I thought. But I am not sure why both models return alternative coefficient estimates for that dummy. Can someone guide me to pick the correct specification here that will help me estimate the effects separately for low-tax and high-tax state as well as a combined overall effect?
Thanking you in advance.
Rijo.
Comment