Hello!
I have country-level panel data (N=40, T=5) on cigarette smoking prevalence, per capita cigarette consumption, and a set of dependent variables that include both binary indicators, and continuous variables. The coefficients on the dummy variables are my key variables of interest - the type of cigarette tax structure that a country implements has been graded by other researchers on a scale of 1 to 5, where a score of 1 means that country has a weak tax structure, and a score of five represents that they have the most desirable tax structure. I want to use these scores, which are ultimately proxies for different types of tax structures, to answer two questions:
1. Do countries with simpler excise tax structures (i.e, a higher score) have lower smoking prevalence and lower cigarette consumption than countries adopting more complex excise tax structures (a lower score)?
2. What is the impact of a change in excise tax structure (reflected by a change in the tax structure score) on cigarette smoking prevalence and cigarette consumption?
To answer question 1, I have run the following regression for cigarette smoking prevalence, using a fractional logit regression:
Smoking_prevalencejt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + uit (eq1)
where TSS2 is a dummy variable =1 if a country's tax structure scored a 2, TSS3 is a dummy variable =1 if a country's tax structure scored a 3 etc. A score of 1 (TSS1) is the omitted category. Xjt is a vector of country-level demographic/macroeconomic - all variables in Xjt are continuous. δt represents year dummies.
For per capita cigarette consumption, I used OLS to estimate an identical equation, with per capita cigarette consumption as the dependent variable, as shown in eq2.
Per_capita_cig_consumptionjt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + uit (eq2)
In both estimations (eq1 and eq2), I clustered standard errors at the country-level.
To answer question 2 - what is the impact of a change in excise tax structure on cigarette smoking prevalence and cigarette consumption - I changed the specifications shown in equations 1 and 2 by adding country-fixed effects, so that I estimate the models shown in equations 3 and 4.
Smoking_prevalencejt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + αj + uit (eq3)
Per_capita_cig_consumptionjt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + αj + uit (eq4)
My understanding is that the use of two-way fixed effects will ensure that only within-country variation is used for model identification, allowing an analysis of the impact of a change in excise tax structure on smoking prevalence and per capita cigarette consumption in the sampled countries. Again, in both eq3 and eq4, standard errors were clustered by country. Following guidance from Carlo Lazzoro on this forum, equations 3 and 4 were implemented using -xtreg-.
However, I want to know how I can test whether a lag of the dependent variables should be included in the relevant models. Conceptually, it makes sense that, at the country-level, past smoking prevalence determines current smoking prevalence, and past consumption determines current consumption, given that cigarette smoking is addictive. Question: is there a formal test I can conduct to know if adding a lag of the dependent variable is the right thing to do?
Additionally, I am aware that if I do end up making this a dynamic panel data model, OLS will be biased. I have learned based on reading this forum, supplemented with these lecture notes, that because my N>T, I should be using systems GMM with -xtabond2-. However, while there is variation in my dummy variables within many (but not within all) countries over time, I am concerned that one cannot employ systems GMM with binary indicators (I haven't seen examples in the literature). I did find this forum in which it is indicated that dummy variables aren't a special case when it comes to differencing; but I am not confident that this applies in the application of systems GMM and the lack of empirical studies in my field that have used dummy variables makes me nervous of my own understanding. Question: is it possible to use estimate a model that includes dummy variables using -xtabond2-?
Thank you in advance for taking the time to read this.
Kind regards,
Sam
I have country-level panel data (N=40, T=5) on cigarette smoking prevalence, per capita cigarette consumption, and a set of dependent variables that include both binary indicators, and continuous variables. The coefficients on the dummy variables are my key variables of interest - the type of cigarette tax structure that a country implements has been graded by other researchers on a scale of 1 to 5, where a score of 1 means that country has a weak tax structure, and a score of five represents that they have the most desirable tax structure. I want to use these scores, which are ultimately proxies for different types of tax structures, to answer two questions:
1. Do countries with simpler excise tax structures (i.e, a higher score) have lower smoking prevalence and lower cigarette consumption than countries adopting more complex excise tax structures (a lower score)?
2. What is the impact of a change in excise tax structure (reflected by a change in the tax structure score) on cigarette smoking prevalence and cigarette consumption?
To answer question 1, I have run the following regression for cigarette smoking prevalence, using a fractional logit regression:
Smoking_prevalencejt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + uit (eq1)
where TSS2 is a dummy variable =1 if a country's tax structure scored a 2, TSS3 is a dummy variable =1 if a country's tax structure scored a 3 etc. A score of 1 (TSS1) is the omitted category. Xjt is a vector of country-level demographic/macroeconomic - all variables in Xjt are continuous. δt represents year dummies.
For per capita cigarette consumption, I used OLS to estimate an identical equation, with per capita cigarette consumption as the dependent variable, as shown in eq2.
Per_capita_cig_consumptionjt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + uit (eq2)
In both estimations (eq1 and eq2), I clustered standard errors at the country-level.
To answer question 2 - what is the impact of a change in excise tax structure on cigarette smoking prevalence and cigarette consumption - I changed the specifications shown in equations 1 and 2 by adding country-fixed effects, so that I estimate the models shown in equations 3 and 4.
Smoking_prevalencejt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + αj + uit (eq3)
Per_capita_cig_consumptionjt= β0 + β1TSS2jt + β2TSS3jt + β3TSS4jt + β4TSS5jt + β5Xjt + δt + αj + uit (eq4)
My understanding is that the use of two-way fixed effects will ensure that only within-country variation is used for model identification, allowing an analysis of the impact of a change in excise tax structure on smoking prevalence and per capita cigarette consumption in the sampled countries. Again, in both eq3 and eq4, standard errors were clustered by country. Following guidance from Carlo Lazzoro on this forum, equations 3 and 4 were implemented using -xtreg-.
However, I want to know how I can test whether a lag of the dependent variables should be included in the relevant models. Conceptually, it makes sense that, at the country-level, past smoking prevalence determines current smoking prevalence, and past consumption determines current consumption, given that cigarette smoking is addictive. Question: is there a formal test I can conduct to know if adding a lag of the dependent variable is the right thing to do?
Additionally, I am aware that if I do end up making this a dynamic panel data model, OLS will be biased. I have learned based on reading this forum, supplemented with these lecture notes, that because my N>T, I should be using systems GMM with -xtabond2-. However, while there is variation in my dummy variables within many (but not within all) countries over time, I am concerned that one cannot employ systems GMM with binary indicators (I haven't seen examples in the literature). I did find this forum in which it is indicated that dummy variables aren't a special case when it comes to differencing; but I am not confident that this applies in the application of systems GMM and the lack of empirical studies in my field that have used dummy variables makes me nervous of my own understanding. Question: is it possible to use estimate a model that includes dummy variables using -xtabond2-?
Thank you in advance for taking the time to read this.
Kind regards,
Sam
Comment