I am trying to analyse the impact of competition in US states on banking stability. I measure competition with two variables, the Herfindahl-Hirschman index in each state and the Interstate Branching Index (IBR) developed by Rice and Strahan (2010). This index takes the values between 0 and 4 with 4 indicating that the state is most heavily regulated. The interstate branching index originated when the US government introduced a new banking act that allowed banks to introduce barriers to competition. These barriers were introduced by different states at different points in time creating exogeneous shocks to competition. Moreover, some states revised their barriers during the study period introducing extra variation to competition (i.e. an X state deregulated in 2003 and 2005 causing a decrease in the index value from 4 to 3 and then from 3 to 1, respectively).
Initially, I adopted a fixed effects panel data approach. My model was based on bank level data and had the following form:
xtreg Stability_Measure ln_HHI ln_HHISQ IBR control_variables, fe vce(cluster StateCode) ~ errors were clustered at the state level
However, someone suggested using difference in difference analysis as it allows to make more causal inferences and control for omitted variables. Thus, I created the following model:
xtreg Stability_Measure ln_HHI ln_HHISQ IBR_0 IBR_1 IBR_2 IBR_3 ib4.IBR#c.ln_HHI control_variables,fe vce(cluster StateCode) ~ errors were clustered at the state level
IBR_0 IBR_1 IBR_2 IBR_3 are dummies that take the value 1 when a state deregulated and for the period thereafter. If I understand correctly, the final interaction will be the treatment effect (ib4.IBR#c.ln_HHI). I am not sure whether a difference in difference analysis is perfectly applicable because by 2010, all states have removed some of the barriers. Thus, there is no state where the index remained at 4 which might mean I do not have a control group (please correct me if I am wrong).
My questions are:
Are these models correct?
Which approach appears to be more correct from an econometric point of view?
Is there maybe a third approach that will be better?
Thank you very much!
Initially, I adopted a fixed effects panel data approach. My model was based on bank level data and had the following form:
xtreg Stability_Measure ln_HHI ln_HHISQ IBR control_variables, fe vce(cluster StateCode) ~ errors were clustered at the state level
However, someone suggested using difference in difference analysis as it allows to make more causal inferences and control for omitted variables. Thus, I created the following model:
xtreg Stability_Measure ln_HHI ln_HHISQ IBR_0 IBR_1 IBR_2 IBR_3 ib4.IBR#c.ln_HHI control_variables,fe vce(cluster StateCode) ~ errors were clustered at the state level
IBR_0 IBR_1 IBR_2 IBR_3 are dummies that take the value 1 when a state deregulated and for the period thereafter. If I understand correctly, the final interaction will be the treatment effect (ib4.IBR#c.ln_HHI). I am not sure whether a difference in difference analysis is perfectly applicable because by 2010, all states have removed some of the barriers. Thus, there is no state where the index remained at 4 which might mean I do not have a control group (please correct me if I am wrong).
My questions are:
Are these models correct?
Which approach appears to be more correct from an econometric point of view?
Is there maybe a third approach that will be better?
Thank you very much!
Comment