Hello everyone, I have a problem regarding a study I'm trying to do.
I have panel data on 158 neighborhoods, from January 2006 to December 2016 (132 months in total). I want to estimate the effect of the creation of primary medical facilities in some of the neighborhoods (not all, around 50%) on those neighborhoods' death rate from diabetes mellitus. The facilities started to be built in 2009, but it is an ongoing process: more are being created as I type this. I thought of using a diff-in-diff approach to compare the treated group with the untreated group, but my main problem is that the implementation was gradual: some facilities were created in February 2009, some in December, some in June 2010, and so on. Is it possible to do a "dynamic diff in diff" with this kind of staggered timing?
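To make the idea concrete, here is a minimal sketch of the kind of specification I imagine, not something I have validated. It assumes two hypothetical variables that are not in my commands so far: "bairro", a numeric neighborhood identifier, and "first_cf", the month (coded the same way as my month variable "meseano2") in which a neighborhood got its first facility, missing for neighborhoods that never got one. "taxa_obito1" is my death-rate variable.

```stata
* Sketch only. bairro and first_cf are hypothetical variable names:
*   bairro   = numeric neighborhood id
*   first_cf = month of the first facility, in the same coding as meseano2,
*              missing for neighborhoods that never received one
xtset bairro meseano2

* Indicator that switches on at each neighborhood's own start date,
* so treatment timing is allowed to differ across neighborhoods
gen byte treated = !missing(first_cf) & meseano2 >= first_cf

* Neighborhood fixed effects come from the fe option, month fixed effects
* from i.meseano2; standard errors are clustered by neighborhood
xtreg taxa_obito1 treated i.meseano2, fe vce(cluster bairro)
```

Whether a two-way fixed effects estimate like this can be trusted when the start dates are staggered is part of what I am unsure about.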
Also, I tried a regular panel regression with fixed effects for month and neighborhood. It looks like this:
Code:
xtreg taxa_obito1 taxa_cf1 i.meseano2, fe
Here "taxa_obito1" is the death rate from diabetes and "taxa_cf1" is the rate of primary medical facilities in the neighborhood in a specific month and year. The results are what I expected (and wanted) them to be: more primary medical facilities are associated with a lower death rate from diabetes (not saying anything about causality yet). However, when I run
Code:
xtgls taxa_obito1 taxa_cf1
the effect is positive! That is, more medical facilities are positively correlated with deaths from diabetes, which seems weird to me. I should note that many values of taxa_cf1 are zero, since not all neighborhoods have medical facilities, and those that do did not have them until a certain time period. Could that be affecting the model? When using this panel approach, should I drop the neighborhoods that have not received any facilities yet and focus on the ones that have?
I should say that the diff-in-diff approach seems more thorough to me, although I don't know how to get around the problem of implementation in different periods for different neighborhoods.
Sorry if the post was sort of confusing. Any help would be appreciated.