Hi everyone,
I am estimating linear models where the unit of analysis is the county, and counties are nested in states. There is no time component, and the number of counties in each state varies widely. The main independent variable is a dichotomous variable that varies within some of the states but not others. In other words, some states have “ones” or “zeros” in all of their counties, whereas in others the independent variable does vary across counties.
I might be getting this wrong, but my understanding is that if I run a fixed-effects model to account for state-level unobservables, all counties belonging to states where the independent variable of interest does not vary should be dropped from the analysis, as the state fixed-effect and the independent variable are perfectly collinear in those cases.
However, when I run the models in Stata this is not the case. I start with the pooled estimator ignoring the nesting of counties in states:
reg y x, robust
Now, the number of observations remains the same if I do either:
xtset state
xtreg y x, fe vce(cluster state)
or
reg y x i.state, vce(cluster state)
In the latter case, the output gives me coefficients for all states (except of course for the reference category), in other words none are omitted. Shouldn’t some of them be unidentifiable?
I am confused why nothing is being dropped out in the fixed effect model even if the independent variable is invariant for some of the clusters. Is it appropriate to run a fixed effect model with this kind of data, or should I just treat it as a cross-section perhaps with std errors clustered by state?
Thank you.
I am estimating linear models where the unit of analysis is the county, and counties are nested in states. There is no time component, and the number of counties in each state varies widely. The main independent variable is a dichotomous variable that varies within some of the states but not others. In other words, some states have “ones” or “zeros” in all of their counties, whereas in others the independent variable does vary across counties.
I might be getting this wrong, but my understanding is that if I run a fixed-effects model to account for state-level unobservables, all counties belonging to states where the independent variable of interest does not vary should be dropped from the analysis, as the state fixed-effect and the independent variable are perfectly collinear in those cases.
However, when I run the models in Stata this is not the case. I start with the pooled estimator ignoring the nesting of counties in states:
reg y x, robust
Now, the number of observations remains the same if I do either:
xtset state
xtreg y x, fe vce(cluster state)
or
reg y x i.state, vce(cluster state)
In the latter case, the output gives me coefficients for all states (except of course for the reference category), in other words none are omitted. Shouldn’t some of them be unidentifiable?
I am confused why nothing is being dropped out in the fixed effect model even if the independent variable is invariant for some of the clusters. Is it appropriate to run a fixed effect model with this kind of data, or should I just treat it as a cross-section perhaps with std errors clustered by state?
Thank you.
Comment