Hi, I am a university student and I am struggling to understand if my regression is correct.
I have a dataset spanning 150 countries across 6 variables including gini, readiness, vulnerability, gdppc, pop, pop density from 1995-2021. I want to run a regression to test the effect of readiness on gini, with the other variables as controls, with some lagged effects. Readiness and vulnerability come from the ND-GAIN index for climate shocks, and my research is on the effect of climate readiness on income inequality.
So far, I have tried to xtset ID, year, and then xtreg gini_disp readiness L10.readiness L20.readiness vuln gdppc pop popden, fe vce(cluster ID) where ID is the variable I created for each country to xtset.
However, I do not fully understand the meanings of fixed effects and clustering and so cannot tell if I am doing this correctly or not. My understanding is that to control for error terms being correlated with the independent variables (which is heterogeneous by country) due to omitted variable bias we must include fe and cluster for the country to get statistically correct results.
On a similar note, I have seen papers use ηi+μt to control for fixed effects by country and time (which is what I want to do!) - have I captured this in my regression above or do i need to alter it.7
Thank you in advance
I have a dataset spanning 150 countries across 6 variables including gini, readiness, vulnerability, gdppc, pop, pop density from 1995-2021. I want to run a regression to test the effect of readiness on gini, with the other variables as controls, with some lagged effects. Readiness and vulnerability come from the ND-GAIN index for climate shocks, and my research is on the effect of climate readiness on income inequality.
So far, I have tried to xtset ID, year, and then xtreg gini_disp readiness L10.readiness L20.readiness vuln gdppc pop popden, fe vce(cluster ID) where ID is the variable I created for each country to xtset.
However, I do not fully understand the meanings of fixed effects and clustering and so cannot tell if I am doing this correctly or not. My understanding is that to control for error terms being correlated with the independent variables (which is heterogeneous by country) due to omitted variable bias we must include fe and cluster for the country to get statistically correct results.
On a similar note, I have seen papers use ηi+μt to control for fixed effects by country and time (which is what I want to do!) - have I captured this in my regression above or do i need to alter it.7
Thank you in advance
Comment