Dear Statalist community
These are my FE results of a regression of Government spending on International migration. 6 specifications show the regression with extra control variables: Unemployment, Urban population, Adjusted net income, Population aged >65, and Population aged <14 respectively.
I would like to know whether I interpret these correctly or not.
2. Since Government spending is measured in % of GDP and International migration is measured in number of people, do I interpret the unit of estimator as a % point? eg. when the amount of immigrants increases by 1 unit (100,000 people), the government spending increases by 0.154% point
One thing with specification(6) is that it is 'Population aged <14' is not significant but it makes variables Adjusted net income and Population aged >65 significant, I therefore include this to avoid problem of omitted variable bias. I'm not too sure whether I understand this correctly or not...
Lastly I am not too sure whether I interpret R-squared equation correctly or not.
4. Even though R-squared increases continuously through each specification, this does not mean that specification (6) with the highest value of R-squared has the best goodness of fit. This value can be artificially high. For instance, the regression might include too many variables compared to the number of observations, leading to misleading interpretation of R-squared. It is, therefore, important to interpret R-squared with careful consideration. I therefore cooperate whether the standard error of the immigration coefficient is small or not into the interpretation. Since specification (5) has a reasonably high R-squared comparing with others’, the most significant immigration coefficient, and lowest standard error than the others — I focus on this specification for this investigation with the interaction term.
Thank you
Guest
These are my FE results of a regression of Government spending on International migration. 6 specifications show the regression with extra control variables: Unemployment, Urban population, Adjusted net income, Population aged >65, and Population aged <14 respectively.
(1) | (2) | (3) | (4) | (5) | (6) | ||
(i) | International Immigration ct Observations R2 |
0.074 (0.049) 659 0.141 |
0.160** (0.683) 657 0.188 |
0.154* (0.847) 657 0.277 |
0.158** (0.713) 613 0.314 |
0.166*** (0.603) 613 0.327 |
0.165** (0.671) 613 0.337 |
1. Since the significant result does not hold when the variable of unemployment is excluded in specification (1). This illustrates that unemployment and the international immigration variable are strongly correlated. This may reflect the fact that European unemployment are highly correlated with the number of foreigners entering the countries. Therefore, without an inclusion of unemployments, the estimator will be biased. Since the sign of unemployment coefficient is positive and there is a negative correlation between unemployment and immigration, these show a negative bias in coefficient of unemployment.
One thing with specification(6) is that it is 'Population aged <14' is not significant but it makes variables Adjusted net income and Population aged >65 significant, I therefore include this to avoid problem of omitted variable bias. I'm not too sure whether I understand this correctly or not...
3. It is worth mentioning that there are only 2 control variables that are significant in all specifications: unemployment and urban variables. I include other variables the (variable of adjusted net income, population aged 65 and above, and the population aged 14 and below) because more controls can reduce the chances of the omitted variable bias. This bias can cause significant coefficient to appear to be insignificant. The outcomes show that the coefficient of adjusted net income and population aged more than or equal to 65 becomes significant at 5% level after running the full regression. So, these variables are correlated with the immigration and should be included in the regression. However, as shown from the result, none of the variables are as important as unemployment variable. This is because it correlates with the variable of immigration, changing the coefficients of immigration by 0.086% point. The coefficients of immigration from specification (3) to (6) only slightly change.
4. Even though R-squared increases continuously through each specification, this does not mean that specification (6) with the highest value of R-squared has the best goodness of fit. This value can be artificially high. For instance, the regression might include too many variables compared to the number of observations, leading to misleading interpretation of R-squared. It is, therefore, important to interpret R-squared with careful consideration. I therefore cooperate whether the standard error of the immigration coefficient is small or not into the interpretation. Since specification (5) has a reasonably high R-squared comparing with others’, the most significant immigration coefficient, and lowest standard error than the others — I focus on this specification for this investigation with the interaction term.
Guest
Comment