Hello Statalist
I am in the process of writing my master thesis, and have gotten in a little over my head when it comes to the actual panel data regression, and would appreciate some insights.
I am researching Foreign Direct Investment from China to the world, with a balanced data set consisting of 131 countries across a time period of 10 years, meaning a total of 1310 observations.
I have a balanced data set with no missing values.
My dependent variable is FDI from China to the world, as a count variable, with mostly zeroes as observations.
My data is over-dispersed, as such i am applying a negative binomial distribution instead of a Poisson distribution.
My dependent variables are:
LogGDP LogGDPCap GDPG LogDist CoC RoL RQ GE PS VA Bilat ChinaExports ChinaImports HighTX Patent FuelX OreMetalX Openness Inflation
Where the first 3 relate to gross domestic product, then distance between countries, than 6 measures for institutional differences, a dummy variable (bilat) representing if a bilateral investment treaty exists, and some import and export variables, as well as inflation.
From my understanding, when i am not interested in between country differences, but only want to control for them, i can use a xtnbreg command with random effects. Random effects also fits the model better according to a Hausman test i performed when comparing fixed and random effects.
I am writing the following command in Stata for the output:
xtnbreg FDItotal LogGDP LogDist LogGDPCap CoC RoL RQ GE PS VA Bilat ChinaExports ChinaImports HighTX Patent FuelX OreMetalX Openness Inflation GDPG, re
Which gives me the output presented below in.
What i am asking is, is there anything i should be aware off, that i am not currently taking into account? From my understanding of the literature i have read, the negative binomial distribution is my best fit, and random effects should be applied. Any input regarding this being completely wrong or somewhat correct would be much appreciated.
I am not asking for interpretation of the output, "only" if it is the correct output.
Regards
Morten
I am in the process of writing my master thesis, and have gotten in a little over my head when it comes to the actual panel data regression, and would appreciate some insights.
I am researching Foreign Direct Investment from China to the world, with a balanced data set consisting of 131 countries across a time period of 10 years, meaning a total of 1310 observations.
I have a balanced data set with no missing values.
My dependent variable is FDI from China to the world, as a count variable, with mostly zeroes as observations.
My data is over-dispersed, as such i am applying a negative binomial distribution instead of a Poisson distribution.
My dependent variables are:
LogGDP LogGDPCap GDPG LogDist CoC RoL RQ GE PS VA Bilat ChinaExports ChinaImports HighTX Patent FuelX OreMetalX Openness Inflation
Where the first 3 relate to gross domestic product, then distance between countries, than 6 measures for institutional differences, a dummy variable (bilat) representing if a bilateral investment treaty exists, and some import and export variables, as well as inflation.
From my understanding, when i am not interested in between country differences, but only want to control for them, i can use a xtnbreg command with random effects. Random effects also fits the model better according to a Hausman test i performed when comparing fixed and random effects.
I am writing the following command in Stata for the output:
xtnbreg FDItotal LogGDP LogDist LogGDPCap CoC RoL RQ GE PS VA Bilat ChinaExports ChinaImports HighTX Patent FuelX OreMetalX Openness Inflation GDPG, re
Which gives me the output presented below in.
What i am asking is, is there anything i should be aware off, that i am not currently taking into account? From my understanding of the literature i have read, the negative binomial distribution is my best fit, and random effects should be applied. Any input regarding this being completely wrong or somewhat correct would be much appreciated.
I am not asking for interpretation of the output, "only" if it is the correct output.
Regards
Morten
Comment