Hello,
I have an unbalanced panel dataset based on surveys at the subregional level taken from 8 countries at different points in time. Therefore the individual observations are nested within regions, and regions within the 8 countries listed below (with the survey years mentioned inside the brackets):
India (1999, 2006, 2012, 2016),
Bangladesh (2004, 2011, 2014),
Pakistan (2007, 2012, 2018),
Nepal (1996, 2006, 2011, 2016),
Cambodia (2000, 2005, 2010, 2014),
Vietnam (1997, 2002, 2006, 2010, 2014),
Indonesia (1997, 2003, 2007, 2012),
Philippines (1998, 2003, 2008, 2013, 2017).
The dependent variable is a wealth index (g_iwi - growth in wealth index).
The main independent variable is dependency ratio (ln_dep - initial values of dep and g_dep - growth in dep) & ln_iwi (initial value of the wealth index)
Other independent variables are education (ln_educ), urbanization (ln_urban), population (ln_pop), share of population (share_pop), etc (at the regional level) and ethnic fractionalization (ln_ethnic) (at the country level). & dummy variable for each country & ln_ethnic_m (dummy to account for some missing values)
Therefore I ran several regressions in Stata but every time I get insignificant results or contradictory coefficients to the underlying theory (derived from the growth literature). My main concern is how can I deal with the different years for each country's , since I want to include all observations in a single model and what estimator is most suitable?
I use Stata 16.0. Below you have a description of the data and a simple OLS regression.
I am a novice in Stata and statistics so I would very much appreciate if anyone can guide me.
Best wishes,
I have an unbalanced panel dataset based on surveys at the subregional level taken from 8 countries at different points in time. Therefore the individual observations are nested within regions, and regions within the 8 countries listed below (with the survey years mentioned inside the brackets):
India (1999, 2006, 2012, 2016),
Bangladesh (2004, 2011, 2014),
Pakistan (2007, 2012, 2018),
Nepal (1996, 2006, 2011, 2016),
Cambodia (2000, 2005, 2010, 2014),
Vietnam (1997, 2002, 2006, 2010, 2014),
Indonesia (1997, 2003, 2007, 2012),
Philippines (1998, 2003, 2008, 2013, 2017).
The dependent variable is a wealth index (g_iwi - growth in wealth index).
The main independent variable is dependency ratio (ln_dep - initial values of dep and g_dep - growth in dep) & ln_iwi (initial value of the wealth index)
Other independent variables are education (ln_educ), urbanization (ln_urban), population (ln_pop), share of population (share_pop), etc (at the regional level) and ethnic fractionalization (ln_ethnic) (at the country level). & dummy variable for each country & ln_ethnic_m (dummy to account for some missing values)
Therefore I ran several regressions in Stata but every time I get insignificant results or contradictory coefficients to the underlying theory (derived from the growth literature). My main concern is how can I deal with the different years for each country's , since I want to include all observations in a single model and what estimator is most suitable?
I use Stata 16.0. Below you have a description of the data and a simple OLS regression.
Code:
. xtset region_c year panel variable: region_c (unbalanced) time variable: year, 1996 to 2018, but with gaps delta: 1 unit . xtdescribe region_c: 1, 2, ..., 143 n = 142 year: 1996, 1997, ..., 2018 T = 20 Delta(year) = 1 unit Span(year) = 23 periods (region_c*year uniquely identifies each observation) Distribution of T_i: min 5% 25% 50% 75% 95% max 1 2 3 4 4 5 5 Freq. Percent Cum. | Pattern ---------------------------+------------------------- 26 18.31 18.31 | ...1......1.....1...1.. 23 16.20 34.51 | ........1......1..1.... 23 16.20 50.70 | .1.....1...1....1...... 17 11.97 62.68 | ....1....1....1...1.... 16 11.27 73.94 | ..1....1....1....1...1. 6 4.23 78.17 | ................1...1.. 6 4.23 82.39 | .1....1...1...1...1.... 5 3.52 85.92 | ...........1....1.....1 5 3.52 89.44 | 1.........1....1....1.. 15 10.56 100.00 | (other patterns) ---------------------------+------------------------- 142 100.00 | XXXXX.XXXXXXX.XXXXX.XXX . reg g_iwi ln_iwi g_dep ln_dep ln_educ ln_urban ln_pop share_pop c.ln_ethnic##c.ln_dep ln_ethnic_m Bangladesh India Pakistan Nepal Indonesia Vietnam Philippines Cambodia note: ln_dep omitted because of collinearity note: Cambodia omitted because of collinearity Source | SS df MS Number of obs = 379 -------------+---------------------------------- F(17, 361) = 23.69 Model | 5.41297573 17 .318410337 Prob > F = 0.0000 Residual | 4.85207147 361 .013440641 R-squared = 0.5273 -------------+---------------------------------- Adj R-squared = 0.5051 Total | 10.2650472 378 .02715621 Root MSE = .11593 -------------------------------------------------------------------------------------- g_iwi | Coef. Std. Err. t P>|t| [95% Conf. Interval] ---------------------+---------------------------------------------------------------- ln_iwi | .1155921 .0485601 2.38 0.018 .020096 .2110883 g_dep | -.4594956 .0515608 -8.91 0.000 -.5608928 -.3580985 ln_dep | .2367487 .0615646 3.85 0.000 .1156784 .357819 ln_educ | -.0506995 .0558058 -0.91 0.364 -.1604448 .0590458 ln_urban | -.0303772 .0120283 -2.53 0.012 -.0540315 -.0067229 ln_pop | .0015716 .0081499 0.19 0.847 -.0144556 .0175988 share_pop | .000078 .0015892 0.05 0.961 -.0030472 .0032032 ln_ethnic | .0931254 .1855702 0.50 0.616 -.2718091 .4580599 ln_dep | 0 (omitted) | c.ln_ethnic#c.ln_dep | -.0379962 .0437562 -0.87 0.386 -.1240452 .0480528 | ln_ethnic_m | .0170842 .0282287 0.61 0.545 -.0384292 .0725976 Bangladesh | -.0248929 .0342641 -0.73 0.468 -.0922752 .0424893 India | .1015927 .0420808 2.41 0.016 .0188385 .1843469 Pakistan | -.1086989 .0485216 -2.24 0.026 -.2041195 -.0132783 Nepal | .2189254 .0416671 5.25 0.000 .1369846 .3008661 Indonesia | -.0080362 .0459966 -0.17 0.861 -.0984911 .0824188 Vietnam | .053028 .0418545 1.27 0.206 -.0292814 .1353373 Philippines | -.0750558 .0487437 -1.54 0.124 -.1709131 .0208014 Cambodia | 0 (omitted) _cons | -1.140357 .3646102 -3.13 0.002 -1.857383 -.4233299 --------------------------------------------------------------------------------------
Best wishes,
Comment