Currently I am trying to choose the right set of independent variables and right model.
My model is
ologit happiness i.external i.income i.d i.male i.GNIcap age age2 i.nchildren i.empl refincome i.marital i.health i.year, nolog
Variables used are:
happiness is categorical 1 not happy at all- 4 very happy
external - external religiosity, 1 yes 0 no - dummy
income - income scale 1 lower step 11 higher step - categorical
d - religious denominations - categorical, 11 main denominations.
male - dummy
GNIcap - categorical - low income countries, medium income countries and high income countries.
age age2
nchildren - number of children 1,2,3,4,5,6,7,8 and more.
empl - employment status - 1 full time 10 not employed - categorical
refincome - reference income
marital = marital status - 0 not married, 1 married
health - health status - very poor -0 , very good 4
year
I did sensitivity check, started with the following baseline model and added 1 variable to baseline model each time, finally end up with full model.
ologit happiness, nolog
ologit happiness i.external, nolog
ologit happiness i.external i.income, nolog
..........
......
...........
ologit happiness i.external i.income i.d i.male i.GNIcap age age2 i.nchildren i.empl refincome i.marital i.health i.year, nolog
By adding more variables to the baseline model, BIC and AIC are decreasing and Pseudo R2 is increasing.
It seems suspicious to me and I have feeling that even I would add 10 more variables to the model, BIC and AIC would decrease and Pseudo R2 would increase, showing the best model is with all variables.
The question is, am I right? or there is something suspicious in these results (AIC BIC and Pseudo R2).
My model is
ologit happiness i.external i.income i.d i.male i.GNIcap age age2 i.nchildren i.empl refincome i.marital i.health i.year, nolog
Variables used are:
happiness is categorical 1 not happy at all- 4 very happy
external - external religiosity, 1 yes 0 no - dummy
income - income scale 1 lower step 11 higher step - categorical
d - religious denominations - categorical, 11 main denominations.
male - dummy
GNIcap - categorical - low income countries, medium income countries and high income countries.
age age2
nchildren - number of children 1,2,3,4,5,6,7,8 and more.
empl - employment status - 1 full time 10 not employed - categorical
refincome - reference income
marital = marital status - 0 not married, 1 married
health - health status - very poor -0 , very good 4
year
I did sensitivity check, started with the following baseline model and added 1 variable to baseline model each time, finally end up with full model.
ologit happiness, nolog
ologit happiness i.external, nolog
ologit happiness i.external i.income, nolog
..........
......
...........
ologit happiness i.external i.income i.d i.male i.GNIcap age age2 i.nchildren i.empl refincome i.marital i.health i.year, nolog
By adding more variables to the baseline model, BIC and AIC are decreasing and Pseudo R2 is increasing.
It seems suspicious to me and I have feeling that even I would add 10 more variables to the model, BIC and AIC would decrease and Pseudo R2 would increase, showing the best model is with all variables.
The question is, am I right? or there is something suspicious in these results (AIC BIC and Pseudo R2).
Comment