Dear Statalist,
I am running a model where my dependent variable (y1) has 4 categories. My problem is that one of my regressors (y2, a categorical variable with 4 categories) is also influenced by the other explanatory variables (household characteristics, $x1). I have already identified an instrumental variable (included in $x2), my base outcomes for the both variables of interest (y1 and y2) are the largest categories and I have already checked for IIA running two separate multinomial logistic models.
I have tried different approaches to account for endogeneity, without success:
I tried [cmp][/CODE]: [cmp (y1 =$x1) (y2= $x2), ind($cmp_mprobit $cmp_mprobit)][/CODE], but I had problems to make it converge:
cannot compute an improvement -- discontinuous region encountered
[convergence not achieved
convergence not achieved
r(430);]
I also tried [gsem][/CODE] using latent variables to account for the correlation with the error terms: [gsem (i.y1<- $x1 L, mlogit) (i.y2<- $x2 L, mlogit), vce(cluster community) var(L@1)][/CODE]
And I got the following error message:
[initial values not feasible
r(1400);]
The [gsem] command runs properly if I exclude the latent variables: [gsem (i.y1y<- $x1 L, mlogit) (i.y2<- $x2 L, mlogit), vce(cluster community)]
But I would be ignoring the endogeneity problem.
My database has 320 observations and I am using Stata 14.
I would appreciate any advice you could provide to move forward.
I am running a model where my dependent variable (y1) has 4 categories. My problem is that one of my regressors (y2, a categorical variable with 4 categories) is also influenced by the other explanatory variables (household characteristics, $x1). I have already identified an instrumental variable (included in $x2), my base outcomes for the both variables of interest (y1 and y2) are the largest categories and I have already checked for IIA running two separate multinomial logistic models.
I have tried different approaches to account for endogeneity, without success:
I tried [cmp][/CODE]: [cmp (y1 =$x1) (y2= $x2), ind($cmp_mprobit $cmp_mprobit)][/CODE], but I had problems to make it converge:
cannot compute an improvement -- discontinuous region encountered
[convergence not achieved
convergence not achieved
r(430);]
I also tried [gsem][/CODE] using latent variables to account for the correlation with the error terms: [gsem (i.y1<- $x1 L, mlogit) (i.y2<- $x2 L, mlogit), vce(cluster community) var(L@1)][/CODE]
And I got the following error message:
[initial values not feasible
r(1400);]
The [gsem] command runs properly if I exclude the latent variables: [gsem (i.y1y<- $x1 L, mlogit) (i.y2<- $x2 L, mlogit), vce(cluster community)]
But I would be ignoring the endogeneity problem.
My database has 320 observations and I am using Stata 14.
I would appreciate any advice you could provide to move forward.
Comment