Hi Stata forum,
I am utilizing a zero-inflated ordered probit model for an analysis on substance use in Stata 17, and noticed a cut-point value in the output that looks problematic to me (compared to the other cut points). I also noticed that the average marginal effects for changes in either of my two predictors are completely unrelated (z = 0) to probability for zero values conditional on participation (i.e., zero level of substance use but in the ordered component), but significantly related to probability for non-participation. I don't know if those two things are related, but it seemed possible and I have been struggling to understand if this is concerning for interpretation of my effects. My model has two predictor variables that I standardized (x1 and x2 in the following dataex output) that are used in both model components, and a five level ordinally-ranked outcome on substance use frequency (y1 in the dataex output). x1 has a significant positive coefficient in the ordered component and a significant negative coefficient in the inflation component; x2 has a significant positive coefficient in both components of the model. I was hoping that someone could help me understand this peculiar cut point value and possibly related marginal effect difference between the two types of zeros, as I want to make sure my model is interpretable. I provided all relevant output and example data, please let me know if anything else would be helpful to include! Thank you for any help that could be provided.
Here is my code for the ZIOP model: zioprobit y1 x1 x2, inflate(x1 x2)
Here are the cut points I got, and /cut1 was concerning given the large negative value, standard error, and confidence interval:
---------Coefficient--Std. err.---[95% conf. interval]
---------------+----------------------------------------------------------------
/cut1 | -7.491846 194.4509 -388.6086 373.6249
/cut2 | .1161476 .2585863 -.3906722 .6229674
/cut3 | .8731389 .2714118 .3411814 1.405096
/cut4 | 1.443595 .2893972 .8763873 2.010803
--------------------------------------------------------------------------------
The average marginal effect output for probability of non-participation and participation with zero-consumption were:
Non-participation: margins, predict(pnpar) dydx(x1 x2)
--------------------------------------------------------------------------------
| Delta-method
| ------dy/dx------std. err.----z---P>|z|---[95% conf. interval]
---------------+----------------------------------------------------------------
x1 | .0807176 .0243734 3.31 0.001 .0329466 .1284886
x2 | -.2075266 .023853 -8.70 0.000 -.2542775 -.1607756
--------------------------------------------------------------------------------
Participation with zero-consumption: margins, predict(pjoint1 outcome(0)) dydx(x1 x2)
--------------------------------------------------------------------------------
| Delta-method
| ------dy/dx-------std. err.----z----P>|z|---[95% conf. interval]
---------------+----------------------------------------------------------------
x1 | -1.21e-09 .0934703 -0.00 1.000 -.1831984 .1831984
x2 | -4.05e-10 .0268515 -0.00 1.000 -.052628 .052628
--------------------------------------------------------------------------------
Here is example data:
I am utilizing a zero-inflated ordered probit model for an analysis on substance use in Stata 17, and noticed a cut-point value in the output that looks problematic to me (compared to the other cut points). I also noticed that the average marginal effects for changes in either of my two predictors are completely unrelated (z = 0) to probability for zero values conditional on participation (i.e., zero level of substance use but in the ordered component), but significantly related to probability for non-participation. I don't know if those two things are related, but it seemed possible and I have been struggling to understand if this is concerning for interpretation of my effects. My model has two predictor variables that I standardized (x1 and x2 in the following dataex output) that are used in both model components, and a five level ordinally-ranked outcome on substance use frequency (y1 in the dataex output). x1 has a significant positive coefficient in the ordered component and a significant negative coefficient in the inflation component; x2 has a significant positive coefficient in both components of the model. I was hoping that someone could help me understand this peculiar cut point value and possibly related marginal effect difference between the two types of zeros, as I want to make sure my model is interpretable. I provided all relevant output and example data, please let me know if anything else would be helpful to include! Thank you for any help that could be provided.
Here is my code for the ZIOP model: zioprobit y1 x1 x2, inflate(x1 x2)
Here are the cut points I got, and /cut1 was concerning given the large negative value, standard error, and confidence interval:
---------Coefficient--Std. err.---[95% conf. interval]
---------------+----------------------------------------------------------------
/cut1 | -7.491846 194.4509 -388.6086 373.6249
/cut2 | .1161476 .2585863 -.3906722 .6229674
/cut3 | .8731389 .2714118 .3411814 1.405096
/cut4 | 1.443595 .2893972 .8763873 2.010803
--------------------------------------------------------------------------------
The average marginal effect output for probability of non-participation and participation with zero-consumption were:
Non-participation: margins, predict(pnpar) dydx(x1 x2)
--------------------------------------------------------------------------------
| Delta-method
| ------dy/dx------std. err.----z---P>|z|---[95% conf. interval]
---------------+----------------------------------------------------------------
x1 | .0807176 .0243734 3.31 0.001 .0329466 .1284886
x2 | -.2075266 .023853 -8.70 0.000 -.2542775 -.1607756
--------------------------------------------------------------------------------
Participation with zero-consumption: margins, predict(pjoint1 outcome(0)) dydx(x1 x2)
--------------------------------------------------------------------------------
| Delta-method
| ------dy/dx-------std. err.----z----P>|z|---[95% conf. interval]
---------------+----------------------------------------------------------------
x1 | -1.21e-09 .0934703 -0.00 1.000 -.1831984 .1831984
x2 | -4.05e-10 .0268515 -0.00 1.000 -.052628 .052628
--------------------------------------------------------------------------------
Here is example data:
Code:
* Example generated by -dataex-. For more info, type help dataex clear input byte y1 double(x1 x2) 0 -1.3182884334179708 .20508497962936276 0 .524604411531049 -.14909458294033573 0 -1.227507998230392 1.398686299361347 1 -1.370305060967942 .4890011458617213 0 .4312344726990797 .7124374325778748 1 .5953319787558887 1.256011745657513 0 .2573068620969491 -1.5814214420163302 0 -.15077979054329863 .17350508713602047 0 .12950025586106886 -.373065290372721 0 -.2223674762202617 -.5095255438281034 0 .9093840963462069 -.46909474696924125 0 -.7940520666401204 -1.749535766728144 0 -.08722609553029873 -2.4057738937944126 0 .013070352980564335 -2.2976897713418234 0 1.2055170496209213 -.3715003773759855 0 .06331580021848733 .3012970595100346 0 -1.0264236049503106 -1.6053814952665941 3 -.41623476714315194 .6389024730321154 0 .9474061751413685 .26363221479177906 0 .833914487119212 -.04411344033717609 0 .22399453722857704 -.8665165666622053 0 -.11030220361651744 .3056480809756038 1 -2.2511483317781344 -.26789419175220097 0 .9460646056098914 -.1747506695776634 2 -.7910641640213468 1.4470988979402217 0 .8803551434315904 1.3189178389268883 0 .7494546220628164 -.28436926202091534 0 1.1978924000013262 -2.048315244145489 0 -.14241825529256236 -.2557690308757761 0 -2.377694447427347 -.9101635837076019 0 -.20448149687100634 .37619171977983457 0 -1.222417603770672 -2.2270584003813765 0 .23041824982273618 -.40758432304409714 0 -1.4392347205294587 .02342125831448503 2 .03341758056372694 .18706390160970152 0 .3458859680573202 -1.4088352968787703 0 .10065050176473565 -1.234722779034119 1 -.34492081259401763 .36260619817126105 4 1.6059971698394186 1.368892691952312 end

Comment