Default -margins- prediction for -heckprobit-

Mike Lacy

Join Date: Apr 2014

Posts: 2449
#1

Default -margins- prediction for -heckprobit-

10 May 2021, 13:45

I'm looking to try to understand the postestimation prediction quantities -margins- offers for -heckprobit-. I've realized that the default predicted values and derivatives for the “outcome” variable do not reflect variables that appear only in the selection equation. This stems from the so-called "pmargin" default that -margins- uses on for -heckprobit-. (See here for the full documentation.)

I find this surprising because I thought a key substantive point of a selection model was to recognize that in the wider population, the outcome of interest depends partly on variables that affect the selection process, insofar as being selected is necessary to experience the ultimate outcome.

To fix ideas, let's imagine a study of persons in which the ultimate outcome of interest is alcohol "abuse," for which one must first be selected into "any use" of alcohol. Let's suppose that "religious conservatism" (RC) affects selection into "any use," but not "abuse," as opposed to "reasonable use." If we take the default post-estimation prediction from -margins-, we'll implicitly ignore that RC affects "abuse" by preventing one from entering a state without which "abuse" can't occur. From this perspective,using one of the non-default predictions that -margins- offers would make more sense, in particular one that is documented as:

p11: Φ₂(x_jb,z_jg,?), predicted probability Pr(y_j^probit=1, y_j^select=1)
where Φ₂ is the bivariate normal probability function, y_j^{probit _{is the outcome of interest, and}}y_j^{select _{is the outcome in the selection process.}}

I'd interpret this to mean the probability that an individual is selected and goes on to experience the outcome of interest.

I'd think that this "p11" prediction would clearly be preferred here, and by extension in other similar and common situations. Because the default choices in Stata typically reflect the most typical and sensible choice, I'm questioning my thinking about the preferred prediction choice and I'd appreciate some reaction. I'm not recognizing any discussion of this point in the more abstract discussion in the literature. Can anyone offer some some clarification or a countering perspective?
Tags: None

Mike Lacy

Join Date: Apr 2014
Posts: 2449

10 May 2021, 14:03

At the risk of excessive length, but in order to avoid the dreaded no-details "it doesn't work" posting, I'll follow up with illustration using the (infelicitous) example data used for -heckprobit- in the manual:

Code:

use http://www.stata-press.com/data/r15/school
heckprobit private years logptax, select(vote=years loginc logptax)

Probit model with sample selection              Number of obs     =         95
                                                      Selected    =         59
                                                      Nonselected =         36

                                                Wald chi2(2)      =       1.04
Log likelihood = -74.24497                      Prob > chi2       =     0.5935

------------------------------------------------------------------------------
             |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
private      |
       years |  -.1142596   .1461715    -0.78   0.434    -.4007505    .1722313
     logptax |   .3516101   1.016483     0.35   0.729     -1.64066     2.34388
       _cons |  -2.780667   6.905827    -0.40   0.687    -16.31584    10.75451
-------------+----------------------------------------------------------------
vote         |
       years |  -.0167511   .0147735    -1.13   0.257    -.0457067    .0122045
      loginc |   .9923023   .4430008     2.24   0.025     .1240368    1.860568
     logptax |  -1.278783   .5717545    -2.24   0.025    -2.399401   -.1581646
       _cons |  -.5458205   4.070417    -0.13   0.893    -8.523692    7.432051
-------------+----------------------------------------------------------------
     /athrho |  -.8663164   1.450017    -0.60   0.550    -3.708298    1.975665
-------------+----------------------------------------------------------------
         rho |  -.6994978   .7405281                     -.9987983    .9622674
------------------------------------------------------------------------------
LR test of indep. eqns. (rho = 0):   chi2(1) =     0.27   Prob > chi2 = 0.6020


. margins, at(loginc = (9.8 10.2))

Predictive margins                              Number of obs     =         95
Model VCE    : OIM

Expression   : Pr(private=1), predict()

1._at        : loginc          =         9.8

2._at        : loginc          =        10.2

------------------------------------------------------------------------------
             |            Delta-method
             |     Margin   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         _at |
          1  |    .171889   .2116119     0.81   0.417    -.2428627    .5866407
          2  |    .171889   .2116119     0.81   0.417    -.2428627    .5866407
------------------------------------------------------------------------------

Note that the different values of loginc (25th and 75th percentiles) don't affect the predicted probability in the preceding, but see that they do in the following:

Code:

. margins, predict(p11) at(loginc = (9.8 10.2))

Predictive margins                              Number of obs     =         95
Model VCE    : OIM

Expression   : Pr(private=1,vote=1), predict(p11)

1._at        : loginc          =         9.8

2._at        : loginc          =        10.2

------------------------------------------------------------------------------
             |            Delta-method
             |     Margin   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         _at |
          1  |   .0295966   .0232421     1.27   0.203    -.0159571    .0751502
          2  |   .0547779   .0249651     2.19   0.028     .0058472    .1037087
------------------------------------------------------------------------------

Comment

Percy Fang

Join Date: Nov 2021

Posts: 2
#3

22 Aug 2022, 05:33

Hi Mike,

I have the exactly same question. Have you figured this out?

Best,
Percy
Comment
Mike Lacy

Join Date: Apr 2014

Posts: 2449
#4

22 Aug 2022, 09:03

I don't think I ever quite figured this out, and looking back now, I'm having trouble bringing back to mind what I originally *did* understand. I'd welcome response from the various knowledgeable people here.
Comment
Percy Fang

Join Date: Nov 2021

Posts: 2
#5

25 Aug 2022, 20:01

Thanks Mike for your following-up. If we look at the predict options for 'biprobit' (a relevant model), the default is p11 as you mentioned, which makes more sense.
Actually, i don't have a good understanding on pmargin ((marginal) predicted probability of success Pr(y^probit_j= 1).). Why 'predicted probability of success can skip the step of selection? Could you help me explain its meaning in a more intuitive way?

Best,
Percy
Comment

Announcement

Default -margins- prediction for -heckprobit-

Comment

Comment

Comment

Comment