Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Default -margins- prediction for -heckprobit-

    I'm looking to try to understand the postestimation prediction quantities -margins- offers for -heckprobit-. I've realized that the default predicted values and derivatives for the “outcome” variable do not reflect variables that appear only in the selection equation. This stems from the so-called "pmargin" default that -margins- uses on for -heckprobit-. (See here for the full documentation.)

    I find this surprising because I thought a key substantive point of a selection model was to recognize that in the wider population, the outcome of interest depends partly on variables that affect the selection process, insofar as being selected is necessary to experience the ultimate outcome.

    To fix ideas, let's imagine a study of persons in which the ultimate outcome of interest is alcohol "abuse," for which one must first be selected into "any use" of alcohol. Let's suppose that "religious conservatism" (RC) affects selection into "any use," but not "abuse," as opposed to "reasonable use." If we take the default post-estimation prediction from -margins-, we'll implicitly ignore that RC affects "abuse" by preventing one from entering a state without which "abuse" can't occur. From this perspective,using one of the non-default predictions that -margins- offers would make more sense, in particular one that is documented as:

    p11: Φ2(xjb,zjg,?), predicted probability Pr(yjprobit =1, yjselect =1)
    where Φ2 is the bivariate normal probability function, yjprobit is the outcome of interest, and yjselect is the outcome in the selection process.

    I'd interpret this to mean the probability that an individual is selected and goes on to experience the outcome of interest.

    I'd think that this "p11" prediction would clearly be preferred here, and by extension in other similar and common situations. Because the default choices in Stata typically reflect the most typical and sensible choice, I'm questioning my thinking about the preferred prediction choice and I'd appreciate some reaction. I'm not recognizing any discussion of this point in the more abstract discussion in the literature. Can anyone offer some some clarification or a countering perspective?



  • #2
    At the risk of excessive length, but in order to avoid the dreaded no-details "it doesn't work" posting, I'll follow up with illustration using the (infelicitous) example data used for -heckprobit- in the manual:
    Code:
    use http://www.stata-press.com/data/r15/school
    heckprobit private years logptax, select(vote=years loginc logptax)
    
    Probit model with sample selection              Number of obs     =         95
                                                          Selected    =         59
                                                          Nonselected =         36
    
                                                    Wald chi2(2)      =       1.04
    Log likelihood = -74.24497                      Prob > chi2       =     0.5935
    
    ------------------------------------------------------------------------------
                 |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
    -------------+----------------------------------------------------------------
    private      |
           years |  -.1142596   .1461715    -0.78   0.434    -.4007505    .1722313
         logptax |   .3516101   1.016483     0.35   0.729     -1.64066     2.34388
           _cons |  -2.780667   6.905827    -0.40   0.687    -16.31584    10.75451
    -------------+----------------------------------------------------------------
    vote         |
           years |  -.0167511   .0147735    -1.13   0.257    -.0457067    .0122045
          loginc |   .9923023   .4430008     2.24   0.025     .1240368    1.860568
         logptax |  -1.278783   .5717545    -2.24   0.025    -2.399401   -.1581646
           _cons |  -.5458205   4.070417    -0.13   0.893    -8.523692    7.432051
    -------------+----------------------------------------------------------------
         /athrho |  -.8663164   1.450017    -0.60   0.550    -3.708298    1.975665
    -------------+----------------------------------------------------------------
             rho |  -.6994978   .7405281                     -.9987983    .9622674
    ------------------------------------------------------------------------------
    LR test of indep. eqns. (rho = 0):   chi2(1) =     0.27   Prob > chi2 = 0.6020
    
    
    . margins, at(loginc = (9.8 10.2))
    
    Predictive margins                              Number of obs     =         95
    Model VCE    : OIM
    
    Expression   : Pr(private=1), predict()
    
    1._at        : loginc          =         9.8
    
    2._at        : loginc          =        10.2
    
    ------------------------------------------------------------------------------
                 |            Delta-method
                 |     Margin   Std. Err.      z    P>|z|     [95% Conf. Interval]
    -------------+----------------------------------------------------------------
             _at |
              1  |    .171889   .2116119     0.81   0.417    -.2428627    .5866407
              2  |    .171889   .2116119     0.81   0.417    -.2428627    .5866407
    ------------------------------------------------------------------------------
    Note that the different values of loginc (25th and 75th percentiles) don't affect the predicted probability in the preceding, but see that they do in the following:

    Code:
    . margins, predict(p11) at(loginc = (9.8 10.2))
    
    Predictive margins                              Number of obs     =         95
    Model VCE    : OIM
    
    Expression   : Pr(private=1,vote=1), predict(p11)
    
    1._at        : loginc          =         9.8
    
    2._at        : loginc          =        10.2
    
    ------------------------------------------------------------------------------
                 |            Delta-method
                 |     Margin   Std. Err.      z    P>|z|     [95% Conf. Interval]
    -------------+----------------------------------------------------------------
             _at |
              1  |   .0295966   .0232421     1.27   0.203    -.0159571    .0751502
              2  |   .0547779   .0249651     2.19   0.028     .0058472    .1037087
    ------------------------------------------------------------------------------

    Comment


    • #3
      Hi Mike,

      I have the exactly same question. Have you figured this out?

      Best,
      Percy

      Comment


      • #4
        I don't think I ever quite figured this out, and looking back now, I'm having trouble bringing back to mind what I originally *did* understand. I'd welcome response from the various knowledgeable people here.

        Comment


        • #5
          Thanks Mike for your following-up. If we look at the predict options for 'biprobit' (a relevant model), the default is p11 as you mentioned, which makes more sense.
          Actually, i don't have a good understanding on pmargin ((marginal) predicted probability of success Pr(yprobitj = 1).). Why 'predicted probability of success can skip the step of selection? Could you help me explain its meaning in a more intuitive way?

          Best,
          Percy

          Comment

          Working...
          X