Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Hurdle model "churdle" output interpretation with interaction

    Dear Statalist members,

    I have a question regarding the interpretation of an output after the "churdle" command.

    I know that most authors report marginal effects as outputs since the coefficients cannot be interpreted as unconditional effects. Since my regression includes an interaction term with a dummy I can discuss marginal effects for both cases, but reporting them in a table is not very handy. (Or am I wrong and there exist a good way to do this?)

    Therefore, I would like to report the standard output and somehow interpret the respective coefficients. My understanding is that the estimates of the linear model are conditional estimates (conditional on selection). Is that correct?

    Let us consider the example of the stata manual that models the decision to exercise or not as a function of commute, whours, and age. These variables are written in select(). Once a decision to exercise is made, the time an individual exercises is modeled as a linear function of age, smoke, distance, and single.

    Code:
    . use http://www.stata-press.com/data/r14/fitness
    
    .
    . churdle linear hours age i.smoke distance i.single, select(commute whours age) ll(0)
     
    Iteration 0:   log likelihood = -23657.236  
    Iteration 1:   log likelihood = -23344.182  
    Iteration 2:   log likelihood = -23340.051  
    Iteration 3:   log likelihood = -23340.044  
    Iteration 4:   log likelihood = -23340.044  
    
    Cragg hurdle regression                         Number of obs     =     19,831
                                                    LR chi2(6)        =    9059.26
                                                    Prob > chi2       =     0.0000
    Log likelihood = -23340.044                     Pseudo R2         =     0.1625
    
    ------------------------------------------------------------------------------
           hours |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
    -------------+----------------------------------------------------------------
    hours        |
             age |   .0015116    .000763     1.98   0.048     .0000162     .003007
                 |
           smoke |
        smoking  |   -1.06646   .0460578   -23.15   0.000    -1.156731   -.9761879
        distance |  -.1333868   .0126344   -10.56   0.000    -.1581497   -.1086238
                 |
          single |
         single  |   .9940893   .0258775    38.42   0.000     .9433703    1.044808
           _cons |   .9138855   .0396227    23.06   0.000     .8362264    .9915447
    -------------+----------------------------------------------------------------
    selection_ll |
         commute |  -.2953345   .0624665    -4.73   0.000    -.4177666   -.1729024
          whours |   .0022974   .0069306     0.33   0.740    -.0112864    .0158811
             age |  -.0485347   .0006501   -74.65   0.000     -.049809   -.0472604
           _cons |   2.649945   .0499795    53.02   0.000     2.551987    2.747903
    -------------+----------------------------------------------------------------
    lnsigma      |
           _cons |   .0083199   .0099648     0.83   0.404    -.0112107    .0278506
    -------------+----------------------------------------------------------------
          /sigma |   1.008355    .010048                      .9888519    1.028242
    ------------------------------------------------------------------------------
    For instance, if the coefficient of age in the linear model is .0015116, can I say that given the individual is selected (given the individual exercises), a one unit increase of age, exercise hours increase .0015116 units? Is this interpretation correct?

    Thank you very much!!!

  • #2
    No you need to run margins command. See here: https://www.stata.com/manuals/rchurdle.pdf

    Comment

    Working...
    X