Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Simultaneous Probit Model - Binary dependent and endogenous

    I am trying to avoid the Forbidden regression. I have two simultaneous equations one for poverty and the other for informal employment specified as follows

    Poor = B0 + B1 Informal employment + B2X +B3Z1+ U ..........(1)

    Informal Employment = B0 + B1 Poor + B2X +B3Z2+ U...........(2)

    where both dependent and endogenous variables are binary for the two equations and vector X has same exogenous variables for eqn 1 &2 , Z1 and Z2 are instruments

    I was following Maddala(1983), who suggested estimating probit ML in the first and second stages; however, after reading Angrist, I discovered this is impossible and leads to forbidden regression. rather i should use LPM. Kindly assist me in working this out for my two simultaneous equations..

    Thanks

  • #2
    It's not a bad idea to use linear models and estimate each equation by 2SLS. It's true estimating first-stage probits doesn't get you very far.

    I believe the -cmp- command can estimate the joint model by MLE if you want to account for the binary nature, although logical consistency and convergence can be issues.

    I do wonder what kind of causality you're hoping to establish. That informal employment "causes" poverty? I'm not sure the two equation passes the so-called "autonomy" requirement. Does each equation live on its own, apart from the other? Good SEM applications have this feature. A demand function is one side of the market, supply is the other. Cities choose a police force size, criminal commit crimes. And so on. It's a gray area in your case, but worth thinking about. I've said in my writing before: just because two variables are jointly determined doesn't mean an SEM is appropriate.

    Comment


    • #3
      Originally posted by Jeff Wooldridge View Post
      It's not a bad idea to use linear models and estimate each equation by 2SLS. It's true estimating first-stage probits doesn't get you very far.

      I believe the -cmp- command can estimate the joint model by MLE if you want to account for the binary nature, although logical consistency and convergence can be issues.

      I do wonder what kind of causality you're hoping to establish. That informal employment "causes" poverty? I'm not sure the two equation passes the so-called "autonomy" requirement. Does each equation live on its own, apart from the other? Good SEM applications have this feature. A demand function is one side of the market, supply is the other. Cities choose a police force size, criminal commit crimes. And so on. It's a gray area in your case, but worth thinking about. I've said in my writing before: just because two variables are jointly determined doesn't mean an SEM is appropriate.
      Thank you for the cmp suggestion.

      So, from the literature, there is a bidirectional relationship between informality and poverty, with various factors influencing both. Low earnings from informal employment significantly contribute to household poverty. On the other hand, household poverty can compel the head of the household to accept informal work, as they cannot afford to remain unemployed. so I wanted to analyse this bidirectional relationship. Given the simultaneity of informal employment and household poverty, estimating the equations by ordinary least squares would result in inconsistent and biased estimates of the determinants of household heads’ informal employment and of the implications on household poverty as suggested by https://www.journals.uchicago.edu/do...urnalCode=edcc. So that's why I wanted to employ a simultaneous maximum likelihood probit model to account for the existing reverse causality. I have tried the cmp however the system is not converging maybe since the two variables appear in both equations either as dependent or endogenous simultaneously.

      Comment


      • #4
        For an individual, quantity of chicken and quantity of broccoli are also simultaneously determined. Would you use an SEM for them? No, because demand functions are quantities as functions of prices and income and tastes. Modeling one quantity as a function of other quantities doesn’t answer an interesting question. This is the sort of example I’m thinking about.

        Comment


        • #5
          Hello Sir,

          I’m working with a model that includes both a binary dependent variable and a binary endogenous regressor. Specifically, I’m estimating the effect of bank account ownership (a dummy variable) on poverty status (also a dummy), while controlling for other relevant factors. However, the model is likely affected by endogeneity due to issues such as omitted variable bias and simultaneity. What estimation method should I use in this context?

          Thank you.

          Comment


          • #6
            Do you have a valid instrumental variable for bank account ownership?

            Comment


            • #7
              Yes, Sir.

              Also, please suggest a relevant command in Stata. I can't use ivprobit as it requires the endogenous variable to be continuous.
              Last edited by Varsha Vaishnav; 02 Jul 2025, 20:26.

              Comment


              • #8
                The biprobit command works in this case, assuming you are estimating a single equation with y1 = poverty on the left and y2 = bank account ownership on the right.

                Comment


                • #9
                  Sir, I'm not estimating a single equation. It's like a 2SLS IV setup.

                  Comment


                  • #10
                    Right. What I should have said is that y2 is not also a function of y1. The equation for y2 consists of z1 and z2, where z2 includes the instrumental variables and z1 are exogenous variables in the main equation. Each of z1 and z2 can be a list of variables:

                    Code:
                    biprobit (y1 = y2 z1)  (y2 = z1 z2)
                    You can find a discussion in Chapter 15 of my 2010 MIT Press book.

                    Comment


                    • #11
                      biprobit should work. I wonder if "eregress entreat" might be useful. It's linear on the DV but dichotomous on the treatment.

                      Comment


                      • #12
                        Thank you, Sir.

                        Comment

                        Working...
                        X