Announcement

Collapse
No announcement yet.
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • Regression with two endogenous variables (one is interaction) and possible exclusion restriction violation?

    I am trying to estimate the naive regression:

    d = b + s + s*b, where s is endogenous. I'm instrumenting for s with iv.

    I've seen other posters run something similar to the following regression
    Code:
    ivreg2 d = b (s s_b = iv iv_b)
    where s_b and iv_b are interaction terms.

    However, I would really like to do something like this instead:
    Code:
    ivreg2 d = b (s = iv) (s_b = iv_b)
    because I'm don't think s should be predicted with iv_b since I worry that b might influence s.

    My questions are:
    (1) is there a way to run my second regression?
    (2) if my concern is correct, would that constitute a violation of the exclusion restriction?
    (3) is my second regression correct?

  • #2
    Krista: The command will not let you do what you want, and you should not do what you want to do. You should instead use the first command, or ivregress 2sls. Any time you have two or more endogenous explanatory variables, it is important to include everything treated as exogenous -- whether it's in the equation or not -- in every reduced form. This is what ivreg2 and ivregress do. This is Stata's way of protecting you from yourself. Who's to say that iv_b won't have some predictive power for s, or that iv has some predictive power for s_b? If they do, they need to be in the first stages, and the first command does that.

    Now, it's true that, if you look at the first stages, the hope is that iv "mostly" predicts s and iv_b "mostly" predicts the s_b, but there's no restriction as long as you let IV work they way it's supposed to. The only thing I would add is to use the "robust" option if you have cross-sectional data.

    JW

    Comment


    • #3
      Thank you! That makes sense.

      Comment

      Working...
      X