Instrumental variables with binary endogenous regressor

Fei Wang replied

02 Jul 2022, 20:31
Originally posted by Devon Smith

Hi Jeff and Joao:

In the above example, which first-stage F-stat should be reported: D = A₀ + A₁Z + A₂X + V, or D = AZ + D-hat?

Devon, if you use only Z as the instrument, then the first-stage regression is as below, and the F statistic tests the significance of Z alone.

Code:

regress D Z X

If you use both Z and Dhat as instruments, then the first-stage regression is as below, and the F statistic tests the joint significance of Z and Dhat.

Code:

regress D Z Dhat X

ADD: But you shouldn't run the first stage "manually". Use -ivregress- and specify the full set of instruments there. After -ivregress-, run -estat firststage- (under homoskedasticity) or -weakivtest- (under heteroskedasticity; available from SSC) to obtain the correct F statistic.
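A minimal sketch of that workflow, assuming the variables are named Y, D, Z and X as in the thread, and that D_hat already holds fitted probabilities from a probit or logit of D on Z and X. -weakivtest- is a user-written command that has to be installed from SSC first; the sketch also assumes it needs the -avar- package. Drop D_hat from the instrument list if Z is the only instrument.

```stata
* One-time install of the user-written commands (assumed not yet installed)
ssc install avar
ssc install weakivtest

* Homoskedastic case: conventional first-stage F for the excluded instruments
ivregress 2sls Y (D = Z D_hat) X
estat firststage

* Heteroskedastic case: robust VCE, then the effective F statistic
ivregress 2sls Y (D = Z D_hat) X, vce(robust)
weakivtest
```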
Last edited by Fei Wang; 02 Jul 2022, 20:45.
Devon Smith replied

02 Jul 2022, 15:35
Hi Jeff and Joao:

In the above example, which first-stage F-stat should be reported: D = A₀ + A₁Z + A₂X + V, or D = AZ + D-hat?
Jeff Wooldridge replied

20 Sep 2019, 14:57
Mat: Yes, the standard errors are valid provided the error satisfies the zero conditional mean restriction in my earlier post (which is essentially needed for consistency, anyway). I show this in Chapter 6 of my MIT Press book. You should make them robust to heteroskedasticity, as usual.
Mat Log replied

20 Sep 2019, 12:39
Dear all,

Do we get the proper standard errors when running the following command that you suggested:

ivregress 2sls Y (D = Z D_hat) X

Thanks a lot,

Mat
Jeff Wooldridge replied

16 Jul 2017, 15:46
Provided E(U|X,Z) = 0, using either Z, Dhat, or both produces consistent estimators. Joao's proposal has the advantage of being asymptotically no worse than either of the other two approaches if Var(U|X,Z) is constant. It also produces an overidentifying restriction, although I'm not sure how valuable that is as a specification test in this context.

A potential downside is that using (Z,Dhat) is more likely to lead to a weak instruments problem. If the probit model for D is correct then Z is redundant as an IV. Therefore, the rule for having a strong enough instrument -- roughly, the first-stage F statistic is above 10 -- would be harder to achieve.
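The trade-off described here can be inspected directly. The following is only a sketch, assuming the thread's variable names and an already generated D_hat: it compares the first-stage statistics with Z alone against those with (Z, Dhat), and then tests the single overidentifying restriction that the second specification produces.

```stata
* Just-identified case: Z is the only excluded instrument
ivregress 2sls Y (D = Z) X, vce(robust)
estat firststage

* Overidentified case: Z and D_hat are both excluded instruments
ivregress 2sls Y (D = Z D_hat) X, vce(robust)
estat firststage

* One overidentifying restriction (two instruments, one endogenous regressor)
estat overid
```

If the first-stage F statistic falls noticeably when D_hat is added, that is consistent with the weak-instrument concern raised above.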
Joao Santos Silva replied

16 Jul 2017, 11:13
Dear Eric de Souza,

An LPM in the first stage is not wrong; that is actually what should be done.

The question is what instrument to use in the first stage. We can just use Z, but that is unlikely to be a good instrument if Z is continuous and D binary. An alternative is to replace Z with E(D|Z,X), which is likely a better instrument; this can be approximated using a logit or probit. What we did in the 1997 paper was actually to use both Z and an estimate of E(D|Z,X) obtained with a logit.

So, the command I would suggest is something like the following (which has an LPM in the first stage):

ivregress 2sls Y (D = Z D_hat) X
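For completeness, here is one way D_hat might be generated before running that command. This is a sketch under the assumption that a probit of D on Z and X is an adequate approximation to E(D|Z,X); a logit could be substituted, and vce(robust) is added following the advice given later in the thread.

```stata
* Approximate E(D|Z,X) with a probit and store the fitted probabilities
probit D Z X
predict D_hat, pr

* 2SLS with Z and D_hat as excluded instruments; the first stage is linear
ivregress 2sls Y (D = Z D_hat) X, vce(robust)
```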

Does it make sense?

Best wishes,

Joao
Eric de Souza replied

16 Jul 2017, 10:12
@João: Why would using an LPM in the first stage be wrong? And since the Dhat from a probit model is non-linear in X and Z, is there not a risk that it still be linearly correlated with U as defined in the equation in #1 above?
Joao Santos Silva replied

15 Jul 2017, 14:49
Dear Maria,

Just to add to the variety of (late) advice you are getting, I would suggest using both Z and D_hat as instruments. That is what we did in (see the top of page 291):

Windmeijer, F. and Santos Silva, J.M.C. (1997), Estimation of Count Data Models with Endogenous Regressors; An Application to Demand for Health Care, Journal of Applied Econometrics, 12(3), pp. 281-294.

Best wishes,

Joao
Eric de Souza replied

15 Jul 2017, 12:36
What you need is that Dhat is uncorrelated with the error in the original model: Y = B₀ + B₁D + B₂X + U.
A probit model in the first stage will not give you that, because it is nonlinear.
Just run a 2SLS regression using Z as an instrument for D: ivregress 2sls Y (D = Z) X
Peter Perera replied

15 Jul 2017, 12:08
Hi Maria,
See pages 142-144 of Mostly Harmless Econometrics by Angrist and Pischke. I believe you need to include only D_hat and X.
Marcos Almeida replied

26 Jun 2017, 08:17
I gather the biprobit model would fit your needs.