Instrumental Variable using Panel Data with Binary Endogenous Variable

Helen Liang

Join Date: Nov 2019

Posts: 2
#1

Instrumental Variable using Panel Data with Binary Endogenous Variable

11 Nov 2019, 09:53

Dear all,

Can someone let me know how to implement a procedure similar to XTIVREG but with a binary endogenous variable? Here is the model I want to estimate:

xtivreg DV controls (x1 = z1 z2), fe

where controls are exogenous variables, z1 & z2 are instruments;
x1 is a binary endogenous variable (0 or 1) and the first stage is a logit model so this doesn't fit the xtivreg specification.

Is there an existing procedures to implement this? Or do I have to do the two stages manually?

If I do the two stages manually, how do I adjust the standard errors in the second stage for significance tests?

Thank you very much!
Tags: None
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2126
#2

11 Nov 2019, 20:12

Helen: A couple of things. First, there is no need to use a logit first stage for x1 even though it is binary. If you are assuming the standard model with a constant coefficient on x1 then ysing the above command, but with the addition of vce(cluster id) is sufficient.

Nevertheless, you may get more efficiency by using a first-stage logit estimation. You haven't shown exactly how you're doing that. Is it a correlated random effects logit? Just a standard pooled logit? In any case, assuming the model is correctly specified, you don't need to adjust the standard errors in the second stage if you are using the fitted logit probabilities as instruments (not regressors). So something like this should do it:

Code:

logit x1 controls z1 z2 predict p1hat xtivreg DV controls (x1 = p1hat), fe vce(cluster id)

Incidentally, I strongly recommend time effects in both models if this is a true panel data.

JW
1 like
Comment
Helen Liang

Join Date: Nov 2019

Posts: 2
#3

13 Nov 2019, 07:39

Thanks a lot, Jeff!

The first-stage is a random effects logit (but not a correlated random effects logit), because we assume x1, the endogenous variable (manager's investment choice), is determined by unobserved firm level variables, some of which are random. We also did a standard pooled logit just as a reference. The code used for the first stage is:

xtlogit x1 controls z1 z2, vce(robust)
predict p1hat, pr

I suppose we should go ahead using the xtivreg, and include the separate two stage manual estimation as a reference, in case significance levels vary between the two methods.
Thank you so much for the help!
Comment
Nitin Jain

Join Date: Apr 2022

Posts: 65
#4

06 May 2024, 07:09

D

Originally posted by Jeff Wooldridge View Post

Helen: A couple of things. First, there is no need to use a logit first stage for x1 even though it is binary. If you are assuming the standard model with a constant coefficient on x1 then ysing the above command, but with the addition of vce(cluster id) is sufficient.

Nevertheless, you may get more efficiency by using a first-stage logit estimation. You haven't shown exactly how you're doing that. Is it a correlated random effects logit? Just a standard pooled logit? In any case, assuming the model is correctly specified, you don't need to adjust the standard errors in the second stage if you are using the fitted logit probabilities as instruments (not regressors). So something like this should do it:

Code:

logit x1 controls z1 z2 predict p1hat xtivreg DV controls (x1 = p1hat), fe vce(cluster id)

Incidentally, I strongly recommend time effects in both models if this is a true panel data.

JW

Dear Prof. Wooldridge, Could you please share a reference for this? Thanks.
Comment

Announcement

Instrumental Variable using Panel Data with Binary Endogenous Variable

Comment

Comment

Comment