Dear All,
If valid instruments are not available, we can use Lewbel's (2018, 2012) heteroskedasticity-based constructed instrument. This has also been shown to be applicable in the case of binary endogenous variables.
It constructs valid instruments on the basis of information contained in the heteroskedasticity of the error term of the structural equation. The estimator gain efficiency, if we also include an external instrument. This what is available in my data, but which alone does not constitute a valid instrument, as shown via:
Estimating the Lewbel approach in a panel fixed effects model with clustered standard erros, I write:
Now I am wondering whether the excluded instrument can also be the fitted probit estimate of the first stage "y1_hat". This is a bit tricky in interpretation but yields a higher F-Test than the alternative with z1 (results are fairly similar)
If valid instruments are not available, we can use Lewbel's (2018, 2012) heteroskedasticity-based constructed instrument. This has also been shown to be applicable in the case of binary endogenous variables.
It constructs valid instruments on the basis of information contained in the heteroskedasticity of the error term of the structural equation. The estimator gain efficiency, if we also include an external instrument. This what is available in my data, but which alone does not constitute a valid instrument, as shown via:
Code:
xtlogit y1 z1 x, fe vce(boot)
predict y1_hat
xtivreg2 y2 x (y1 = y1_hat), fe cluster(hhid) first endog(I)
FIXED EFFECTS ESTIMATION
------------------------
Number of groups = 198 Obs per group: min = 3
avg = 3.0
max = 3
First-stage regressions
-----------------------
FIXED EFFECTS ESTIMATION
------------------------
Number of groups = 198 Obs per group: min = 3
avg = 3.0
max = 3
First-stage regression of y1:
Statistics robust to heteroskedasticity and clustering on hhid
Number of obs = 594
Number of clusters (hhid) = 198
------------------------------------------------------------------------------
| Robust
y1 | Coefficient std. err. t P>|t| [95% conf. interval]
-------------+----------------------------------------------------------------
y1_hat | -.2426688 .2516973 -0.96 0.336 -.7375261 .2521885
x1 | .2264221 .0475699 4.76 0.000 .1328958 .3199484
x2 | -.1929559 .0392403 -4.92 0.000 -.2701054 -.1158063
x3 | -.0001582 .0001217 -1.30 0.194 -.0003975 .000081
x4 | .0008101 .0001883 4.30 0.000 .0004398 .0011803
x5 | -.0160819 .0660399 -0.24 0.808 -.1459217 .113758
x6 | -.0001422 .0001727 -0.82 0.411 -.0004819 .0001974
------------------------------------------------------------------------------
F test of excluded instruments:
F( 1, 197) = 0.93
Prob > F = 0.3362
Sanderson-Windmeijer multivariate F test of excluded instruments:
F( 1, 197) = 0.93
Prob > F = 0.3362
Summary results for first-stage regressions
-------------------------------------------
(Underid) (Weak id)
Variable | F( 1, 197) P-val | SW Chi-sq( 1) P-val | SW F( 1, 197)
y1 | 0.93 0.3362 | 0.94 0.3313 | 0.93
NB: first-stage test statistics cluster-robust
Stock-Yogo weak ID F test critical values for single endogenous regressor:
10% maximal IV size 16.38
15% maximal IV size 8.96
20% maximal IV size 6.66
25% maximal IV size 5.53
Source: Stock-Yogo (2005). Reproduced by permission.
NB: Critical values are for i.i.d. errors only.
Underidentification test
Ho: matrix of reduced form coefficients has rank=K1-1 (underidentified)
Ha: matrix has rank=K1 (identified)
Kleibergen-Paap rk LM statistic Chi-sq(1)=0.77 P-val=0.3788
Weak identification test
Ho: equation is weakly identified
Cragg-Donald Wald F statistic 1.00
Kleibergen-Paap Wald rk F statistic 0.93
Stock-Yogo weak ID test critical values for K1=1 and L1=1:
10% maximal IV size 16.38
15% maximal IV size 8.96
20% maximal IV size 6.66
25% maximal IV size 5.53
Source: Stock-Yogo (2005). Reproduced by permission.
NB: Critical values are for Cragg-Donald F statistic and i.i.d. errors.
Weak-instrument-robust inference
Tests of joint significance of endogenous regressors B1 in main equation
Ho: B1=0 and orthogonality conditions are valid
Anderson-Rubin Wald test F(1,197)= 1.41 P-val=0.2363
Anderson-Rubin Wald test Chi-sq(1)= 1.43 P-val=0.2313
Stock-Wright LM S statistic Chi-sq(1)= 2.83 P-val=0.0922
Estimating the Lewbel approach in a panel fixed effects model with clustered standard erros, I write:
Code:
ivreg2h y2 X (y1 = z), fe cluster(hhid) first

Comment