Dear community members,
I am trying to apply the fixed effects estimator that accounts for a possible sample selection bias proposed by wooldridge (1995). I am wondering if anyone could tell me wheather the following is the right approach:
(A short Introduction to the model)
In the model the equation of primary interest is defined as:
y_it = x_itβ + α_i + ε_it
and the selection equation as:
d*_it = z_itγ + µ_i + u_it
d_it = 1 if d*_it>0, 0 otherwise
where y_it is only observed, if d_it=1 and α_i and µ_i are the individual-specific time-invariant unobserved effects.
Wooldridge (1995) then derives the following estimator:
Step 1: For each t, run a a cross-sectional probit model for participation and for d_it=1, compute the Inverse Mills Ratio (lambda_hat)
Step 2: Define: w_hatit=(1,xi1,xi2,...xiT,xit,0,...,0,lambda_hatit,0,...0)
Step 3: Run a pooled OLS regression on: yit=w_hatitθ+eit if d_it=1
What i have done so far is:
to obtain the xi1,xi2,...xiT:
For each dependent variable I ran:
to obtain 0,...,0,lambda_hatit,0,...0:
then I ran the following regression:
y= x1_i1991-2012 x2_i1991-2012 ... x5_i1991-2012 x1-x5 lambda_hat_d_1991-2012 if d_it=1
Thank you so much in advance
Johann
Wooldridge, J. M. (1995), ‘‘Selection Corrections for Panel Data Models under Conditional Mean IndependenceAssumptions,’’ Journal of Econometrics 68, 115-132.
I am trying to apply the fixed effects estimator that accounts for a possible sample selection bias proposed by wooldridge (1995). I am wondering if anyone could tell me wheather the following is the right approach:
(A short Introduction to the model)
In the model the equation of primary interest is defined as:
y_it = x_itβ + α_i + ε_it
and the selection equation as:
d*_it = z_itγ + µ_i + u_it
d_it = 1 if d*_it>0, 0 otherwise
where y_it is only observed, if d_it=1 and α_i and µ_i are the individual-specific time-invariant unobserved effects.
Wooldridge (1995) then derives the following estimator:
Step 1: For each t, run a a cross-sectional probit model for participation and for d_it=1, compute the Inverse Mills Ratio (lambda_hat)
Step 2: Define: w_hatit=(1,xi1,xi2,...xiT,xit,0,...,0,lambda_hatit,0,...0)
Step 3: Run a pooled OLS regression on: yit=w_hatitθ+eit if d_it=1
What i have done so far is:
to obtain the xi1,xi2,...xiT:
For each dependent variable I ran:
Code:
forvalues i = 1991 1992 to 2012 { by personID: gen x1_i`i' = x1[`i'-1990] }
Code:
forvalues i = 1991 1992 to 2012 { quietly probit d z1-z7 if year==`i' quietly predict xb_hat_`i', xb quietly gen lambda_hat_`i' = normalden(xb_hat_`i')/normal(xb_hat_`i') if d==1 quietly gen lambda_hat_d_`i' = lambda_hat_`i' quietly replace lambda_hat_d_`i' = 0 if year!=`i' quietly drop xb_hat_`i' lambda_hat_`i' }
y= x1_i1991-2012 x2_i1991-2012 ... x5_i1991-2012 x1-x5 lambda_hat_d_1991-2012 if d_it=1
Thank you so much in advance
Johann
Wooldridge, J. M. (1995), ‘‘Selection Corrections for Panel Data Models under Conditional Mean IndependenceAssumptions,’’ Journal of Econometrics 68, 115-132.
Comment