Dear Nick Cox Jeff Wooldridge and Clyde Schechter .
I have a query regarding dummy endogenous variable model (Heckman, 1978) and beautifully explained in Jeff Wooldridge Econometrics analysis of cross-section and panel data, 2010 (p. 622, ch.18)
Suppose we are trying to estimate the ATE (or ATT) of being a "union member" (treatment) on "wages" (outcome) of women. Further, let the treatment assignment is endogenous.
Therefore our baseline outcome equation is:
wages = a + b*X + c*union + e_i ----------------------------(outcome equation)
The decision to obtain the union membership is endogenous, and we are interested in obtaining consistent/unbiased estimates of the parameter (c) in such a case. The Heckman procedure provides the consistent estimates of treatment effect in such a scenario. I briefly explain the procedure below-
The binary treatment variable union is assumed to stem from the latent variable:
union* = W_i *gamma+ u_i
The decision to obtain a treatment is made according to the rule:
We will use the variables wage (wage), grade (years of schooling completed), smsa (an indicator for living in
an SMSA—standard metropolitan statistical area), black (an indicator for being African-American),
tenure (tenure at current job), and south (an indicator for living in the South). The variable union is a dummy (=1) for being a union member and zero otherwise.
Stage-1: The Heckman procedure runs a probit regression of the treatment equation on vector W_i :
P(union=1|W) = Phi(W_i *gamma) --------------------------- (treatment equation)
where W_i is a vector of variable that determine the treatment assignment.
where Phi isa standard normal density function
we calculate the hazard rate (h_i) for each observation in stage-1 such that:
Stage-2: We augment the outcome regression with the hazard (h_i) and the additional parameters (d1 and d2) are obtained on the interaction of [(union*h_i)] and [(1-union)*h_i], respectively. Therefore, we estimate
E(wages| X, union;W) = a + b*X + c*union + d1[union*(phi/Phi)] + d2[(1-union)*(-phi/1-Phi)] --------------(augmented model)
I run the following codes to obtain the parameter of interest (c). The effect of treatment is not allowed to vary at different levels of covariates, therefore we run the following:
(In this case ATE = ATET)
hazard(h) will create a new variable (h) containing the hazard from the treatment equation. The option first specifies that the first-step probit estimates of the treatment equation are also displayed.
If we want the effect of treatment to vary at different levels of covariates, we run the following:
(In this case ATE!= ATET)
we use margins command to calculate ATE and ATET respectively,
By running these codes, I get coefficient (c) on the treatment dummy (union ) along with other covariates. The codes generate the hazard (h_i) variable for each observation. But I did not obtain the coefficients d1 and d2, respectively, from the augmented outcome model. How should I tweak the code to obtain ATE and ATET along with coefficients d1 and d2. Please get back to me, I shall be very thankful.
Regards,
(Ridwan)
I have a query regarding dummy endogenous variable model (Heckman, 1978) and beautifully explained in Jeff Wooldridge Econometrics analysis of cross-section and panel data, 2010 (p. 622, ch.18)
Suppose we are trying to estimate the ATE (or ATT) of being a "union member" (treatment) on "wages" (outcome) of women. Further, let the treatment assignment is endogenous.
Therefore our baseline outcome equation is:
wages = a + b*X + c*union + e_i ----------------------------(outcome equation)
The decision to obtain the union membership is endogenous, and we are interested in obtaining consistent/unbiased estimates of the parameter (c) in such a case. The Heckman procedure provides the consistent estimates of treatment effect in such a scenario. I briefly explain the procedure below-
The binary treatment variable union is assumed to stem from the latent variable:
union* = W_i *gamma+ u_i
The decision to obtain a treatment is made according to the rule:
Code:
union = { 1, if union* > 0 { 0, otherwise
an SMSA—standard metropolitan statistical area), black (an indicator for being African-American),
tenure (tenure at current job), and south (an indicator for living in the South). The variable union is a dummy (=1) for being a union member and zero otherwise.
Stage-1: The Heckman procedure runs a probit regression of the treatment equation on vector W_i :
P(union=1|W) = Phi(W_i *gamma) --------------------------- (treatment equation)
where W_i is a vector of variable that determine the treatment assignment.
where Phi isa standard normal density function
we calculate the hazard rate (h_i) for each observation in stage-1 such that:
Code:
h_i = {phi(.)/Phi(.) ; union=1 { -phi(.)/1-Phi(.) ; union=0
E(wages| X, union;W) = a + b*X + c*union + d1[union*(phi/Phi)] + d2[(1-union)*(-phi/1-Phi)] --------------(augmented model)
I run the following codes to obtain the parameter of interest (c). The effect of treatment is not allowed to vary at different levels of covariates, therefore we run the following:
(In this case ATE = ATET)
Code:
use https://www.stata-press.com/data/r17/union3, clear etregress wage age grade smsa black tenure, treat(union = south black tenure) hazard(h) first
If we want the effect of treatment to vary at different levels of covariates, we run the following:
(In this case ATE!= ATET)
Code:
use https://www.stata-press.com/data/r17/union3, clear etregress wage age grade smsa i.union#c.(black tenure), treat(union = south black tenure) hazard(h) first vce(robust)
Code:
margins r.union, vce(unconditional) contrast(nowald) // ATE margins r.union, vce(unconditional) contrast(nowald) subpop(union) // ATET
By running these codes, I get coefficient (c) on the treatment dummy (union ) along with other covariates. The codes generate the hazard (h_i) variable for each observation. But I did not obtain the coefficients d1 and d2, respectively, from the augmented outcome model. How should I tweak the code to obtain ATE and ATET along with coefficients d1 and d2. Please get back to me, I shall be very thankful.
Regards,
(Ridwan)
Comment