Hi all,
In a recent paper, Semykina & Wooldridge (2015) (link) suggest that Stata's -biprobit- command can be used to estimate average treatment effects (ATE) for binary responses with self selection into a binary treatment when the data has a panel structure.
I had seen previously from Austin Nichol's slides here that biprobit can be used for endogenous switching (self selection) with binary treatment and binary response, but I did not know that it could be extended to a panel data context. For the cross-sectional case, the command would be simply something like:
biprobit (Y= control_variables treatment) (treatment= control_variables Instruments)
Semykina & Wooldridge (2015) suggest that the above command can be modified for panel data, explaining briefly in a footnote:
MAIN QUESTIONS:
Can anyone here provide more details on how to implement this with stata? For example, what do they mean by “the augmented equation (with time averages)”? what are these time averages? Should I include year dummies? Is the panel data structure dealt with random effects in this method?
Also, are there ways to get ATE and ATET separately? Can we do it with the margins command? Finally, is there a way to perform the test for selection bias outlined in Semykina & Wooldridge (2015) after the biprobit command in Stata?
ALTERNATIVES:
As an alternative to the -biprobit- command, I think there must be a way to do this with the -cmp- command by David Roodman in a manner similar to that discussed in posts such as this or this. Any guidance on how to exactly implement the self selection case and calculate ATE and ATET with the -cmp- command would also be appreciated.
The -biprobit- command looks more attractive to me at this point because my panel data has a survey structure with probability weights and -biprobit- works with the -svy:- prefix. Although the requirement to use vce(cluster) noted by Semykina & Wooldridge (2015) will probably not let me use the svy prefix anyways.
Another approach based on control functions is outlined in Murtazashvili & Wooldridge (2016) but I can't find stata code for that either. I know the control function approach is what is used by stata's -eteffects- command, but that command also does not handle panel data.
References:
Murtazashvili & Wooldridge (2016) - A control function approach to estimating switching regression models with endogenous explanatory variables and endogenous switching
Semykina & Wooldridge (2015) - Binary response panel data models with sample selection and self selection
In a recent paper, Semykina & Wooldridge (2015) (link) suggest that Stata's -biprobit- command can be used to estimate average treatment effects (ATE) for binary responses with self selection into a binary treatment when the data has a panel structure.
I had seen previously from Austin Nichol's slides here that biprobit can be used for endogenous switching (self selection) with binary treatment and binary response, but I did not know that it could be extended to a panel data context. For the cross-sectional case, the command would be simply something like:
biprobit (Y= control_variables treatment) (treatment= control_variables Instruments)
Semykina & Wooldridge (2015) suggest that the above command can be modified for panel data, explaining briefly in a footnote:
in Stata estimating treatment effects can be implemented by pooling the data and estimating the augmented equation (with time averages) using the “biprobit” command. Standard errors robust to serial dependence can be obtained using “cluster” option.
Can anyone here provide more details on how to implement this with stata? For example, what do they mean by “the augmented equation (with time averages)”? what are these time averages? Should I include year dummies? Is the panel data structure dealt with random effects in this method?
Also, are there ways to get ATE and ATET separately? Can we do it with the margins command? Finally, is there a way to perform the test for selection bias outlined in Semykina & Wooldridge (2015) after the biprobit command in Stata?
ALTERNATIVES:
As an alternative to the -biprobit- command, I think there must be a way to do this with the -cmp- command by David Roodman in a manner similar to that discussed in posts such as this or this. Any guidance on how to exactly implement the self selection case and calculate ATE and ATET with the -cmp- command would also be appreciated.
The -biprobit- command looks more attractive to me at this point because my panel data has a survey structure with probability weights and -biprobit- works with the -svy:- prefix. Although the requirement to use vce(cluster) noted by Semykina & Wooldridge (2015) will probably not let me use the svy prefix anyways.
Another approach based on control functions is outlined in Murtazashvili & Wooldridge (2016) but I can't find stata code for that either. I know the control function approach is what is used by stata's -eteffects- command, but that command also does not handle panel data.
References:
Murtazashvili & Wooldridge (2016) - A control function approach to estimating switching regression models with endogenous explanatory variables and endogenous switching
Semykina & Wooldridge (2015) - Binary response panel data models with sample selection and self selection
Comment