Hi, I am doing coarsened exact matching (CEM) and I am concerned about the dropped observations. I define the treatment indicator on the pre-treatment period like this:
drop if treated==.
gen cem_treated = 1 if treated==1 & Year<2011
replace cem_treated=0 if cem_treated==.
Then I use cem_treated as the treatment variable and match exactly, like this:
cem Employees(#0) sector(#0), tr(cem_treated)
Before I drop cem_matched==0, as I should, I have an equal number of observations each year in the control group and an equal number each year in the treatment group. When I drop cem_matched==0, I of course delete unmatched controls (but the counts are still equal before and after treatment). For the treatment group, however, a lot of observations are dropped in the POST-treatment period. Missing values in the difference-in-differences variables could cause this, but that is not the case here.
I also tried matching on the whole period to check for that. Then it works fine: it only drops control observations and a few treated ones, and the counts are equal before and after treatment. The command in that case was:
cem Employees(#0) sector(#0), tr(treated)
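To show what I am comparing, here is a minimal sketch of the check I have in mind. It assumes the data from the first cem run (with tr(cem_treated)) are still in memory, before dropping anything, and it only uses the variables named above plus cem_matched, which cem creates.

```stata
* Among originally treated observations, show by year how the
* pre-period flag cem_treated is coded (1 before 2011, 0 from 2011 on)
tabulate Year cem_treated if treated == 1

* Among originally treated observations, show by year how many were
* matched (cem_matched == 1) versus unmatched (cem_matched == 0)
tabulate Year cem_matched if treated == 1
```

The first table shows which group each treated observation was assigned to for matching in each year, and the second shows in which years the treated observations are the ones being dropped.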
Does anyone know why observations are dropped in the post-period when the matching is done on the pre-period?
Thanks in advance