Difference-in Difference weighted with propensity scores

Fathima Salih

Join Date: Jun 2020

Posts: 25
#1

Difference-in Difference weighted with propensity scores

10 Jun 2020, 08:22

Hello,

I'm using a difference-in-difference (DID) model weighted with propensity scores to estimate the impact of a treatment on student test scores.

The data are for the same schools in both 2012 (baseline, pre-treatment) and 2016 (endline, post-treatment). But the students in each of the rounds are different and there mat be significant differences in composition between treatment and control groups. So I calculate propensity score weights as follows using psmatch2, for the baseline and endline data separately:

Code:

psmatch2 treatment inc nobreakfast if year==2012 rename _pscore pscore2012 psmatch2 treatment inc nobreakfast if year==2016 * create a weight that is the inverse (??) of pscore gen ps_weight=1/pscore2012 if year==2012 replace ps_weight=1/_pscore if year==2016

I then use these weights in the DID model as follows

Code:

xtset school_id xtreg stdmath i.y2016##i.treatment inc nobreakfast ps_weight, fe vce(robust)

Question 1: Is the above calculation of weights correct? i.e., using the inverse of the propensity scores to calculate the weight and inserting it as above in the DID model.

Question 2: The bigger problem is that there are actually 4 groups: treatment pre, treatment post, control pre, control post. So shouldn't the propensity scores be estimated for each student relative to one of the 4 groups such as treatment pre? How do I do this on Stata? Is it as below?

Code:

gen group= 1 if year==2012 & treatment==1 replace group= 2 if year==2012 & treatment==0 replace group= 3 if year==2016 & treatment==1 replace group= 4 if year==2016 & treatment==0 mlogit group inc nobreakfast y2016 predict ps gen ps_weight2=1/ps

* References: E. Leuven and B. Sianesi. (2003). "PSMATCH2: Stata module to perform full
Mahalanobis and propensity score matching, common support graphing, and covariate
imbalance testing". http://ideas.repec.org/c/boc/bocode/s432001.html.

The data used is:
* Example generated by -dataex-. To install: ssc install dataex
clear
input float(school_id year treatment stud_id) double nobreakfast float(y2016 stdmath inc)
18 2012 1 5 0 0 -.20173773 6.584059
18 2012 1 11 0 0 -.7822701 6.974636
18 2012 1 19 0 0 1.3076463 7.804821
18 2016 1 38 0 1 -.8983765 6.926063
18 2016 1 39 0 1 -1.0725362 7.141726
19 2012 1 49 0 0 2.004285 7.547534
19 2012 1 50 0 0 .6690608 7.30482
19 2016 1 59 1 1 .9593269 6.634033
19 2016 1 66 0 1 1.3656995 6.412418
19 2016 1 67 0 1 1.3656995 9.253709
19 2016 1 68 1 1 .3207414 5.673886
19 2016 1 69 1 1 -.02757803 7.625872
19 2016 1 70 0 1 -.7242168 7.333391
19 2016 1 71 0 1 -.4339507 7.54091
19 2016 1 72 0 1 .14658166 9.662361
19 2016 1 73 1 1 .3787946 6.246556
19 2016 1 74 1 1 -1.014483 7.201874
19 2016 1 75 0 1 .26268813 7.833391
19 2016 1 76 1 1 -.3178442 5.33342
20 2012 1 77 1 0 -1.4208556 5.031689
20 2012 1 78 0 0 -.14368449 5.297753
20 2012 1 79 0 0 -.3178442 7.30482
20 2012 1 83 0 0 -.7242168 6.292029
20 2016 1 99 0 1 .727114 5.673886
20 2016 1 100 1 1 -.14368449 7.040909
end
[/CODE]
Tags: None
Dimitriy V. Masterov

Join Date: Mar 2014

Posts: 609
#2

10 Jun 2020, 13:54

On Q1, take a look at this post on CV that contains Stata code of PSM DID and IPW DID.

I think compositional differences between treatment and control are OK, as long as the effect is time-invariant (conditional on controls).
Comment
Dung Le

Join Date: May 2018

Posts: 120
#3

11 Jun 2020, 10:54

Originally posted by Dimitriy V. Masterov View Post

On Q1, take a look at this post on CV that contains Stata code of PSM DID and IPW DID.

I think compositional differences between treatment and control are OK, as long as the effect is time-invariant (conditional on controls).

Hi Dimitriy V. Masterov,

I am unable to install - xfill- command by using net from http://www.sealedenvelope.com/. Do you have any other ways to install the above-mentioned command?

Thank you.
Comment

Dimitriy V. Masterov

Join Date: Mar 2014
Posts: 609

11 Jun 2020, 14:25

Dung Le "unable to install" is not nearly enough detail for me to suggest a solution. To misquote Tolstoy, happy Stata users are all alike; every unhappy one is unhappy in its own way. At the very least, provide an error code or a message.

But two possible solutions spring to mind immediately:

1) Use some other command to fill in the missing values, like carryforward.

2) Here's the code for xfill.ado that you can paste into your do-file (or stick in a file called xfill.ado in a directory that Stata checks):

Code:

*! v1.0.0 08/07/2002 ARB
program define xfill, sortpreserve
    version 7
    syntax varlist [if] [in] [, I(varname)]
    xt_iis `i'
    local ivar "`s(ivar)'"
    marksample touse, novarlist strok
    qui {
    count if `touse'== 1
    if r(N)==0 {
        disp "{error}no observations"
        exit 2000
    }
        tempvar miss ok
        foreach xvar of local varlist {
        gen byte `miss'=missing(`xvar')
                sort `ivar' `miss'
                by `ivar' `miss': gen byte `ok'=`xvar'[_n]==`xvar'[_n-1] if _n>1
                recode `ok' .=1
                cap assert `ok'==1 if `touse'
                if _rc {
                        nois disp "{txt}`xvar' is not constant within `ivar' -> fill not performed"
                }
                else {
                        by `ivar': replace `xvar'=`xvar'[1] if `touse'
                }
                drop `miss' `ok'
        }
end

Comment

leon xf

Join Date: Oct 2017

Posts: 10
#5

12 May 2024, 17:24

Originally posted by Dimitriy V. Masterov View Post

On Q1, take a look at this post on CV that contains Stata code of PSM DID and IPW DID.

I think compositional differences between treatment and control are OK, as long as the effect is time-invariant (conditional on controls).

Hi @Dimitriy V. Masterov I am very lucky to find your post.

My question is, since your code for the PSM and IPW weighted DID is in a setting of classical DID, can the same weighting be used in generalized DID (as Clyde Schechter described here : https://www.statalist.org/forums/for...93#post1575193)?

Many thanks!

Leon
Comment
Dimitriy V. Masterov

Join Date: Mar 2014

Posts: 609
#6

13 May 2024, 08:38

I suspect the answer is no because now you must have many PSM models since treatment can happen at different times. I have not seen my approach extended to staggered treatments, though I have not kept up with the literature so that it may now exist.
1 like
Comment
leon xf

Join Date: Oct 2017

Posts: 10
#7

13 May 2024, 08:45

Originally posted by Dimitriy V. Masterov View Post

I suspect the answer is no because now you must have many PSM models since treatment can happen at different times. I have not seen my approach extended to staggered treatments, though I have not kept up with the literature so that it may now exist.

Thank you for your answer!

Thus, in the generalized DiD (aka Two-way Fixed Effects DiD), one should just run the regressions without any weighting, correct?
Comment

Announcement