Hi all,
I have the following data and problem:
Data set: I have a sample of firms that experienced a certain event in one or more years (treatment group) as well as a sample of firms as control group. I have panel data of these firms for the years 2000-2015 as well as an additional data set which includes a dummy variable which can change throughout the year (e.g. for control firm C the variable has the value 0 for the period 01/01/2000 until 03/04/2005 and 1 for the period 04/04/2005 until 31/12/2015).
Problem: I now want to do two things:
1) Match the firms of the treatment group with companies from the control group based on the propensity score for the respective years (e.g. treatment firm A from the treatment group which experienced the event in 2005 should be matched with a firm from the control group from the year 2005). I would do this similar to the way described here: http://www.stata.com/statalist/archi.../msg00073.html.
2) As a next step and based on the matching, I want to add the data from the additional data set to the matched firms (e.g. treatment firm A experienced the event on 06/06/2005 and has been matched with control firm C and I would like to add the status at that time for C, i.e. the value 1 for the dummy variable). Any recommendation on how this could be done the best way?
Thanks in advance,
Felix
Illustrative data overview:
Starting point:
1.
2.
Desired result:
I have the following data and problem:
Data set: I have a sample of firms that experienced a certain event in one or more years (treatment group) as well as a sample of firms as control group. I have panel data of these firms for the years 2000-2015 as well as an additional data set which includes a dummy variable which can change throughout the year (e.g. for control firm C the variable has the value 0 for the period 01/01/2000 until 03/04/2005 and 1 for the period 04/04/2005 until 31/12/2015).
Problem: I now want to do two things:
1) Match the firms of the treatment group with companies from the control group based on the propensity score for the respective years (e.g. treatment firm A from the treatment group which experienced the event in 2005 should be matched with a firm from the control group from the year 2005). I would do this similar to the way described here: http://www.stata.com/statalist/archi.../msg00073.html.
2) As a next step and based on the matching, I want to add the data from the additional data set to the matched firms (e.g. treatment firm A experienced the event on 06/06/2005 and has been matched with control firm C and I would like to add the status at that time for C, i.e. the value 1 for the dummy variable). Any recommendation on how this could be done the best way?
Thanks in advance,
Felix
Illustrative data overview:
Starting point:
1.
year | firm_id | treatment | event_date |
2005 | 10001 | 1 | 06062005 |
... | ... | ... | ... |
2005 | 10002 | 0 | . |
firm_id | start_date | end_date | status |
10001 | 01012000 | 31122015 | 0 |
10002 | 01012000 | 03042005 | 0 |
10002 | 04042005 | 31122015 | 1 |
year | firm_id | treatment | event_date | ... | status |
2005 | 10001 | 1 | 06062005 | ... | 0 |
... | ... | ... | ... | ... | ... |
2005 | 10002 | 0 | . | ... | 1 |
Comment