Hello statalist,
I have a group of treatment firms and a group of non-treatment firms. I want to match each treatment firm to 1 non-treatment firm. As this is a step i want to do before a diff-in-diff model, I'm not interested in using a teffects approach.
For the variables country_iso (country of operation), bvd_sector (sector of operation) i want to find exact matches. For the variable last_avail_year (last year of available data) I'd like to allow +-1 year for matches.
When the treatment firms are matched to potential control firms on abovestanding criterias, i want to find the nearest neighbors on the following variables: incorporation_year (year firm was incorporated), revenue_last (last available revenue data) and pre2005_appln (patent applications before 2005).
Treatment firms have ETS == 1, and non-treatment have ETS == 0.
To be able to evaluate the quality of the matches, I would like to generate some variable calculating the percentage difference on the variables used for nearest neighbour matching.
After having found the control firms, I'd like to remove the non-treated firms that are not matched.
How should I go about this?
Kind regards, Lorens
I have a group of treatment firms and a group of non-treatment firms. I want to match each treatment firm to 1 non-treatment firm. As this is a step i want to do before a diff-in-diff model, I'm not interested in using a teffects approach.
For the variables country_iso (country of operation), bvd_sector (sector of operation) i want to find exact matches. For the variable last_avail_year (last year of available data) I'd like to allow +-1 year for matches.
When the treatment firms are matched to potential control firms on abovestanding criterias, i want to find the nearest neighbors on the following variables: incorporation_year (year firm was incorporated), revenue_last (last available revenue data) and pre2005_appln (patent applications before 2005).
Treatment firms have ETS == 1, and non-treatment have ETS == 0.
To be able to evaluate the quality of the matches, I would like to generate some variable calculating the percentage difference on the variables used for nearest neighbour matching.
After having found the control firms, I'd like to remove the non-treated firms that are not matched.
How should I go about this?
Kind regards, Lorens
Comment