Good morning to everybody,
I am running a differences-in-differences to assess the impact of a policy on patenting. This policy targeted just specific economic sectors that are identifiable.
For this reason, I used targeted economic sectors as the treatment group and all the other sectors as the control group.
In this way, I obtained a panel with:
(1) 30K sectors (2.5k of them are treated)
(2) #patents per year per sector as dependent variable (i have 500K patents in total for the period 2006-2019)
(3) post_policy that is a dummy =1 if the year is post-policy
(4) treated_sector that is a dummy =1 if a sector is treated
Using the commands xtset... and then xtreg... I find that the results are not statistically significant.
For this reason, I would like to divide sectors into quartiles according to their performance (measured as patents) and then re-run the previously mentioned commands.
What should I do, considering that my data is highly skewed on the right since there are many zeros (i.e. no patents for a given sector for a given year)?
Does it make sense, otherwise, to remove outliers from the control groups?
Thanks in advance
I am running a differences-in-differences to assess the impact of a policy on patenting. This policy targeted just specific economic sectors that are identifiable.
For this reason, I used targeted economic sectors as the treatment group and all the other sectors as the control group.
In this way, I obtained a panel with:
(1) 30K sectors (2.5k of them are treated)
(2) #patents per year per sector as dependent variable (i have 500K patents in total for the period 2006-2019)
(3) post_policy that is a dummy =1 if the year is post-policy
(4) treated_sector that is a dummy =1 if a sector is treated
Using the commands xtset... and then xtreg... I find that the results are not statistically significant.
For this reason, I would like to divide sectors into quartiles according to their performance (measured as patents) and then re-run the previously mentioned commands.
What should I do, considering that my data is highly skewed on the right since there are many zeros (i.e. no patents for a given sector for a given year)?
Does it make sense, otherwise, to remove outliers from the control groups?
Thanks in advance

Comment