Hello,
I am setting up a difference-in-differences regression and would like some feedback on how to specify my variables appropriately given the research setting.
I would like to study the effect of a policy that was introduced in all states in 1973; five states, however, had already introduced it in 1970. Would this be considered a staggered difference-in-differences setting? How should I define my treatment variable? Should I set treatment = 1 for the five early states from 1970 onward and for all remaining states from 1973 onward? Alternatively, can I create a variable called early_state that equals 1 for the five states that legalized early, in 1970?
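To make the two options concrete, here is a minimal sketch of how I imagine coding them. It is in Python only for illustration; the state codes, the year range, and the column names (early_state, treated, event_time) are all hypothetical placeholders, not my actual data.

```python
# Hypothetical state codes: S1-S5 stand in for the five early adopters (1970),
# S6-S10 for states that only adopted with everyone else in 1973.
EARLY_STATES = {"S1", "S2", "S3", "S4", "S5"}
ALL_STATES = [f"S{i}" for i in range(1, 11)]
YEARS = range(1967, 1976)  # assumed sample window, for illustration only


def adoption_year(state):
    """Year the policy takes effect in a given state."""
    return 1970 if state in EARLY_STATES else 1973


def build_panel(states=ALL_STATES, years=YEARS):
    """Return one row per state-year with both candidate treatment codings."""
    rows = []
    for state in states:
        for year in years:
            rows.append({
                "state": state,
                "year": year,
                # Option B: time-invariant group indicator for early adopters.
                "early_state": int(state in EARLY_STATES),
                # Option A: staggered indicator, switches on at adoption.
                "treated": int(year >= adoption_year(state)),
                # Years relative to the state's own adoption date.
                "event_time": year - adoption_year(state),
            })
    return rows


panel = build_panel()
```

Under this coding, treated equals 1 for every state from 1973 onward, so there is no untreated comparison group after 1972; the only window in which treated varies across states is 1970 to 1972.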
In either case, how should I go about testing for pre-trends? My understanding is that I need to regress my outcome on interactions between the treatment indicator and the post-period year dummies (excluding the year before treatment), with state and year fixed effects. Is this a valid approach?
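As a sanity check on what such a regression would measure, here is a small Python sketch using made-up numbers. It computes, for each year, the gap in mean outcomes between early and late states relative to the gap in 1969. As far as I understand, in a balanced panel these numbers coincide with the coefficients on early-state-by-year interactions in a regression with state and year fixed effects and 1969 omitted as the reference year. The function name, the toy outcome, and the effect size of 2 are assumptions for illustration.

```python
from collections import defaultdict


def event_study_gaps(rows, ref_year=1969):
    """rows: iterable of (state, year, early_state, outcome) tuples.

    Returns {year: early-minus-late mean gap, relative to the gap in ref_year}.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for _state, year, early, outcome in rows:
        sums[(year, early)] += outcome
        counts[(year, early)] += 1

    def gap(year):
        early_mean = sums[(year, 1)] / counts[(year, 1)]
        late_mean = sums[(year, 0)] / counts[(year, 0)]
        return early_mean - late_mean

    years = sorted({year for year, _early in sums})
    base = gap(ref_year)
    return {year: gap(year) - base for year in years}


# Toy data (entirely made up): common linear trend, a level difference
# between groups, and an effect of 2 that starts at each group's adoption.
toy_rows = []
for i in range(1, 11):
    early = int(i <= 5)
    start = 1970 if early else 1973
    for year in range(1967, 1976):
        outcome = (10.0 if early else 8.0) + 0.5 * (year - 1967)
        if year >= start:
            outcome += 2.0
        toy_rows.append((f"S{i}", year, early, outcome))

gaps = event_study_gaps(toy_rows)
```

In this sketch the entries of gaps for 1967 and 1968 are the ones that speak to pre-trends, which makes me wonder whether the interactions should cover the pre-period years as well as the post-period ones. The entries for 1970 to 1972 compare treated early states with not-yet-treated late states, and from 1973 onward both groups are treated.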
Many thanks for your help in advance!