"randomselect" in a program used in a simulation

Mayra Pineda

Join Date: Jan 2019

Posts: 13
#1

"randomselect" in a program used in a simulation

29 Jan 2019, 22:27

Hello!

I am working on a program that aims to make randomization inference of four coefficients from a regression. My dataset is a balanced panel that contains information for 13 states and seven years. I have two "treatment groups." The first one is represented by a dummy variable that takes the value of 1 for state i and 0 otherwise and the second variable takes the value of 1 for two other states, different from i. My treatment effects come from the interaction of these two variables with year dummy variables for 2015 and 2016. The four coefficients come from these interactions.

I am estimating the following equation:

yst = state_s + year_t + beta1*treat1*2015_st + beta2*treat1*2016_st+ bet3*treat2*2015_st+beta4*treat2*2016_st + u_st

The purpose of my program is to estimate this equation several times, such that I observe different values for the coefficients beta_1, beta_2, beta_3, and beta_4. The differences in these coefficients will come from randomly assign the states to treatments 1 and 2. However, I am obtaining the results of one single replication, and then the simulation stops after displaying in red font the error .xxxxxx. I would appreciate if someone can explain to me why I cannot obtain more than one set of estimators. Is it because I am using the command "randomselect" to generate the treatment groups?

My code is as follows:

Code:

gen y15 = 0 replace y15 = 1 if year == 2015 gen y16 = 0 replace y16 = 1 if year == 2016 capture prog drop example program example, rclass *Treatment 1 randomselect if state_fips!=0, gen(treat1) select(state) n(1) /*I am not considering a state for the randomization*/ *Randomly select two bordering states randomselect if (treat1!=1 & state_fips!=0), gen(treat2) select(state) n(2) replace treat2 = 0 if treat1 == 1 *Generate treatment variables gen treat1_15 = treat1*y15 gen treat1_16 = treat1*y16 gen treat2_15 = treat2*y15 gen treat2_16 = treat2*y16 drop treat1 treat2 *Estimation xtreg y i.year treat1_15 treat1_16 treat2_15 treat2_16 [weight=total], fe vce(cluster state_fips) capture return scalar t1_15 = _b[treat1_15] capture return scalar t1_16 = _b[treat1_16] capture return scalar t2_15 = _b[treat2_15] capture return scalar t2_16 = _b[treat2_ 16] end simulate t1_15 = r(t1_15) t1_16 = r(t1_16) t2_15 = r(t2_15) t2_16 = r(t2_16), saving("coefficients.dta", replace) reps(100): example

Thank you!
Tags: None
Sergiy Radyakin

Join Date: Apr 2014

Posts: 1867
#2

29 Jan 2019, 23:59

Mayra, I can't run your code because it relies on the data you don't show, and you don't even show the error code that you are getting (xxxxxx).

My guess is that if you wish to replicate something, each replication must be neutral, so that if it generates something in the data it must dispose of it at the end of the iteration.
So if you generated for example treat2_15, drop it at the end of your estimation code, otherwise the next iteration will stumble upon the variable with the same name already existing in the data.

Best, Sergiy
Comment
Mayra Pineda

Join Date: Jan 2019

Posts: 13
#3

30 Jan 2019, 05:34

Hello Sergiy,

I apologize for not sharing all the information required to ask a question. I should've been more careful when reading the instructions for sharing a post. In any case, you were right about eliminating all the variables. I appreciate your help!
Comment
Jesse Wursten

Join Date: Jan 2016

Posts: 915
#4

30 Jan 2019, 05:52

I didn't read your post in detail, but isn't this essentially a specific case of bootstrapping?
1 like
Comment
Mike Lacy

Join Date: Apr 2014

Posts: 2416
#5

30 Jan 2019, 08:53

If you want to do "randomization inference," I'd advise you to try to use one of the built-in programs in Stata that are designed to do this: -bootstrap- and -permute-. Also, there are some bootstrapping options designed specifically for -xt- commands. See -help xt_vce_options-. However, I am not knowledgeable about using randomization methods in an -xt- context.

I doubt if -randomselect- is going to be useful for you here. Even if that kind of sampling command were relevant, I think you'd find that the user-written -gsample- command is a more contemporary and likely more robust choice.
1 like
Comment
Mayra Pineda

Join Date: Jan 2019

Posts: 13
#6

30 Jan 2019, 15:42

Originally posted by Mike Lacy View Post

If you want to do "randomization inference," I'd advise you to try to use one of the built-in programs in Stata that are designed to do this: -bootstrap- and -permute-. Also, there are some bootstrapping options designed specifically for -xt- commands. See -help xt_vce_options-. However, I am not knowledgeable about using randomization methods in an -xt- context.

I doubt if -randomselect- is going to be useful for you here. Even if that kind of sampling command were relevant, I think you'd find that the user-written -gsample- command is a more contemporary and likely more robust choice.

Thank you for your answer, Mike. I appreciate the suggestion of using bootstrap or permute. I will try assign treament using each one of these commands and see how much the distributions of the coefficients differ.
Comment
Mayra Pineda

Join Date: Jan 2019

Posts: 13
#7

30 Jan 2019, 15:46

Originally posted by Jesse Wursten View Post

I didn't read your post in detail, but isn't this essentially a specific case of bootstrapping?

Yes, it is. Although I didn´t know how to randomly assign treatment and generate indicator variables based on this assignment using the boostrap command. That is why I used "randomselect."
Comment

Announcement

"randomselect" in a program used in a simulation

Comment

Comment

Comment

Comment

Comment

Comment