Bootstraping the CPS from IPUMS using bsample

Arnau Valladares-Esteban

Join Date: Nov 2019

Posts: 15
#1

Bootstraping the CPS from IPUMS using bsample

06 Oct 2021, 05:36

Hello,

I'm working with the basic CPS as provided by IPUMS.

I need to bootstrap the data to compute standard errors. As far as I understand, I can do it using bsample. I need to let bsample know that the CPS is a monthly survey. However, it's not clear to me if I need to do that by using the option strata() or cluster(). Any suggestion?

Also, I guess I need to use bsample with the option weights(). Does that mean that I do not need to use weights for my computations with the bootstrapped samples?

Any help is very much appreciated.

Thanks!

Last edited by Arnau Valladares-Esteban; 06 Oct 2021, 05:39.
Tags: None
Felix Bittmann

Join Date: Aug 2018

Posts: 691
#2

06 Oct 2021, 05:39

I do not know the CPS data but you probably do not want to use bsample, unless you program something. I would start from your analysis part, what model do you intend to estimate in the end?

Best wishes

Stata 18.0 MP | ORCID | Google Scholar
Comment
Arnau Valladares-Esteban

Join Date: Nov 2019

Posts: 15
#3

06 Oct 2021, 06:04

That's irrelevant. I know I have to bootstrap.
Comment
Felix Bittmann

Join Date: Aug 2018

Posts: 691
#4

06 Oct 2021, 06:07

In Stata you usually want to bootstrap directly in the analysis command. For example, when your model is OLS you would use:

Code:

regress y x1 x2 x3, vce(bootstrap)

Best wishes

Stata 18.0 MP | ORCID | Google Scholar
Comment
Arnau Valladares-Esteban

Join Date: Nov 2019

Posts: 15
#5

06 Oct 2021, 06:09

I'm not running a regression after bootstrapping. I do something else that would be too long to explain.
Comment
Weiwen Ng

Join Date: Jun 2015

Posts: 1241
#6

06 Oct 2021, 06:12

Many large public surveys need to be svyset, specifying a probability weight and a stratum variable at minimum. For the US Current Population Survey., it’s a bit more complicated. The suggested command appears to be:

Code:

svyset [iw=wtsupp], jkrweight(repwtp1-repwtp160, multiplier(.025)) /// vce(jackknife) mse

I offer this in case it helps someone answer the OP’s question.

Be aware that it can be very hard to answer a question without sample data. You can use the dataex command for this. Type help dataex at the command line.

When presenting code or results, please use the code delimiters format them. Use the # button on the formatting toolbar, between the " (double quote) and <> buttons.
1 like
Comment
Felix Bittmann

Join Date: Aug 2018

Posts: 691
#7

06 Oct 2021, 06:20

If the CPS is a panel study you want to sample entire histories of individuals, and here you need the cluster option, like:

Code:

bsample, cluster(person_id) idcluster(newid)

The important part is that you use the newly generated variable newid as the id for the following analyses to not count the observations incorrectly. Regarding the weight question, you do not need this if you are working with sampling weights.

Best wishes

Stata 18.0 MP | ORCID | Google Scholar
1 like
Comment

Announcement

Bootstraping the CPS from IPUMS using bsample

Comment

Comment

Comment

Comment

Comment

Comment