How to eliminate male-only households from individual-level data

Julia Beach

Join Date: Apr 2019

Posts: 6
#1

How to eliminate male-only households from individual-level data

14 Apr 2019, 08:35

Dear Stata Users,

I am new to Stata.

I have a Census dataset on individuals, that are identified per household numbers. To reduce the size of my dataset, I want to drop all male-only households.

I tried something like drop per hh_id if sex == 1, but obviously this doesnt work.
Any hint on where I could find the right command?
Thank you very much in advance.
Tags: None
Rich Goldstein

Join Date: Mar 2014

Posts: 4493
#2

14 Apr 2019, 08:53

it is probably possible to do this in fewer steps, but I prefer something like the following:

Code:

egen countm=count(1) if sex==1, by(hh_id) egen counth=count(1), by(hh_id) drop if countm==counth

you may then want to

Code:

drop countm counth

be sure to save the file under a new name
Comment
Nick Cox

Join Date: Mar 2014

Posts: 35783
#3

14 Apr 2019, 09:17

Code:

bysort hhid (sex) : drop if sex[1] == 1 & sex[_N] == 1

See also Stata data management FAQs.
1 like
Comment
Sonnen Blume

Join Date: Aug 2018

Posts: 342
#4

15 Apr 2019, 16:01

Thanks for the code. Could you please tell a bit what is the function of: [_N]
Comment
Nick Cox

Join Date: Mar 2014

Posts: 35783
#5

15 Apr 2019, 17:47

The allusion in #3 was to https://www.stata.com/support/faqs/d...ions-in-group/

_N indexes the last observation -- and is thus also the number of observations -- or vice versa if you prefer. Under by: the meaning is changed to the last observation in each group defined by the varlist given to by:.
1 like
Comment
Sonnen Blume

Join Date: Aug 2018

Posts: 342
#6

16 Apr 2019, 21:24

Originally posted by Nick Cox View Post

The allusion in #3 was to https://www.stata.com/support/faqs/d...ions-in-group/

_N indexes the last observation -- and is thus also the number of observations -- or vice versa if you prefer. Under by: the meaning is changed to the last observation in each group defined by the varlist given to by:.

That's really helpful. Thank you.
Comment

Announcement

How to eliminate male-only households from individual-level data

Comment

Comment

Comment

Comment

Comment