How to group IDs in Stata

Adeola Akintola

Join Date: Jun 2019

Posts: 8
#1

How to group IDs in Stata

16 Jun 2022, 04:34

Hello.
I need to group the cases into two by the IDs using the last character (N or P).
How do I do this in stata.
A sample of the study IDs is listed below.
Thank you.

[CODE]
* Example generated by -dataex-. For more info, type help dataex
clear
Sty_id
"ADH001N"
"ADH001P"
"ADH002N"
"ADH002P"
Tags: None
Øyvind Snilsberg

Join Date: Oct 2021

Posts: 591
#2

16 Jun 2022, 05:29

Code:

gen wanted = regex(Sty_id,"N$")
1 like
Comment
Andrew Musau

Join Date: Oct 2014

Posts: 10482
#3

16 Jun 2022, 05:32

You appear to have edited the dataex output. You should post the exact output as whether the variable Sty_id is a string variable or a numerical variable with value labels is crucial.

Code:

gen which= substr(Sty_id, -1, 1) sort which l, sepby(which)
1 like
Comment
Rich Goldstein

Join Date: Mar 2014

Posts: 4547
#4

16 Jun 2022, 05:34

and here is a solution that does not use regular expressions:

Code:

gen byte wanted = inlist(substr(Sty_id,-1,1),"N","P")

note that since you butchered your -dataex- example (please don't do that), this is not tested
1 like
Comment
Adeola Akintola

Join Date: Jun 2019

Posts: 8
#5

16 Jun 2022, 07:13

The codes worked. Thank to everyone for your assistance. I have also noted the point raised by Andrew Musau and Rich Goldstein on edited output. I need your expert advise on how to improve my coding in stata. Thank you.
Comment

Announcement

How to group IDs in Stata

Comment

Comment

Comment

Comment