Hello, I am a beginner with Stata and look forward to your guidance.
I want to create a new variable, baseline_age, in my dataset.
I need a baseline age for every individual in the dataset.
However, some IDs have no age recorded at visit_number 1.
Individuals with no age at visit_number 1 do have an age recorded the following year, so for those IDs I want to compute baseline_age by subtracting 1 from the next year's age.
For example, IDs 1 and 2 have age data at visit_number 1 and also at the following year's visit, visit_number 2.
However, IDs 3 and 4 have no age data at visit_number 1; they only have age data for the following year, visit_number 2.
My dataset looks like this:
Code:
ID visit_number age
1  1            48
1  2            49
2  1            25
2  2            26
3  1            .
3  2            29
4  1            .
4  2            35
This is the code I tried:
Code:
generate age1 = age if visit_number == 1
generate age2 = age if visit_number == 2
generate age2_1 = age2 - 1
generate baseline_age = age1
replace baseline_age = age2_1 if age1 == .
Code:
ID visit_number age age1 age2 age2_1 baseline_age
1  1            48  48   .    .      48
1  2            49  .    49   48     48
2  1            25  25   .    .      25
2  2            26  .    26   25     25
3  1            .   .    .    .      .
3  2            29  .    29   28     28
4  1            .   .    .    .      .
4  2            35  .    35   34     34
However, in this modified dataset the visit_number 2 rows of IDs 1 and 2 have the same structure as the visit_number 2 rows of IDs 3 and 4.
I would like to know how to pick up the value of age from a different row of the same ID.
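To make the goal concrete, here is a minimal sketch of one possible way to do it, not necessarily the best one. It assumes each ID has at most one row per visit_number. The helper variables age_v1 and age_v2 are names made up for this illustration. The idea is to copy the visit 1 age and the visit 2 age onto every row of the same ID with egen, and then fall back to the visit 2 age minus 1 only when the visit 1 age is missing.

Code:
```stata
clear
input ID visit_number age
1 1 48
1 2 49
2 1 25
2 2 26
3 1 .
3 2 29
4 1 .
4 2 35
end

* age_v1 and age_v2 are hypothetical helper names.
* egen's max() ignores missing values, so each helper holds the age
* recorded at that visit on every row of the ID, or missing if none exists.
bysort ID: egen age_v1 = max(cond(visit_number == 1, age, .))
bysort ID: egen age_v2 = max(cond(visit_number == 2, age, .))

* Use the visit 1 age when it exists; otherwise use visit 2 age minus 1.
generate baseline_age = cond(missing(age_v1), age_v2 - 1, age_v1)

sort ID visit_number
list ID visit_number age baseline_age, sepby(ID)
```

With the example data, this should give baseline_age of 48 and 25 on both rows of IDs 1 and 2, and 28 and 34 on both rows of IDs 3 and 4, so the visit_number 1 rows of IDs 3 and 4 are no longer missing.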
Thank you for your understanding.