Dear All,
I have a dataset containing data from people in many regions and I want to calculate age-standardized rates using a standard population as reference.
I need help with two things. The first is deriving rates when each line of data refers to just one person rather than an age-group. The second is inputting the standard population of my choice.
The data looks like this:
ID age-group_per region age-group_pop dead
6 5-9 a 2000 0
7 10-14 b 3000 1
9 20-24 a 1000 0
10 30-34 b 6000 1
11 10-14 c 6000 0
12 10-14 b 3000 0
13 5-9 a 2000 0
14 20-24 c 1000 1
15 20-24 b 2000 1
age-group_per is the age-group each individual falls into
region is the region where each individual lives
age-group_pop is the population of that age-group in that region
dead indicates whether the individual died (1) or not (0)
To use the standardization commands, I would expect the total number of people who died in each age-group in each region to be already calculated as a separate variable (died). In that case it would be straightforward to get a rate, e.g.
age-group region age-group_pop died
5-9 a 2000 3
20-24 a 1000 2
10-14 b 3000 2
20-24 b 2000 3
10-14 c 1000 2
20-24 c 1000 2
but it is not so in this case.
How do you suggest the data be reformatted so that calculating a rate (died/age-group_pop) is possible and standardization is easy?
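To make the target layout concrete, here is a minimal sketch of the reshaping in Python, using only the standard library rather than any particular package's standardization command. It collapses the one-row-per-person records from the first example into one row per region and age-group, summing dead into died. The function name `aggregate` and the tuple layout are assumptions made for illustration, and the sketch assumes age-group_pop is constant within each region and age-group.

```python
from collections import defaultdict

# Individual-level records copied from the first example:
# (ID, age_group_per, region, age_group_pop, dead)
records = [
    (6, "5-9", "a", 2000, 0),
    (7, "10-14", "b", 3000, 1),
    (9, "20-24", "a", 1000, 0),
    (10, "30-34", "b", 6000, 1),
    (11, "10-14", "c", 6000, 0),
    (12, "10-14", "b", 3000, 0),
    (13, "5-9", "a", 2000, 0),
    (14, "20-24", "c", 1000, 1),
    (15, "20-24", "b", 2000, 1),
]


def aggregate(records):
    """Collapse one-row-per-person data to one row per (region, age-group)."""
    deaths = defaultdict(int)
    pops = {}
    for _id, age, region, pop, dead in records:
        key = (region, age)
        deaths[key] += dead  # count the deaths in this stratum
        pops[key] = pop      # assumed constant within a stratum
    return {
        key: {"pop": pops[key], "died": deaths[key], "rate": deaths[key] / pops[key]}
        for key in pops
    }


strata = aggregate(records)
for (region, age), row in sorted(strata.items()):
    print(region, age, row["pop"], row["died"], f"{row['rate']:.5f}")
```

A region and age-group combination with no rows in the individual-level data will not appear in the result, so such strata would need to be added separately with died set to 0.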
An off-shoot question is how do I incorporate the "standard population" if it is not one of the populations being examined?
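For reference, direct standardization with an external standard population amounts to weighting each age-specific rate by that age-group's share of the standard population. The sketch below shows the arithmetic in Python, under assumptions: the region b counts come from the second example above, the standard population figures are hypothetical numbers invented purely for illustration, and the function name `direct_standardized_rate` is not taken from any package.

```python
def direct_standardized_rate(strata, standard_pop):
    """Directly age-standardized rate for one region.

    strata: {age_group: (died, population)} for the region.
    standard_pop: {age_group: standard population count}.
    """
    # The region and the standard must be split into the same age-groups.
    if set(strata) != set(standard_pop):
        raise ValueError("age groups must match between region and standard")
    total_std = sum(standard_pop.values())
    # Weighted sum of age-specific rates; weights are standard population shares.
    return sum(
        (died / pop) * (standard_pop[age] / total_std)
        for age, (died, pop) in strata.items()
    )


# Region b from the aggregated example: age-group -> (died, age-group_pop)
region_b = {"10-14": (2, 3000), "20-24": (3, 2000)}

# HYPOTHETICAL standard population, not taken from any published standard
standard_pop = {"10-14": 60000, "20-24": 40000}

rate_b = direct_standardized_rate(region_b, standard_pop)
print(f"Region b: {rate_b * 100000:.1f} deaths per 100,000")
```

Because the standard population enters only through the weights, it does not need to be one of the populations being examined. It only needs to use the same age-group boundaries as the regional data.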
I would appreciate comments.
Thanks a lot.
Bode.