Hello all,
Apologies if this is a very basic query; I am very new to Stata.
I have collected my data and am trying to work out how to turn a multiple-response variable into something I can analyse.
In the raw download, each respondent's selections are stored together in a single variable, lake_activity (first table below). The second table below shows the same variable with labels.
Using odkmeta I managed to separate these responses into 4 variables (third table below).
However, I am now at a loss as to how to analyse this meaningfully, i.e. I would like to know how many people bathe in the lake (4 in this example).
Raw download:
| Study ID | lake_activity |
| 1 | 1 2 3 4 |
| 2 | 3 2 1 |
| 3 | 2 3 |
| 4 | 3 2 |

Labelled:
| Study ID | lake_activity |
| 1 | Swimming (1) Bathing (2) Fishing (3) Washing clothes (4) |
| 2 | Fishing (3) Bathing (2) Swimming (1) |
| 3 | Bathing (2) Fishing (3) |
| 4 | Fishing (3) Bathing (2) |

After splitting with odkmeta:
| Study ID | lake_activity1 | lake_activity2 | lake_activity3 | lake_activity4 |
| 1 | Swimming (1) | Bathing (2) | Fishing (3) | Washing clothes (4) |
| 2 | Fishing (3) | Bathing (2) | Swimming (1) | |
| 3 | Bathing (2) | Fishing (3) | | |
| 4 | Fishing (3) | Bathing (2) | | |

I have tried to see whether I can sort the codes within each response in the original variable, e.g.:
1. 1 2 3 4
2. 1 2 3
3. 2 3
4. 2 3
That way I could at least relabel 2 3 as "Bathing + Fishing", so that IDs 3 and 4 have the same value.
I have also tried using logistic regression to generate dummy variables, but this produced 56 new variables. As above, the responses for IDs 3 and 4 are actually the same; they are just ordered differently, so they appear different to Stata.
Does anyone have any ideas?
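
One possible approach, sketched here rather than offered as a definitive answer, is to build one 0/1 indicator per activity code and then rebuild the combination in a fixed order. The sketch assumes lake_activity is a string variable holding space-separated codes 1 to 4, as in the first table; the names study_id, act1 to act4 and combo are hypothetical and used only for illustration.

```stata
* Toy data mirroring the example (assumption: lake_activity is a string
* variable of space-separated codes).
clear
input byte study_id str10 lake_activity
1 "1 2 3 4"
2 "3 2 1"
3 "2 3"
4 "3 2"
end

* One 0/1 indicator per activity code. Padding both strings with spaces
* matches whole codes only, so code 1 would not match inside a code like 10.
forvalues k = 1/4 {
    generate byte act`k' = strpos(" " + lake_activity + " ", " `k' ") > 0
}
label variable act1 "Swimming"
label variable act2 "Bathing"
label variable act3 "Fishing"
label variable act4 "Washing clothes"

* Frequency of a single activity, e.g. how many people bathe in the lake.
tabulate act2
count if act2 == 1

* Order-independent combination: rebuild the codes in ascending order,
* so that "2 3" and "3 2" both become "2 3".
generate combo = ""
forvalues k = 1/4 {
    replace combo = trim(combo + " `k'") if act`k' == 1
}
tabulate combo
```

With the four example rows, count if act2 == 1 should report 4, and IDs 3 and 4 should share the combo value "2 3". If only the split variables are available and they are numeric with value labels (an assumption about what odkmeta produced), something like egen byte act2 = anymatch(lake_activity1-lake_activity4), values(2) may give the same indicator.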
