Dear Statalist community,
I've been a long-time reader and for the first time saw the need to create a profile.
I'm relatively new to the Kappa statistic in STATA. I've read the relevant documentation and practiced with it, and I hope you can help me with the following.
Please note: I've simplified the variable names and values to keep the explanation of the problem as simple as possible. I use the car metaphor since this is a commonly used database in STATA:
For a study we asked 101 participants whether characteristics of a car would make ik less likely (score = -1) or more likely (score = +1) for them to buy a car.
If the characteristic had no influence on their decision to buy this car, they were assigned the score '0'.
The data looks like this:

v1 = observer 1 and this continues all the way to v101.
Now when I run .kap v1 - v101 the outcome that I receive is:

My first question:
The way I interpret this is that these 101 observers have 1.3% agreement in what factors they consider to make it less likely to buy a car, 23% regarding factors that they don't care about when purchasing, and 27% when it comes to factors that they think make a car more attractive to buy (all low agreement).
However, looking at all the factors together, they seem to agree in 24% of the cases.
Am I interpreting this correct? I don't know why but this just seems confusing to me.
My second question:
In order to see if I understood this analysis, I dropped all variables and just look at the car being red.
When I run the .kap v1 - v101 analysis I get:

However, when I tried 3 consecutive other variables, I get the exact same values.
How so?
Thank you in advance for your time and thanks for being such a great community.
Best,
Levent
I've been a long-time reader and for the first time saw the need to create a profile.
I'm relatively new to the Kappa statistic in STATA. I've read the relevant documentation and practiced with it, and I hope you can help me with the following.
Please note: I've simplified the variable names and values to keep the explanation of the problem as simple as possible. I use the car metaphor since this is a commonly used database in STATA:
For a study we asked 101 participants whether characteristics of a car would make ik less likely (score = -1) or more likely (score = +1) for them to buy a car.
If the characteristic had no influence on their decision to buy this car, they were assigned the score '0'.
The data looks like this:

v1 = observer 1 and this continues all the way to v101.
Now when I run .kap v1 - v101 the outcome that I receive is:

My first question:
The way I interpret this is that these 101 observers have 1.3% agreement in what factors they consider to make it less likely to buy a car, 23% regarding factors that they don't care about when purchasing, and 27% when it comes to factors that they think make a car more attractive to buy (all low agreement).
However, looking at all the factors together, they seem to agree in 24% of the cases.
Am I interpreting this correct? I don't know why but this just seems confusing to me.
My second question:
In order to see if I understood this analysis, I dropped all variables and just look at the car being red.
When I run the .kap v1 - v101 analysis I get:

However, when I tried 3 consecutive other variables, I get the exact same values.
How so?
Thank you in advance for your time and thanks for being such a great community.
Best,
Levent
Comment