Now I feel I need to clarify my earlier post. There is indeed no question about missing values being coded as such. There is also no question that the median would in any case be appropriate for the data at hand while the mean is not, from a (mathematical) statistical point of view. So Clyde's approach is the bullet-proof one in that sense, and probably the one to follow here.
I just wanted to point out, that we are usually not so interested in the mathematical correctness for the sake of mathematical correctness. We are using statistical models to interpret the data we have seen in the light of substantive theory. In doing so we make lots of assumptions and treating an ordinal Likert-type variable as interval is, in my opinion, not a very critical one. This is to say, while Clyde is clearly correct, it might be a little bit to hard to state that the mean is meaningless for ordinal data. In my (admittedly limited both, in quantity and in scientific discipline) experience, using the mean or median tends to give the same substantial answer in situations as the one described here. If they do not, this might have other reasons than the measurement of the variables. That is not to say, it is always and automatically appropriate to treat such variables as interval data, but I have the feeling the circumstances in which it would not be a reasonable assumption are the exception rather than the rule. Anyway, Clyde is also correct in pointing out that this is an empirical question.
As a last comment, given the (complex) research question outlined above, I guess treating this variable as ordinal or (quasi) interval data, is probably not the decision that matters most for the validity of the answer.
Best
Daniel
I just wanted to point out, that we are usually not so interested in the mathematical correctness for the sake of mathematical correctness. We are using statistical models to interpret the data we have seen in the light of substantive theory. In doing so we make lots of assumptions and treating an ordinal Likert-type variable as interval is, in my opinion, not a very critical one. This is to say, while Clyde is clearly correct, it might be a little bit to hard to state that the mean is meaningless for ordinal data. In my (admittedly limited both, in quantity and in scientific discipline) experience, using the mean or median tends to give the same substantial answer in situations as the one described here. If they do not, this might have other reasons than the measurement of the variables. That is not to say, it is always and automatically appropriate to treat such variables as interval data, but I have the feeling the circumstances in which it would not be a reasonable assumption are the exception rather than the rule. Anyway, Clyde is also correct in pointing out that this is an empirical question.
As a last comment, given the (complex) research question outlined above, I guess treating this variable as ordinal or (quasi) interval data, is probably not the decision that matters most for the validity of the answer.
Best
Daniel
Comment