I have a survey with 258 observations.
Survey questions are all categorical (None/Some/Most/All and Yes/No)
I am trying to see which questions fall into factors/scales.
Because my data is categorical I ran
1. Reliability tests (alpha - inter-item reliability)
2. Polychoric correlations to create a matrix to plug-into my exploratory factor analysis
polychoric `my vars'
display r(sum_w)
global N = r(sum_w)
matrix r = r(R)
factormat r, n(213) factors(3) *note the n is lower hear because of missingness
3. Once I do this, I am stuck with how to run a CFA...
Because I am running an analysis on categorical variables, I want to use sem with the adf method, but there are few things happening with my output that I don't fully understand.
a. When I run sem r, method(adf) - which is running my factor analysis on the polychoric matrix I saved above - Stata is unable to come to convergence. BUT when I run sem `my vars',
method (adf) it gives me an output. Why would the latter method give me an output? And is it reliable? I am assuming because the adf method deals with categorical variables, I can
use the output from the sem `my vars', method(adf) and take it as reliable. I just wanted to check and try to understand why the first one doesn't work. I feel like it has something to do with Stata not understanding how to interpret a matrix with the sem funcation.
b. When I run sem `my vars', method(adf) the coefficients are all between 0.9 and 2. These factor loadings are very different from the one I received in Step 2 (which range between 0.4 and
0.8). I feel as though this is because of my small sample size. BUT when I run sem `my vars', stand I get comparable factor loadings to Step 2. I think those are incorrect because they
aren't accounting for the categorical nature of my data, but I don't know how to interpret the super high coefficients from the adf command.
Any advice on the issues in 3a and 3b would be greatly appreciated!
Survey questions are all categorical (None/Some/Most/All and Yes/No)
I am trying to see which questions fall into factors/scales.
Because my data is categorical I ran
1. Reliability tests (alpha - inter-item reliability)
2. Polychoric correlations to create a matrix to plug-into my exploratory factor analysis
polychoric `my vars'
display r(sum_w)
global N = r(sum_w)
matrix r = r(R)
factormat r, n(213) factors(3) *note the n is lower hear because of missingness
3. Once I do this, I am stuck with how to run a CFA...
Because I am running an analysis on categorical variables, I want to use sem with the adf method, but there are few things happening with my output that I don't fully understand.
a. When I run sem r, method(adf) - which is running my factor analysis on the polychoric matrix I saved above - Stata is unable to come to convergence. BUT when I run sem `my vars',
method (adf) it gives me an output. Why would the latter method give me an output? And is it reliable? I am assuming because the adf method deals with categorical variables, I can
use the output from the sem `my vars', method(adf) and take it as reliable. I just wanted to check and try to understand why the first one doesn't work. I feel like it has something to do with Stata not understanding how to interpret a matrix with the sem funcation.
b. When I run sem `my vars', method(adf) the coefficients are all between 0.9 and 2. These factor loadings are very different from the one I received in Step 2 (which range between 0.4 and
0.8). I feel as though this is because of my small sample size. BUT when I run sem `my vars', stand I get comparable factor loadings to Step 2. I think those are incorrect because they
aren't accounting for the categorical nature of my data, but I don't know how to interpret the super high coefficients from the adf command.
Any advice on the issues in 3a and 3b would be greatly appreciated!
Comment