Hi everyone,
I am working on the validation of a latent measure with ordinal items (4-point Likert scales) and working with a very large sample size (400K). For this purpose, I am estimating gsem, ologit models and using the estat ic command to obtain the AIC and BIC statistics to compare competing factor structures. I've also estimated a polychoric correlation matrix and PCA. The problem is that factor loadings, internal consistency statistics (omega) and eigenvalues all strongly support a single factor model, but the BIC statistics are still significantly smaller for two-factor models. My question is, does anyone know if there are ways to correct for sample size when estimating the BIC or of discussion about appropriate benchmark values when examining this statistic in a large sample? I know that 10 is used as a threshold to indicate strong evidence of a better model for the Schwarz version of the BIC, but that doesn't seem appropriate with the output I am getting. Even using a 10% random sample (n= 40,827), I still obtain a two-factor model with a BIC that is 767 below the one-factor model (357,796 vs. 357,029). Note that the sample sizes are the same between models.
Most of the technical notes that I can find about BIC are not discussing gsem models specifically.
https://www.stata.com/manuals13/rbicnote.pdf#rBICnote
https://www.stata.com/statalist/arch.../msg00884.html
Thank you,
Emma
I am working on the validation of a latent measure with ordinal items (4-point Likert scales) and working with a very large sample size (400K). For this purpose, I am estimating gsem, ologit models and using the estat ic command to obtain the AIC and BIC statistics to compare competing factor structures. I've also estimated a polychoric correlation matrix and PCA. The problem is that factor loadings, internal consistency statistics (omega) and eigenvalues all strongly support a single factor model, but the BIC statistics are still significantly smaller for two-factor models. My question is, does anyone know if there are ways to correct for sample size when estimating the BIC or of discussion about appropriate benchmark values when examining this statistic in a large sample? I know that 10 is used as a threshold to indicate strong evidence of a better model for the Schwarz version of the BIC, but that doesn't seem appropriate with the output I am getting. Even using a 10% random sample (n= 40,827), I still obtain a two-factor model with a BIC that is 767 below the one-factor model (357,796 vs. 357,029). Note that the sample sizes are the same between models.
Most of the technical notes that I can find about BIC are not discussing gsem models specifically.
https://www.stata.com/manuals13/rbicnote.pdf#rBICnote
https://www.stata.com/statalist/arch.../msg00884.html
Thank you,
Emma
Comment