I'm fitting a marginal logistic model for clustered data and have a question about the variance multiplier for the robust variance. my understanding is that I need to specify the nmp option in xtgee to get this, and that the multiplier is M/(M-k), where M is the number of clusters and k is the number of cluster level predictor variables in my model. per the attached output, involving 9000 observations in 26 clusters, I get identical variance estimates in xtgee regardless of whether I specify the nmp option. Does this mean that the variance adjustment is already baked in, and that the nmp option is no longer needed, or that the adjustment is not M/(M-k), but rather N/(N-k), which in this case would be a pretty small adjustment? My understanding from the reading I've been doing is that it should be M (the cluster size) and not N (total sample size). As shown in the attached output, I also ran the same model using logistic with the vce(cluster clname) option and, as expected, matched the xtgee output. this is relevant for me since my actual dataset is 5 times as large as this and hence I can't fit xtgee due to matsize constraints in my version of Stata (I'm limited to 800 but need ~3400). However the model fits fine using the logistic command. unfortunately logistic does not even provide a variance adjustment option, even though in this case it is also using a sandwich estimator. so if it turns out I do need to adjust the variances from this model I presume I can just multiply the resulting SEs by sqrt (M/[M-k]). I realize that in any case I should also be treating the resulting test statistic as a t-statistic and adjust the df accordingly (M-k in this case). thanks for your thoughts. Bill
-
Login or Register
- Log in with
Comment