Dear all,
I'm facing some difficulties with my analysis of survey data.
I'm using DHS data and analyzing the association between stunting and regions of the country. I set the data for "svy" and the result is below:
So, according to the results there is no significance.
I was asked to perform ANOVA to test relationship among the region, however I've read that ANOVA is not suitable for categorical variables.
So I tried Pearson chi2 test of independence using the following code:
My question is how to assign weights for Pearson chi2 test? And which test is better?
In my understanding, in svy the chi2 statistic is converted to an F statistic. Does this mean that the test of independence was performed?
Then, why the separate tab command with chi2 produces significant results?
Can ANOVA be performed for such data?
I'm not good at statistics so I apologize for such nonconstructive question.
Thanks beforehand
I'm facing some difficulties with my analysis of survey data.
I'm using DHS data and analyzing the association between stunting and regions of the country. I set the data for "svy" and the result is below:
Code:
svy: tab region stunted, row (running tabulate on estimation sample) Number of strata = 9 Number of obs = 4,523 Number of PSUs = 354 Population size = 4,713.6234 Design df = 345 ---------------------------------------- Region of | Stunted residence | not-stun stunted Total ----------+----------------------------- Dushanbe | .8122 .1878 1 GBAO | .7579 .2421 1 SUGHD | .7328 .2672 1 DRS | .7378 .2622 1 KHATLON | .7285 .2715 1 | Total | .7391 .2609 1 ---------------------------------------- Key: row proportion Pearson: Uncorrected chi2(4) = 11.1518 Design-based F(2.89, 998.03) = 2.0425 P = 0.1087
I was asked to perform ANOVA to test relationship among the region, however I've read that ANOVA is not suitable for categorical variables.
So I tried Pearson chi2 test of independence using the following code:
Code:
. tab region stunted, row chi2 +----------------+ | Key | |----------------| | frequency | | row percentage | +----------------+ Region of | Stunted residence | not-stunt stunted | Total -----------+----------------------+---------- Dushanbe | 580 131 | 711 | 81.58 18.42 | 100.00 -----------+----------------------+---------- GBAO | 298 96 | 394 | 75.63 24.37 | 100.00 -----------+----------------------+---------- SUGHD | 667 242 | 909 | 73.38 26.62 | 100.00 -----------+----------------------+---------- DRS | 931 330 | 1,261 | 73.83 26.17 | 100.00 -----------+----------------------+---------- KHATLON | 918 330 | 1,248 | 73.56 26.44 | 100.00 -----------+----------------------+---------- Total | 3,394 1,129 | 4,523 | 75.04 24.96 | 100.00 Pearson chi2(4) = 20.0773 Pr = 0.000
In my understanding, in svy the chi2 statistic is converted to an F statistic. Does this mean that the test of independence was performed?
Then, why the separate tab command with chi2 produces significant results?
Can ANOVA be performed for such data?
I'm not good at statistics so I apologize for such nonconstructive question.
Thanks beforehand
Comment