Hi
Apologies if this is a very obvious question - it seems to me that this is something people must do quite frequently, but I've done quite a lot of searching online and haven't been able to figure out how it's done.
I am working with complex survey panel data from South Africa, and performing an explanatory analysis which looks at how living in different locations may be related to mobility in certain labour market outcomes. By locations I mean "formal urban area", "informal urban area" and "rural area".
One part of the analysis relies on mobility matrices, which looks at how labour market outcomes in a specific type of location have changed from wave1 to wave3 of the panel. This basically means creating cross-tabs of the same variable, but from different waves. So for example, I use:
to see how employment status has changed between wave1 and wave3 for people living in informal urban areas. The three employment status outcomes used in this example are "Employed", "Unemployed" and "NEA" (not economically active).
Similarly, I use:
to see how employment status has changed for people living in rural areas.
Say I am interested in whether the proportion of people moving from "Employed" in Wave1 to "Unemployed" in Wave3 significantly differs between informal urban and rural areas. One way I can quickly do this myself is by comparing the reported confidence intervals of that proportion in the two tables. If the confidence intervals do not overlap I know that there is a significant difference at the 95% significance level. If one confidence interval completely contains the other confidence interval, I know that there is not sufficient evidence at the 95% level to conclude that there is difference. However I am unsure what to do in Stata when the confidence intervals only partially overlap.
Outside of Stata, my understanding is that in this case the difference may be significant at the 95% level, but that I need to perform a Chi-2 test to check this. The issue is that I'm not sure how to operationalise this in Stata in this context.
Any help on how to do this would be greatly appreciated, or advice on a better way to test whether there is a significant difference in specific proportions between the urban formal/urban informal/ rural tables.
Thanks very much,
Josh
Apologies if this is a very obvious question - it seems to me that this is something people must do quite frequently, but I've done quite a lot of searching online and haven't been able to figure out how it's done.
I am working with complex survey panel data from South Africa, and performing an explanatory analysis which looks at how living in different locations may be related to mobility in certain labour market outcomes. By locations I mean "formal urban area", "informal urban area" and "rural area".
One part of the analysis relies on mobility matrices, which looks at how labour market outcomes in a specific type of location have changed from wave1 to wave3 of the panel. This basically means creating cross-tabs of the same variable, but from different waves. So for example, I use:
Code:
svy, subpop(panel_empl_informal): tab w1_empl_stat w3_empl_stat, row ci percent
Similarly, I use:
Code:
svy, subpop(panel_empl_rural): tab w1_empl_stat w3_empl_stat, row ci percent
Say I am interested in whether the proportion of people moving from "Employed" in Wave1 to "Unemployed" in Wave3 significantly differs between informal urban and rural areas. One way I can quickly do this myself is by comparing the reported confidence intervals of that proportion in the two tables. If the confidence intervals do not overlap I know that there is a significant difference at the 95% significance level. If one confidence interval completely contains the other confidence interval, I know that there is not sufficient evidence at the 95% level to conclude that there is difference. However I am unsure what to do in Stata when the confidence intervals only partially overlap.
Outside of Stata, my understanding is that in this case the difference may be significant at the 95% level, but that I need to perform a Chi-2 test to check this. The issue is that I'm not sure how to operationalise this in Stata in this context.
Any help on how to do this would be greatly appreciated, or advice on a better way to test whether there is a significant difference in specific proportions between the urban formal/urban informal/ rural tables.
Thanks very much,
Josh
Comment