Hi all!
I’m working with a pooled cross-section data set from a survey that is repeated monthly. I’m trying to declare the survey design with the svyset command, but I’m not sure I’m doing it correctly, and I’d really appreciate some guidance.
The survey collects data on individuals, and each individual has an expansion factor (sampling weight): one person may represent 100 people in the population, another 150, and so on.
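To make the expansion factor concrete, here is a minimal sketch of a sanity check, assuming the variables are named year and fxp as in my data: if the weights behave as I describe, the sum of fxp should be roughly the population size times the number of monthly waves in that year.

```stata
* Sketch only: assumes variables named year and fxp exist in the data.
* For each year, show the sum of the expansion factors and the number
* of sampled individuals.
tabstat fxp, by(year) statistics(sum n)
```

Because each monthly sample represents the whole population, a full year with 12 waves should show a sum of about 12 times the population size.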
My data set covers 2008 to 2013. For example, for one year I have something like this:
Obs. | Year | Month | Expansion factor (fxp) | X variables
1    | 2008 | 1     | 152                    | .
2    | 2008 | 1     | 68                     | .
3    | 2008 | 1     | 205                    | .
4    | 2008 | 2     | 120                    | .
5    | 2008 | 2     | 208                    | .
6    | 2008 | 2     | 89                     | .
7    | 2008 | 3     | 97                     | .
8    | 2008 | 3     | 134                    | .
…    | 2008 | …     | …                      | …
n-1  | 2008 | 12    | 35                     | .
n    | 2008 | 12    | 168                    | .
Here the X variables are the variables that describe each observation (such as sex, age, city, among others).
I have exactly the same survey for every month from January 2008 to September 2013. I’m attempting to analyze the data as pooled cross sections, and I used the following Statalist post as a guide:
http://www.stata.com/statalist/archi.../msg00521.html
As the post suggests, since the samples are drawn independently, I specify the year/wave as super-strata. My commands are as follows:
    egen monthXyear = group(month year)
    svyset monthXyear [pw=fxp], strata(year)
where fxp is the expansion factor of each observation.
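In case it helps anyone check the declaration, this is a minimal sketch of how I can inspect what Stata has stored, assuming the svyset command above has already been run in the current session. It only displays the declared design; it does not tell me whether the design is the right one.

```stata
* Sketch only: assumes the svyset command above has already been run.

* Display the survey design settings currently stored with the data
svyset

* Describe the declared strata and sampling units
svydescribe
```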
I was wondering whether this specification correctly reflects my data set, and whether I need to specify a particular variance estimation method (jackknife, bootstrap, etc.).
Thanks for your help.