Hi there,
I am working with survey data and the svy commands, and wondering: what is the most accurate way to conduct a test of means in my scenario?
In my dataset:
However, is this correct? I'm not sure right now why the standard error results are different when I do (i) svy only only the subset of round1-4 records (see directly below), versus (ii) when I do svy as shown above. I appreciate your time and help. Thank you!
I am working with survey data and the svy commands, and wondering: what is the most accurate way to conduct a test of means in my scenario?
In my dataset:
- The records are from five different survey rounds, where the variable, source indicates which round the data is from
- I would like to compare the following two means: (i) the mean age across rounds 1-4, with (i) the mean age in round 5; the variable is called "FQ_age"
- The sampling weight is round-specific and is stored in the variable "FQweight".
- Generated a variable, source_age to group all records in rounds 1-4 under source_age=0 and all round 5 records as source_age=1
- Then, did svy means over (source_age)
- Then, used the test command
However, is this correct? I'm not sure right now why the standard error results are different when I do (i) svy only only the subset of round1-4 records (see directly below), versus (ii) when I do svy as shown above. I appreciate your time and help. Thank you!
Comment