I have a large weighted survey data set (4 million records), but I am only concerned with a subset of the data. I want to delete the records/observations that are not part of my analysis so that I can work with a smaller set of observations. My concern is that deleting observations may affect the weights of the remaining observations and so undermine the validity of the remaining weighted sample. Is this a valid concern, or will the weights applied to the remaining observations retain their original validity? I realize I could use a subpop approach, but it would be simpler to cull the unwanted observations before the analysis.
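To make the two options concrete, here is a minimal sketch in Python (not my actual data or software). The toy records, the `in_subset` flag, and the function names are all hypothetical, made up for illustration. It compares only the weighted mean computed after culling with the same mean computed on the full file using a subset indicator. It does not touch standard errors, so it does not settle whether variance estimates are affected by the deletion.

```python
# Hypothetical toy records: (weight, outcome, in_subset) for each respondent.
records = [
    (120.0, 3.0, True),
    (80.0, 5.0, False),
    (200.0, 4.0, True),
    (150.0, 2.0, False),
    (50.0, 6.0, True),
]


def culled_mean(rows):
    """Delete the unwanted records first, then take the weighted mean."""
    kept = [(w, y) for w, y, in_subset in rows if in_subset]
    total_weight = sum(w for w, _ in kept)
    return sum(w * y for w, y in kept) / total_weight


def indicator_mean(rows):
    """Keep every record, but let only subset members contribute."""
    numerator = sum(w * y * (1.0 if d else 0.0) for w, y, d in rows)
    denominator = sum(w * (1.0 if d else 0.0) for w, y, d in rows)
    return numerator / denominator


if __name__ == "__main__":
    print(culled_mean(records))
    print(indicator_mean(records))
```

On this toy example the two point estimates coincide, since the retained records carry the same weights either way. My question is whether anything beyond the point estimate changes once the other records are physically removed.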