I am using two waves (2007 and 2013/14) of the Demographic and Health Survey (DHS) data, which uses a multistage stratified sampling design. Since the multi-stage cluster deviates from Simple Random Sampling assumptions I need to weight the data to provide unbiased estimates as well as apply a stratification and cluster adjustment to correct the standard errors. The weights are provided by the DHS and specific to the recode files.
The guidance from the DHS is straightforward for cross-sections. My study cohort includes 52,656 children born between 2000-2013. This cross-section of births was then expanded into a panel dataset at the child-month level, with each child observed for up to 59 months.The outcome variable is binary (0/1 mortality).
I've applied the following adjustments (later using svy), but think I am not correctly weighting the data to account for multiple observations by the same child. I know this wasn't possible in Stata v13, but think it is possible in Stata v14. Any help would be greatly appreciated.
The guidance from the DHS is straightforward for cross-sections. My study cohort includes 52,656 children born between 2000-2013. This cross-section of births was then expanded into a panel dataset at the child-month level, with each child observed for up to 59 months.The outcome variable is binary (0/1 mortality).
I've applied the following adjustments (later using svy), but think I am not correctly weighting the data to account for multiple observations by the same child. I know this wasn't possible in Stata v13, but think it is possible in Stata v14. Any help would be greatly appreciated.
Code:
gen wt= v005/1000000 egen strata=group(v000 v025 ADM1_CODE) // strata also includes the survey (identified by v000) in the group command egen v001r = group(v000 v001) // cluster also includes the survey in the group command svyset v001r [pw=wt], strata(strata) singleunit(centered) DATA DICTIONARY
Variable Description v005 womens individual sample weight v000 country code and survey phase v025 urban/rural ADM1_CODE geographic province v001 survey cluster id child level unique id
Comment