Hi all, this is my labor force survey 2020, and the structure of LFS 2021 is similar. As the survey subjects are different in the two surveys, I want to create a pseudo-panel by using Deaton (1985) method (https://www.princeton.edu/~deaton/do...s_Sections.pdf), I based my code on that, but I'm not sure if it's accurate, and second, how do we handle the individual weights variable when converting from repeated cross-sectional data to pseudo-data? Someone can help me please!!! Many thanks
Here's my code to create pseudo (but do not done anything about weights:
Code:
clear input double(monthsv province district commune householdID memberID urban weight sex year_birth age wage survey_year) 4 1 1 1 1 3 1 134.71753154175403 1 1984 35 6000 2020 1 1 1 1 1 3 1 134.71753154175403 1 1984 35 8000 2020 1 1 1 1 1 4 1 134.71753154175403 1 1987 32 9000 2020 4 1 1 1 1 4 1 134.71753154175403 1 1987 32 7000 2020 4 1 1 1 1 5 1 130.90478469928155 2 1989 30 4850 2020 1 1 1 1 1 5 1 130.90478469928155 2 1989 30 6850 2020 1 1 1 1 5 1 1 134.71753154175403 1 1960 59 30000 2020 4 1 1 1 5 1 1 134.71753154175403 1 1960 59 20000 2020 4 1 1 1 5 2 1 223.32760790596396 2 1961 58 15000 2020 1 1 1 1 5 2 1 223.32760790596396 2 1961 58 133000 2020 1 1 1 1 5 3 1 134.71753154175403 1 1989 30 13850 2020 4 1 1 1 5 3 1 134.71753154175403 1 1989 30 13850 2020 4 1 1 1 5 4 1 134.71753154175403 1 1987 33 14850 2020 1 1 1 1 5 4 1 134.71753154175403 1 1987 32 14850 2020 1 1 1 1 5 5 1 130.90478469928155 2 1990 29 8850 2020 end
Code:
gen birth_year = survey_year - age gen birth_cohort = floor(birth_year/5)*5 collapse (count) n_obs=age, by(sex birth_cohort survey_year) egen cohort_id = group(sex birth_cohort) xtset cohort_id survey_year

Comment