I am currently using the National Child Development Study (1958-) to analyse the impact of height, weight and marriage on wages. Six waves of data are relevant to my study and need to be merged (eventually!). I imagine I first need to go through each wave and keep only the variables relevant to my analysis using the keep command. My first question is how to approach this: should I open each wave in Stata in turn, keep the relevant variables, and then save each reduced dataset so that merging at a later stage is easier?
As a side note, some variables in certain waves need to be edited because the questions asked across the panel were not uniform. For example, I will have to convert the weights in each wave to the same unit of measurement. Should I do this in the initial editing stage of each wave, or wait until I have all my data together and then make the variables match?
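To make the question concrete, here is a rough sketch of the workflow I have in mind. Everything named in it is a placeholder rather than the real NCDS naming: the file names (wave1.dta to wave6.dta), the identifier (id) and the variable names (height, weight, marstat, wage) are all assumptions, and in practice each wave will have its own variable names, so the keep list would differ by wave. The unit conversion is shown per wave only as one of the two options I am asking about.

```stata
* Sketch only: file names, id and variable names are hypothetical placeholders.
forvalues w = 1/6 {
    use "wave`w'.dta", clear

    * Keep only the variables needed for the analysis
    keep id height weight marstat wage

    * Option A: harmonise units here, wave by wave.
    * For example, if this wave recorded weight in pounds:
    * replace weight = weight * 0.45359237

    * Give each variable a wave suffix so names do not clash when merged
    foreach v of varlist height weight marstat wage {
        rename `v' `v'_w`w''
    }

    save "wave`w'_trim.dta", replace
}

* Combine the trimmed waves into one wide dataset, one row per person
use "wave1_trim.dta", clear
forvalues w = 2/6 {
    merge 1:1 id using "wave`w'_trim.dta", nogenerate
}
```

Is this roughly the right approach, or is there a better way to structure it?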
I apologise if I have explained my problem poorly. Any help would be greatly appreciated.
Thank you.