Hi People,
I am working on a data set, which is really huge. It is an administrative data set and I want to Combine them, one is about firms maybe 500 000 observations over 1,4 gb size, the other one is about
individual employee data, much bigger with nearly 10 Million observations about 4 gb in size. When I want to merge both , my Memory is too low, the complete stata programe breaks up. So do you have any tips for me for working with Panel data? I want to calculate such things like churning rate or gross flow turnover rate. Does it maybe makes sense only to Keep real Panel data, because then my Observation number would reduce immense?
I am working on a data set, which is really huge. It is an administrative data set and I want to Combine them, one is about firms maybe 500 000 observations over 1,4 gb size, the other one is about
individual employee data, much bigger with nearly 10 Million observations about 4 gb in size. When I want to merge both , my Memory is too low, the complete stata programe breaks up. So do you have any tips for me for working with Panel data? I want to calculate such things like churning rate or gross flow turnover rate. Does it maybe makes sense only to Keep real Panel data, because then my Observation number would reduce immense?
Comment