Dear All,
the following thread discussing the same issue points to a useful command -chunky- that can be used to cut a large CSV file into smaller pieces, which may be processed separately.
This is helpful, but I wonder if there is any possibility to tell it to maintain some groups of observations together? (even at the expense of having the chunks' size more volatile). In other words, I am looking for a by(hhid) or similar option that could say that (in this case) I wish the observations (persons) belonging to the same household to stay together in the same chunk. The source data is grouped, but not sorted (meaning you can have A,A,C,C,C,C,D,D,B,B,B,B,B, but not A,C,A,B....).
I estimate that the size of the group (e.g. 1MB of data) is negligible compared to the size of the chunk (e.g. 1GB of data).
Is there an option I am overlooking in -chunky- or is there any other more recent and usable command for this purpose?
PS: I understand I can re-import each chunk, combine them together and save by groups, but I'd rather look for a more ready-made solution.
Thank you, Sergiy
the following thread discussing the same issue points to a useful command -chunky- that can be used to cut a large CSV file into smaller pieces, which may be processed separately.
This is helpful, but I wonder if there is any possibility to tell it to maintain some groups of observations together? (even at the expense of having the chunks' size more volatile). In other words, I am looking for a by(hhid) or similar option that could say that (in this case) I wish the observations (persons) belonging to the same household to stay together in the same chunk. The source data is grouped, but not sorted (meaning you can have A,A,C,C,C,C,D,D,B,B,B,B,B, but not A,C,A,B....).
I estimate that the size of the group (e.g. 1MB of data) is negligible compared to the size of the chunk (e.g. 1GB of data).
Is there an option I am overlooking in -chunky- or is there any other more recent and usable command for this purpose?
PS: I understand I can re-import each chunk, combine them together and save by groups, but I'd rather look for a more ready-made solution.
Thank you, Sergiy
Comment