Greetings Statalist. My question is as follows: I am working on a very large public dataset with a colleague, and for a variety of reasons we are using Dropbox as our shared folder for code and data storage.
However, the full dta file is so large that it uses nearly all of our available Dropbox storage, and I am trying to figure out the best way to fix this. One idea I had was to go through the raw data and create a second, much smaller dta file with only the variables I know we want, and then zip the raw file so it uses less space (or delete it altogether, since it's fairly simple to re-download).
Another idea was to zip the file and then load the data directly from the file inside the zipped folder. Is this possible?
I am very open to other solutions, as well.
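To make the first idea concrete, this is roughly what I have in mind. It is an untested sketch: the file names (raw_full.dta, analysis_subset.dta, raw_full.zip) and the variable names (id, year, income) are placeholders, not the real ones. As far as I can tell, unzipfile extracts the archive to disk before use can read it, so this does not answer my second question about reading straight from the zip.

```stata
* Sketch only: file and variable names below are placeholders
* Load just the variables we need from the raw file
use id year income using "raw_full.dta", clear

* Shrink storage types where possible without losing information
compress
save "analysis_subset.dta", replace

* Archive the raw file, then remove the uncompressed copy
zipfile "raw_full.dta", saving("raw_full.zip", replace)
erase "raw_full.dta"

* Later, to get the raw data back: extract, then load as usual
unzipfile "raw_full.zip", replace
use "raw_full.dta", clear
```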