I have about 600 csv files. I would like to drop some variables & observations in those files, then convert them into Stata format, and eventually combine (append) them into one big file (hopefully the size is not too big. Maybe around 5 or 6GB).
Specifically, I run the following codes for the first file and would like to repeat such procedures for all the other files:
import delimited "E:\Research\0001.csv", encoding(UTF-8)
drop V1 V2 V3
keep if country=="United States"
save "E:\Research\0001_n.dta"
After converting them into Stata files, I would like to combine(append) them into one single file. Does anyone know how to do it? Any help or guidance will be greatly appreciated!! Thank you in advance.
BTW, when I tried to import this csv file into Stata, it shows
"Note: Unmatched quote while processing row 605334; this can be due to a formatting problem in the file or because a quoted data element spans multiple lines. You should carefully inspect your data after importing. Consider using option bindquote(strict) if quoted data spans multiple lines or option bindquote(nobind) if quotes are not used for binding data".
But when I looked at the data, it looks okay. Should I do anything about it?
Specifically, I run the following codes for the first file and would like to repeat such procedures for all the other files:
import delimited "E:\Research\0001.csv", encoding(UTF-8)
drop V1 V2 V3
keep if country=="United States"
save "E:\Research\0001_n.dta"
After converting them into Stata files, I would like to combine(append) them into one single file. Does anyone know how to do it? Any help or guidance will be greatly appreciated!! Thank you in advance.
BTW, when I tried to import this csv file into Stata, it shows
"Note: Unmatched quote while processing row 605334; this can be due to a formatting problem in the file or because a quoted data element spans multiple lines. You should carefully inspect your data after importing. Consider using option bindquote(strict) if quoted data spans multiple lines or option bindquote(nobind) if quotes are not used for binding data".
But when I looked at the data, it looks okay. Should I do anything about it?
Comment