Hi all-- I am trying to add geocoded variables to a large dataset and have encountered a strange problem. When I export my data from stata to csv and then re-import it into stata, the summary statistics for the variables of interest are the same but the exact values are different. To be more specific, I exported a file with no duplicate observations but then when I reimport the file back into stata it reports duplicate observations. Moreover, when I go to merge the new (imported from csv) file into the old (stata) file, there are many failed matches. Has anyone else had this problem? I have reduced the dataset to three variables-- 2 of type "float" and one that is a string variable. There are close to 800,000 observations so it's difficult for me to manually figure out what has been dropped and what has been duplicated. Any advice would be appreciated. Thanks!
-
Login or Register
- Log in with

Comment