I was wondering how to best sort data to locate strange or missing values. The data file has over 4 million observations, I cannot "tab" results for a lot of the variables because the variables take on too many values. One way that I've been approaching this is by using "gsort variable_name, mfirst" and then browsing the sorted data manually to see if there are any odd values. Is there a more efficient way to approach this?
Thanks!
Andrew
Thanks!
Andrew

Comment