I am analyzing a large-ish data set (around 500,000 observations) .
I am looking to tabulate between two variables, Zip Code and Assigned Yard, with Zip Code being the independent variable.
I want to know the frequency of each assigned yard per zip code (each zip code will only go to 1-3 yards). As expected tabulate Zip Yard says "too many values". I tried to make a much smaller version of hte data set, and tabulate Zip Yard worked fine. So then using the entire data set I'm looking to do a few zips at a time. For example, tabulate Zip Yard if Zip= 1001 or tabulate Zip Yard if Zip <= 1001. I'm getting type mismatch.
I know what "type mismatch" means , but don't see how my data meets that criteria. Furthermore, if the data is th ewrong type, how would it have tabulated in the smaller data set?
Thanks in advance for any help.
I am looking to tabulate between two variables, Zip Code and Assigned Yard, with Zip Code being the independent variable.
I want to know the frequency of each assigned yard per zip code (each zip code will only go to 1-3 yards). As expected tabulate Zip Yard says "too many values". I tried to make a much smaller version of hte data set, and tabulate Zip Yard worked fine. So then using the entire data set I'm looking to do a few zips at a time. For example, tabulate Zip Yard if Zip= 1001 or tabulate Zip Yard if Zip <= 1001. I'm getting type mismatch.
I know what "type mismatch" means , but don't see how my data meets that criteria. Furthermore, if the data is th ewrong type, how would it have tabulated in the smaller data set?
Thanks in advance for any help.
Comment