I'm doing some Stata practice on 3 datasets. One data file has transactions with dates, barcodes, store codes, etc. Another sheet maps each barcode to its category and subcategory. A third maps each store code to that store's attributes. The analysis is basic, along the lines of: which category/subcategory sells the most/least? What are the highest-selling categories each month?
My first thought was to load each file separately and write a for loop that matches every transaction's barcode to its category and subcategory, and its store code to the store attributes. But then I realized this strategy would involve 2 for loops searching through the 1k-row store list and the 200k-row item list for each transaction in the 1M-row transaction list, which in the worst case is on the order of 200 billion comparisons.
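One alternative to the loops is a many-to-one merge, which attaches the lookup columns to every transaction in a single pass. The sketch below is only an illustration under assumptions: the three datasets are already saved as Stata files with the hypothetical names transactions.dta, items.dta and stores.dta; the key variables are called barcode and store_code; date is a Stata daily date; and qty is a hypothetical units-sold variable. It also assumes each barcode appears only once in the item file and each store code only once in the store file.

```stata
* Assumed file and variable names throughout; adjust to the real data.
use transactions, clear

* Attach category/subcategory to each transaction by barcode.
* m:1 = many transactions per barcode, one item row per barcode.
merge m:1 barcode using items, keepusing(category subcategory) ///
    keep(master match) nogenerate

* Attach store attributes by store code in the same way.
merge m:1 store_code using stores, keep(master match) nogenerate

* Build a monthly date from the daily date (assumes date is a Stata daily date).
generate month = mofd(date)
format month %tm

* Total units sold per category per month (qty is a hypothetical variable).
collapse (sum) qty, by(month category)

* Keep the top-selling category in each month (ties are broken arbitrarily).
bysort month (qty): keep if _n == _N
list month category qty
```

If the files are CSV or Excel sheets rather than .dta files, they would first need to be brought in with import delimited or import excel and saved. Swapping category for subcategory in the collapse step would give the same ranking at the subcategory level.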