Hello, I'm new to Stata. I have hunted around and can't seem to find a simple answer to what I'm sure is a common issue: duplicate rows. Let's say I want to calculate the mean of a variable, and my data look like the table below.
I'm just using this as an example to illustrate. I want to calculate the mean number of employees per manager, which is (30 + 40 + 60)/3 = 43.33. A plain mean over all six rows instead gives 280/6 = 46.67, because Bob and Jonas are counted more than once. I can't figure out how to make Stata treat Bob, Mary, and Jonas as three unique managers rather than six different ones.
Any help on which way I should be looking would be much appreciated.
Var_Manager | Var_total employees | Var_region served |
Bob | 30 | North |
Bob | 30 | East |
Mary | 40 | South |
Jonas | 60 | North |
Jonas | 60 | South |
Jonas | 60 | West |
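One direction that might work, sketched here under assumptions: flag a single row per manager with egen's tag() function and take the mean over the flagged rows only. The variable names manager, total_employees, region, and one_per_manager are made up for illustration (Stata variable names cannot contain spaces), and the sketch assumes the employee count is identical on every row for a given manager.

```stata
* Rebuild the example data; variable names are illustrative assumptions
clear
input str10 manager total_employees str10 region
"Bob"   30 "North"
"Bob"   30 "East"
"Mary"  40 "South"
"Jonas" 60 "North"
"Jonas" 60 "South"
"Jonas" 60 "West"
end

* tag() sets 1 on exactly one row per distinct manager and 0 on the rest
egen byte one_per_manager = tag(manager)

* Mean over the three tagged rows only, i.e. (30 + 40 + 60)/3
summarize total_employees if one_per_manager
display r(mean)
```

Tagging keeps all six rows in memory, so the region information is still there for later steps. If the duplicate rows are not needed at all, dropping them first (for example with duplicates drop on the manager variable) would be another route to the same mean.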