I don't understand the new data organization. What on earth is that variable Year doing in there? In the example data, it isn't even a variable--it's just a constant 1950. But assuming in the full data set it does vary, it seems to have no connection whatsoever to any of the other variables. The other variables are chronologically related only to the Date and Year StartRole and EndRole variables. The variable Year should just be dropped from the data set: it is completely out of place.
In addition, I do not understand clearly what you want to do with the percentage variables. Do you want a total for each manager separately in each company and year? Or do you want a grand total for all the female managers, and another grand total for all the male mangers in each company in year?
In addition, I do not understand clearly what you want to do with the percentage variables. Do you want a total for each manager separately in each company and year? Or do you want a grand total for all the female managers, and another grand total for all the male mangers in each company in year?
Comment