Hello,
This is really just a statistics question. I have a panel dataset covering 20 years, with 50 variables (indicators) for every country in the world. We want a simple estimate of whether each indicator's linear trend over time differs from 0, so we are regressing each indicator on time with country fixed effects. Right now the indicators are all in different units, so we want to standardize the outcome variables. My question: given that the standardized outcome will be regressed on time, should we calculate the standardized values (z-scores or min-max) by pooling each indicator's data over all years, or by calculating them for each year separately? And following that, is there any argument for why z-scores or min-max would be preferred or easier to interpret?
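To make the two options concrete, here is a minimal sketch of what I mean by "pooled" versus "per year" z-scores. It is plain Python on a made-up toy panel for a single indicator; the data and the function names (`zscore_pooled`, `zscore_by_year`, `yearly_means`) are hypothetical and only for illustration, not our actual pipeline.

```python
from statistics import mean, pstdev

# Hypothetical toy panel for ONE indicator: (country, year, value) rows.
# Each country rises by 1 unit per year, so the raw trend is positive.
rows = [
    (country, year, base + year)
    for country, base in [("A", 0.0), ("B", 5.0), ("C", 10.0)]
    for year in range(4)
]

def zscore_pooled(rows):
    """Option 1: one mean and SD computed over all countries and all years."""
    values = [v for _, _, v in rows]
    m, s = mean(values), pstdev(values)
    return [(c, y, (v - m) / s) for c, y, v in rows]

def zscore_by_year(rows):
    """Option 2: a separate mean and SD for each year."""
    out = []
    for yr in sorted({y for _, y, _ in rows}):
        values = [v for _, y, v in rows if y == yr]
        m, s = mean(values), pstdev(values)
        out.extend((c, y, (v - m) / s) for c, y, v in rows if y == yr)
    return out

def yearly_means(rows):
    """Cross-country mean of the (standardized) indicator in each year."""
    years = sorted({y for _, y, _ in rows})
    return [mean([v for _, y, v in rows if y == yr]) for yr in years]

# Pooled z-scores keep the upward movement in the yearly means;
# per-year z-scores set every year's mean to 0 by construction.
print(yearly_means(zscore_pooled(rows)))
print(yearly_means(zscore_by_year(rows)))
```

In this toy example at least, the per-year version forces each year's cross-country mean to 0, so the common trend we want to test is no longer visible in the standardized values, while the pooled version only rescales it. I am not sure how far that carries over to real data, or to min-max scaling.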
Thanks for your help!
-Kate