Dear all,
I have a panel dataset of firms in which my dependent variable is a ratio that can take positive and nagative values. I am interested, however, not in the sign of the ratio but on its magnitude and I decided to use the absolute value of the ratio as dependent variable.
This leads the dependent variable to be, of course, positively skewed. I have checked for the normality of residulas after I run the xtreg regression (in which I cluster st.errors by firms), and the residuals are fairly normally distributed graphically, and also the skeweness and kurtoris seem to be not that bad (Skeweness 0.784; Kurtosis 4.21).
I know that, for OLS assumptions to be not violated, what matters is that the residuals are normally distributed and not the dependent variable, however some of my instructors (I am a phd student) raised some concern on the DV in absolute terms and suggested to use a log trasformation of the DV.
When I take the log transformation, however, some absolute values turn to be negative when the ratio is <1. Moreover, when i run again the regression with the log DV, the residuals become skeewed, so I guess I should not transform the DV in log and stick to the absolute values.
Could you tell me if my intuition is right or if, instead, having a absolute DV brings other problems that I am not considering?
Thanks a lot in advance.
I have a panel dataset of firms in which my dependent variable is a ratio that can take positive and nagative values. I am interested, however, not in the sign of the ratio but on its magnitude and I decided to use the absolute value of the ratio as dependent variable.
This leads the dependent variable to be, of course, positively skewed. I have checked for the normality of residulas after I run the xtreg regression (in which I cluster st.errors by firms), and the residuals are fairly normally distributed graphically, and also the skeweness and kurtoris seem to be not that bad (Skeweness 0.784; Kurtosis 4.21).
I know that, for OLS assumptions to be not violated, what matters is that the residuals are normally distributed and not the dependent variable, however some of my instructors (I am a phd student) raised some concern on the DV in absolute terms and suggested to use a log trasformation of the DV.
When I take the log transformation, however, some absolute values turn to be negative when the ratio is <1. Moreover, when i run again the regression with the log DV, the residuals become skeewed, so I guess I should not transform the DV in log and stick to the absolute values.
Could you tell me if my intuition is right or if, instead, having a absolute DV brings other problems that I am not considering?
Thanks a lot in advance.
Comment