I'm looking into satisfaction with GP practices in England and I’m using NHS payments to practices as one of the independent variables in the regression model. I have two questions regarding transformation and interpretation of results.
Question 1:
The payments in the dataset are a) positively skewed (sktest 2.81) and b) have a range of -23,000 to over 1,000,000, including some who received £0.
From what I’ve read before, it’s standard to transform variables such as income before running a regression if they are skewed, but I also read that I cannot log this variable because it contains negative and zero values. What kind of transformation should I undertake to correct for skewness, a quadric or cubic transformation maybe?
Question 2:
The dependent variable is the practice satisfaction score on a scale of 1-5. The variable is negatively skewed as it seems clustering of satisfaction scores at the higher end is common for these kinds of surveys. In order to make the distribution more normal I first reflected and then logged the variable.
The independent variable for payments is expressed in £ and pence (e.g. £25,022.60). If I don’t transform it, I would just like to check what my interpretation of the results will be – a one unit change in the IV is associated with a 100 times the coefficient percent change in the DV. Am I correct in saying that a one unit here is £0.01?
Question 1:
The payments in the dataset are a) positively skewed (sktest 2.81) and b) have a range of -23,000 to over 1,000,000, including some who received £0.
From what I’ve read before, it’s standard to transform variables such as income before running a regression if they are skewed, but I also read that I cannot log this variable because it contains negative and zero values. What kind of transformation should I undertake to correct for skewness, a quadric or cubic transformation maybe?
Question 2:
The dependent variable is the practice satisfaction score on a scale of 1-5. The variable is negatively skewed as it seems clustering of satisfaction scores at the higher end is common for these kinds of surveys. In order to make the distribution more normal I first reflected and then logged the variable.
The independent variable for payments is expressed in £ and pence (e.g. £25,022.60). If I don’t transform it, I would just like to check what my interpretation of the results will be – a one unit change in the IV is associated with a 100 times the coefficient percent change in the DV. Am I correct in saying that a one unit here is £0.01?
Comment