Hi,
Just a very quick question that hopefully someone will be able to answer. I am estimating a log-linear equation: lnY=b0+b1*lnX1+b2*lnX2+ ... + e
One of my X variables (a very important one) takes negative values in many observations. Its range is around -20 to +20. Obviously, I can't take logs of the zero or negative values; one solution I've used before when dealing with variables with occasional zero values is the inverse hyperbolic sine transformation: ln(x+sqrt(x^2+1)) which is approximately equal to ln(2*x), however, given that I'm working with a lot of negative values, I don't think this is suitable.
What I've done instead is to rescale the variable before taking its log: ln(20+x). Are there any problems with doing this? In terms of the interpretation, this variable is being used as a proxy.
Thanks,
Alex Stead
Just a very quick question that hopefully someone will be able to answer. I am estimating a log-linear equation: lnY=b0+b1*lnX1+b2*lnX2+ ... + e
One of my X variables (a very important one) takes negative values in many observations. Its range is around -20 to +20. Obviously, I can't take logs of the zero or negative values; one solution I've used before when dealing with variables with occasional zero values is the inverse hyperbolic sine transformation: ln(x+sqrt(x^2+1)) which is approximately equal to ln(2*x), however, given that I'm working with a lot of negative values, I don't think this is suitable.
What I've done instead is to rescale the variable before taking its log: ln(20+x). Are there any problems with doing this? In terms of the interpretation, this variable is being used as a proxy.
Thanks,
Alex Stead
Comment