Hello all,
I try to run a regression with Interest rate spread (imagine mortgage rate minus the treasure of comparable maturity) on a list of control variables. I have about 300,000 observations. The problem is that, for some legal reason, I only observe the rate spread if it is above certain threshold (3% in my case). This threshold is set so high that about 96% of my observations have censored rate spread (i.e., missing value). My question is, what is the best econometric method to choose? I tried tobit and heckman two stage. But I am concerned with the strong assumptions needed (particularly the normality assumption as interest rate spread is by no way normal). As it is already a rate measure, I don't think taking log is a good idea here.
Another general concern is the heavy censor rate. Can I trust tobit (or any method) at all if over 96% of my observations are censored below?
Thank you so much for the help.
Best
Hua
I try to run a regression with Interest rate spread (imagine mortgage rate minus the treasure of comparable maturity) on a list of control variables. I have about 300,000 observations. The problem is that, for some legal reason, I only observe the rate spread if it is above certain threshold (3% in my case). This threshold is set so high that about 96% of my observations have censored rate spread (i.e., missing value). My question is, what is the best econometric method to choose? I tried tobit and heckman two stage. But I am concerned with the strong assumptions needed (particularly the normality assumption as interest rate spread is by no way normal). As it is already a rate measure, I don't think taking log is a good idea here.
Another general concern is the heavy censor rate. Can I trust tobit (or any method) at all if over 96% of my observations are censored below?
Thank you so much for the help.
Best
Hua
Comment