I need to calculate RMSE a number of times, where I have saved data and various predictions. In checking the stats, I noticed my own calculations and stata's results (via cnsreg) were off. It seems Stata adjusts for the degrees of freedom in all RMSE calculations. I find this odd, as textbooks I've looked at do not adjust for degrees of freedom. (E.g. Wooldridge writes "This is essentially the sample standard deviation of the forecast errors (without any degrees of freedom adjustment)." Greene also implies no adjustment. Is this a well known issue? Is it normally such as small effect people don't mind? Are the textbooks out of date? I can find stata forum posts confidently asserting stata's approach is correct/normal, but nothing acknowledging the discrepancy between econometrics textbooks and stata's implementation.
My own code was running cnsreg with a constraint of 1 on the only RHS variable (the model prediction), and without a constant, as a quick way of calculating the RMSE. That approach means cnsreg actually adds one to the sample size, as there is one constraint.
Is there a way, other than coding the whole calculation, to opt for an unadjusted RMSE? And is this adjustment standard outside of my favourite textbooks?
Many thanks.
My own code was running cnsreg with a constraint of 1 on the only RHS variable (the model prediction), and without a constant, as a quick way of calculating the RMSE. That approach means cnsreg actually adds one to the sample size, as there is one constraint.
Is there a way, other than coding the whole calculation, to opt for an unadjusted RMSE? And is this adjustment standard outside of my favourite textbooks?
Many thanks.
Comment