Stepwise regression as a robustness test after an OLS?

Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#1

Stepwise regression as a robustness test after an OLS?

12 Jun 2021, 18:12

Would it be accepted to use the same variables, but switch from OLS to stepwise in Stata?
Or what other test would you recommend after having used an OLS?
Tags: None
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17712
#2

13 Jun 2021, 02:21

Thomas:
after an OLS, you shoud usually check for:
- heteroskedasticity of the residuals distribution (see -estat hettest-);
- model misspecification (-estat ovtest-), that may well include endogeneity of one or more regressors;
- correctness of the functional form of the regressand (see -linktest-), that is a more general test for model misspecification;
- quasi-extreme multicollinearity (see -estat vif-). Please note that quasi-extreme multicolinearity is way oversold/overblown (the usual reference for this topic is https://www.hup.harvard.edu/catalog....=9780674175440, Chapter 23).
Eventually, stepwise mechanism proves the existence of devil (see why at https://www.stata.com/support/faqs/s...sion-problems/).

Kind regards,
Carlo
(Stata 19.0)
Comment
Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#3

13 Jun 2021, 07:11

Dear Carlo,

But can these tests go into the "Robustness tests" in my paper? Or do they belong to the OLS itself?
I'm a bit confused. :/
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17712
#4

13 Jun 2021, 07:47

Thomas:
the issue is that -stepwise- is not a robustness test, as you usually end up with a model specification that, at best, shows a tenuous relatioship with the original one,
In addition, whenever we talk about "robustness", we are required to define against with nuisances we want our OLS to be robust to (heteroskedasticity?else?).

Kind regards,
Carlo
(Stata 19.0)
1 like
Comment
Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#5

13 Jun 2021, 14:31

And what to watch out for, when running "estat hettest" and "estat ovtest" for instance?
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17712
#6

13 Jun 2021, 23:14

Thomas:
short answer: do those test reject the null? If yes, you might have something to inspect more carefully.
Long(er) answer: doubts on OLS building blocks are obviously legal, but I would recommend you to take a look at any decent textbook on statistics/econometrics textbook to be 100% aware of what postestimation routines are for.

Kind regards,
Carlo
(Stata 19.0)
1 like
Comment
Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#7

14 Jun 2021, 05:16

Now I checked for "estat hettest" and this is what I found:

. estat hettest

Breusch-Pagan / Cook-Weisberg test for heteroskedasticity
Ho: Constant variance
Variables: fitted values of PercentageOfIntVolume

chi2(1) = 5.45
Prob > chi2 = 0.0196

----
As the chosen significance level is 5%, I imagine this is a problem, right?
Comment
Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#8

14 Jun 2021, 05:20

Also did the "ovtest", but this is above the significance level, hence I imagine it means that there are no latent omitted variables in the model. So I can proceed with it.

. estat ovtest

Ramsey RESET test using powers of the fitted values of PercentageOfIntVolume
Ho: model has no omitted variables
F(3, 86) = 2.00
Prob > F = 0.1205
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17712
#9

14 Jun 2021, 06:55

Thomas:
your postestimation tests highlight that you have heteroskedastcity in the residual dustribution; just invoke the -robust- option for your standard error.
After that, resist the temptation to run -hettest- again, because it will tell you that you still have heteroskedasticity (and you should ignore it), as the -robust- option affects standard errors to account for heteroskedasticity, but does not hqve any effect on the residual distribution (that remains as it was before invoking the -robust- option).
The -ovtest- is, unfortunately, less powerful that I hoped it to be when I was taught on statistics at the university (tons of years ago, my bad): it simply investigates whether power of the fitted values make any sense as regressors: if that is the case, there's something wrong with the functional form of the regressand; to9 have a more comprehensive idea, see -linktest- entry in Stata .pdf manual).

Kind regards,
Carlo
(Stata 19.0)
Comment
Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#10

14 Jun 2021, 07:02

Thank you.

I have now (after the robust regression) run the "linktest" and all of the t-tests are insignificant. So I imagine now we're fine and happy, right?
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17712
#11

14 Jun 2021, 07:12

Thomas:
yes, the -linktest- outcome shows no evidence of abnormalities in the functional form of the regressand.
Hence, you're only issue is heteroskedasticity, that you can manage with the -robust- option.

Kind regards,
Carlo
(Stata 19.0)
Comment
Thomas Lehnsherr

Join Date: Jun 2021

Posts: 17
#12

14 Jun 2021, 07:33

Thank you once again for patience in answering my questions.
Comment

Announcement

Stepwise regression as a robustness test after an OLS?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment