Dear Stata-users,
As a researcher I came across a method used in some empirical studies where they use a two-step regression, where:
FIRST: y is regressed on a set of variables x (establishing a "normal" level - and the residual is the "abnormal" level)
SECOND: the residual is then regressed on a second set of variables z to test whether some of these variables can explain the "abnormal" level.
Now, my question is: isn't this residual a constructed variable - thus, making inferences invalid? You kind of give the first set of variables x the right to explain y, then less is left for the variables of interest z...
Can anyone help clarify why someone would choose to adopt this two-step regression?
Thanks in advance,
Ines
As a researcher I came across a method used in some empirical studies where they use a two-step regression, where:
FIRST: y is regressed on a set of variables x (establishing a "normal" level - and the residual is the "abnormal" level)
SECOND: the residual is then regressed on a second set of variables z to test whether some of these variables can explain the "abnormal" level.
Now, my question is: isn't this residual a constructed variable - thus, making inferences invalid? You kind of give the first set of variables x the right to explain y, then less is left for the variables of interest z...
Can anyone help clarify why someone would choose to adopt this two-step regression?
Thanks in advance,
Ines
Comment