I tried using forward stepwise regression regressing the variable Conservative7 on two macro-lists of variables, xissues and xidentity, like this:
stepwise, forward pr(.0000000105) pe(.00000001): regress conservative7 `xissues' `xidentity' [pweight=weights];
This is mixed forward and backwards regression. It proceeded normally. My problem is that it doesn't follow what the online description says it does. The procedure is supposed to start by finding the x-variable that has the highest t-statistic (or correlation, or R2--all equivalent) when Conservative7 is regressed on it. It doesn't. It picks a variable that is worse than at least two others. I realized this when I looked at the simple bivariate correlations, and verified the problem by using the regress command to run bivariate (with constant) regressions. I haven't checked manually to see what stepwise does at later stages.
Since people don't do stepwise to stop after finding just one variable, this may be an overlooked bug.
Anyone have any idea what might be happening?
stepwise, forward pr(.0000000105) pe(.00000001): regress conservative7 `xissues' `xidentity' [pweight=weights];
This is mixed forward and backwards regression. It proceeded normally. My problem is that it doesn't follow what the online description says it does. The procedure is supposed to start by finding the x-variable that has the highest t-statistic (or correlation, or R2--all equivalent) when Conservative7 is regressed on it. It doesn't. It picks a variable that is worse than at least two others. I realized this when I looked at the simple bivariate correlations, and verified the problem by using the regress command to run bivariate (with constant) regressions. I haven't checked manually to see what stepwise does at later stages.
Since people don't do stepwise to stop after finding just one variable, this may be an overlooked bug.
Anyone have any idea what might be happening?
Comment