Dear Statalist,
I am using Stata SE Version 12, and researching trade flows. I ran an unclustered xtivreg2 regression with two dependent variables, with about 840,000 observations across 11 years. However, as part of my PhD viva I was advised to cluster the standard errors by importer-exporter pair. Doing that yielded the same coefficients with larger standard errors, as expected .However, what confused me was the significant change in each of the first stage test statistics (Angrist-Pischke F test, Angrist-Pischke χ², Kleibergen-Paap Wald rk F statistic, Kleibergen-Paap rk LM statistic). For example, for one of the variables, the Angrist-Pischke F test went from 3058.37 before clustering to 9.67 after.
Here are the two regressions run.
xtivreg2 y1 (endog1 endog2 = IV1 IV2) x1 x2 x3 x4 x5 x6 x7 x8 yeardum *, fe robust
xtivreg2 y1 (endog1 endog2 = IV1 IV2) x1 x2 x3 x4 x5 x6 x7 x8 yeardum*, fe cluster(pair_num) robust
Do the calcuations for the various tests use the standard errors? Or is there something I’m overlooking about using xtivreg2 with cluster?
I would be grateful for any illumination or insight anyone could provide.
Thank you
I am using Stata SE Version 12, and researching trade flows. I ran an unclustered xtivreg2 regression with two dependent variables, with about 840,000 observations across 11 years. However, as part of my PhD viva I was advised to cluster the standard errors by importer-exporter pair. Doing that yielded the same coefficients with larger standard errors, as expected .However, what confused me was the significant change in each of the first stage test statistics (Angrist-Pischke F test, Angrist-Pischke χ², Kleibergen-Paap Wald rk F statistic, Kleibergen-Paap rk LM statistic). For example, for one of the variables, the Angrist-Pischke F test went from 3058.37 before clustering to 9.67 after.
Here are the two regressions run.
xtivreg2 y1 (endog1 endog2 = IV1 IV2) x1 x2 x3 x4 x5 x6 x7 x8 yeardum *, fe robust
xtivreg2 y1 (endog1 endog2 = IV1 IV2) x1 x2 x3 x4 x5 x6 x7 x8 yeardum*, fe cluster(pair_num) robust
Do the calcuations for the various tests use the standard errors? Or is there something I’m overlooking about using xtivreg2 with cluster?
I would be grateful for any illumination or insight anyone could provide.
Thank you
Comment