Many thanks Tom, your paper does indeed very specifically address my issue, and will be very helpful in writing up the discussion about these estimation techniques in my article.
I have some further questions now, regarding your ppmlhdfe command.
For my project, I divide the world's countries into 2 groups (A and B).
I also divide trade value into two groups (interesting and not-interesting), so i have two observations for each country-pair and year.
The question I have is whether trade in the goods of interest are higher (i.e., higher shares) between A->A, A->B, B->A, or B->B.
For this purpose I create dummies for the interaction (as ppml did not allow factor variables), and drop dummies that I consider baselines (AA and 'not-interesting').
I then estimate:
If I do this in ppml (with time-varying country dummies, but excluding country-pair dummies), I get an estimate for all dummies.
If I use ppmlhdfe with the same variables, two dummies are dropped because of collinearity, and I cannot figure out why this would be. Do you have any insight?
I have some further questions now, regarding your ppmlhdfe command.
For my project, I divide the world's countries into 2 groups (A and B).
I also divide trade value into two groups (interesting and not-interesting), so i have two observations for each country-pair and year.
The question I have is whether trade in the goods of interest are higher (i.e., higher shares) between A->A, A->B, B->A, or B->B.
For this purpose I create dummies for the interaction (as ppml did not allow factor variables), and drop dummies that I consider baselines (AA and 'not-interesting').
I then estimate:
Code:
ppmlhdfe tradevalueusd AB BA BB interesting intAB intBA intBB, a(expf#year impf#year expf#impf) cluster(expf#impf) nolog
If I use ppmlhdfe with the same variables, two dummies are dropped because of collinearity, and I cannot figure out why this would be. Do you have any insight?
Code:
. ppmlhdfe tradevalueusd AB BA BB interesting intAB intBA intBB if year>2010, a(expf#year impf#year expf#impf) cluster(expf#impf) nolog (dropped 181656 observations that are either singletons or separated by a fixed effect) warning: dependent variable takes very low values after standardizing (2.1388e-10) note: 2 variables omitted because of collinearity: BA BB Converged in 20 iterations and 104 HDFE sub-iterations (tol = 1.0e-08) HDFE PPML regression No. of obs = 142,680 Absorbing 3 HDFE groups Residual df = 25,452 Statistics robust to heteroskedasticity Wald chi2(5) = 8356.41 Deviance = 8.93317e+11 Prob > chi2 = 0.0000 Log pseudolikelihood = -4.46659e+11 Pseudo R2 = 0.9975 Number of clusters (expf#impf)= 25,453 (Std. Err. adjusted for 25,453 clusters in expf#impf) ------------------------------------------------------------------------------ | Robust tradevalue~d | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- AB | -.0059766 .0827166 -0.07 0.942 -.1680982 .156145 BA | 0 (omitted) BB | 0 (omitted) interesting | -4.99851 .0725082 -68.94 0.000 -5.140623 -4.856396 intAB | -.2686544 .1358391 -1.98 0.048 -.5348942 -.0024146 intBA | .0959186 .1709414 0.56 0.575 -.2391204 .4309577 intBB | -.0540459 .2374066 -0.23 0.820 -.5193544 .4112625 _cons | 23.61517 .0165171 1429.74 0.000 23.5828 23.64754 ------------------------------------------------------------------------------ Absorbed degrees of freedom: -----------------------------------------------------+ Absorbed FE | Categories - Redundant = Num. Coefs | -------------+---------------------------------------| expf#year | 453 0 453 | impf#year | 693 3 690 | expf#impf | 25453 25453 0 *| -----------------------------------------------------+ * = FE nested within cluster; treated as redundant for DoF computation
Comment