I am running a fixed effects regression with two time periods and 3,492 units. When I run a linear regression using reghdfe, I have no issues and no observations are dropped. When I run a Poisson regression with ppmlhdfe, Stata drops "6632 observations that are either singletons or separated by a fixed effect." My question is: why would Stata drop singletons for a Poisson fixed effects regression but not a linear fixed effects regression?
Also, it seems that most of the dropped observations in the Poisson regression are the units that experience no change in the dependent variable from period 1 to period 2. Why would Stata drop these observations? It seems to me that they still provide useful information.
Here is the code and output for reghdfe, which does not drop observations:
Here is the code and output for ppmlhdfe, which does drop observations:
Also, it seems that most of the dropped observations in the Poisson regression are the units that experience no change in the dependent variable from period 1 to period 2. Why would Stata drop these observations? It seems to me that they still provide useful information.
Here is the code and output for reghdfe, which does not drop observations:
Code:
reghdfe event treat_any, absorb(cell_id) cluster(cell_id) (MWFE estimator converged in 1 iterations) HDFE Linear regression Number of obs = 6,984 Absorbing 1 HDFE group F( 1, 3491) = 12.42 Statistics robust to heteroskedasticity Prob > F = 0.0004 R-squared = 0.6370 Adj R-squared = 0.2738 Within R-sq. = 0.0070 Number of clusters (cell_id) = 3,492 Root MSE = 0.3213 (Std. Err. adjusted for 3,492 clusters in cell_id) ------------------------------------------------------------------------------ | Robust event | Coef. Std. Err. t P>|t| [95% Conf. Interval] -------------+---------------------------------------------------------------- treat_any | -.0765679 .0217247 -3.52 0.000 -.1191623 -.0339735 _cons | .0614419 .0033764 18.20 0.000 .054822 .0680619 ------------------------------------------------------------------------------ Absorbed degrees of freedom: -----------------------------------------------------+ Absorbed FE | Categories - Redundant = Num. Coefs | -------------+---------------------------------------| cell_id | 3492 3492 0 *| -----------------------------------------------------+ * = FE nested within cluster; treated as redundant for DoF computation
Code:
ppmlhdfe event treat_any, absorb(cell_id) cluster(cell_id) (dropped 6632 observations that are either singletons or separated by a fixed effect) Iteration 1: deviance = 3.3482e+02 eps = . iters = 1 tol = 1.0e-04 min(eta) = -1.40 P Iteration 2: deviance = 3.1968e+02 eps = 4.74e-02 iters = 1 tol = 1.0e-04 min(eta) = -1.54 Iteration 3: deviance = 3.1938e+02 eps = 9.43e-04 iters = 1 tol = 1.0e-04 min(eta) = -1.56 Iteration 4: deviance = 3.1938e+02 eps = 9.97e-07 iters = 1 tol = 1.0e-04 min(eta) = -1.56 Iteration 5: deviance = 3.1938e+02 eps = 3.57e-12 iters = 1 tol = 1.0e-05 min(eta) = -1.56 S O ------------------------------------------------------------------------------------------------------------ (legend: p: exact partial-out s: exact solver h: step-halving o: epsilon below tolerance) Converged in 5 iterations and 5 HDFE sub-iterations (tol = 1.0e-08) HDFE PPML regression No. of obs = 352 Absorbing 1 HDFE group Residual df = 175 Statistics robust to heteroskedasticity Wald chi2(1) = 15.97 Deviance = 319.3822431 Prob > chi2 = 0.0001 Log pseudolikelihood = -392.1367457 Pseudo R2 = 0.2276 Number of clusters (cell_id)= 176 (Std. Err. adjusted for 176 clusters in cell_id) ------------------------------------------------------------------------------ | Robust event | Coef. Std. Err. z P>|z| [95% Conf. Interval] -------------+---------------------------------------------------------------- treat_any | -.894175 .2237562 -4.00 0.000 -1.332729 -.455621 _cons | .4578038 .0352768 12.98 0.000 .3886626 .5269451 ------------------------------------------------------------------------------ Absorbed degrees of freedom: -----------------------------------------------------+ Absorbed FE | Categories - Redundant = Num. Coefs | -------------+---------------------------------------| cell_id | 176 176 0 *| -----------------------------------------------------+ * = FE nested within cluster; treated as redundant for DoF computation
Comment