ib. operator won't omit correct category

Igze Pmuts

Join Date: Mar 2025

Posts: 1
#1

ib. operator won't omit correct category

22 Mar 2025, 09:14

I am trying to run a difference-in-difference regression and see the coefficient relative to the control group before time of treatment. This DID has three categories: RGGI, Leaker, and Control.

> reghdfe log_netgen b1.category_num#i.after_RGGI, absorb(plantstate obsyear) vce(cluster plantstate) where category_num represents a type of state (RGGI, Leaker, or Control) and after_RGGI is a dummy variable where 1 means the date in the data is after 2009.

My aim is to see the coefficients for Leaker#1 and RGGI#1, so I specified b1 as the base category, as 1 = Control for my category_num variable. Stata gives the following output:

HDFE Linear regression Number of obs = 183,543
Absorbing 2 HDFE groups F( 2, 50) = 13.59
Statistics robust to heteroskedasticity Prob > F = 0.0000
R-squared = 0.0364
Adj R-squared = 0.0360
Within R-sq. = 0.0011
Number of clusters (plantstate_num) = 51 Root MSE = 3.1621

(Std. err. adjusted for 51 clusters in plantstate_num)
---------------------------------------------------------------------------------------------
| Robust
log_netgen | Coefficient std. err. t P>|t| [95% conf. interval]
----------------------------+----------------------------------------------------------------
category_numeric#after_RGGI |
Control#1 | .6075423 .1186053 5.12 0.000 .3693165 .8457681
Leaker#0 | -.697489 .2695983 -2.59 0.013 -1.238993 -.1559848
Leaker#1 | 0 (omitted)
RGGI#0 | 0 (omitted)
RGGI#1 | 0 (omitted)
|
_cons | 9.601487 .0604883 158.73 0.000 9.479993 9.722982
---------------------------------------------------------------------------------------------

Why is the regression omitting Leaker#1, RGGI#0, and RGGI#1 instead of omitting Control and 0 (since i.after_RGGI would usually make 0 the base category)

Basically, how can I make my regression output give me RGGI#1 and Leaker #1, omitting all other combinations? Thank you!
Tags: None
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17712
#2

22 Mar 2025, 09:22

Igze:
welcome to this forum.
Some comments about your post:
1) as per FAQ, you're supposed to use the last available Stata release, which contains -didregress- and -xtdidregress- commands for DID;
2) withouth and excerpt/example of your dataset (see -dataex- as per FAQ again), it is difficult (for me, at least) to reply positively;
3) as per FAQ again, please use CODE delimiters to share what you typed and what Stata gave you back. Thanks.

Kind regards,
Carlo
(Stata 19.0)
Comment
Hemanshu Kumar

Join Date: Mar 2015

Posts: 1403
#3

22 Mar 2025, 09:54

A couple quick things:
Control#0 is in fact omitted, as you can see from your table.

It is likely that other interactions are being omitted due to multicollinearity.

You should probably control for the levels, not just the interactions: use ##, not # in your reghdfe command. Please show us the results of this regression, using CODE delimiters as suggested in #2.
1 like
Comment

Announcement

ib. operator won't omit correct category

Comment

Comment