Dear all,
I have two questions that I need to understand, and I would really appreciate any help that I can get.
Firstly, I would like to understand why is it that -reghdfe- the 4.x version gives smaller standard erors, hence smaller p-values, than the -reghdfe- 3.x version? And which one would be the better to use?
Secondly, I have a control variable in my estimation, which is the lagged log of Income. I also created a categorical variable out of Income, and include it as an interaction with another (continous) variable. My question is, when Income is included as a categorical variable, should I still keep the lagged log of Income too, or exclude it from the regression?
My basic specification looks something like this:
now:
Would this make sense? I would instinctly exclude the logIncome_(t-1) from the regression, though I have some doubts, as I do not have much experience with empirical analysis.
I apologize if the question is a rather basic one, but would really appreciate any guidance!
Thank you in advance for your time.
I have two questions that I need to understand, and I would really appreciate any help that I can get.
Firstly, I would like to understand why is it that -reghdfe- the 4.x version gives smaller standard erors, hence smaller p-values, than the -reghdfe- 3.x version? And which one would be the better to use?
Secondly, I have a control variable in my estimation, which is the lagged log of Income. I also created a categorical variable out of Income, and include it as an interaction with another (continous) variable. My question is, when Income is included as a categorical variable, should I still keep the lagged log of Income too, or exclude it from the regression?
My basic specification looks something like this:
Code:
xtreg y x1 x2 x3 logIncome_(t-1), fe cluster(id)
Code:
xtreg y x1 c.x2##i.Incomecat x3 logIncome_(t-1), fe cluster(id)
I apologize if the question is a rather basic one, but would really appreciate any guidance!
Thank you in advance for your time.

Comment