Restrict parameters with two-way fixed effects

Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#1

Restrict parameters with two-way fixed effects

22 Jan 2023, 16:20

Hi all,

I'm trying to estimate a regression imposing the restriction that two parameters sum to one, and include two-way fixed effects.

GMM would be perfect for that, but I'm not sure how to include both fixed effects in it.

Notice that I have too many fixed effects to include them as dummies.

Do you have suggestions?

Thanks!

Last edited by Luca Gagliardone; 22 Jan 2023, 16:24.
Tags: gmm, panel, TWFE
Andrew Musau

Join Date: Oct 2014

Posts: 10195
#2

23 Jan 2023, 10:06

How many levels are there for each fixed effect?
Comment
Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#3

23 Jan 2023, 10:37

I have a very large number of levels, cannot be handled with dummies.

Baltagi 2001 (demeaning) or Woodford 2021 (Mundlak) should work, but the standard errors are all over the place.

Ideally I would like to write a program to reproduce the output of ivreghdfe with two absorbing dimensions, without having to rely on dummies.
Comment
Andrew Musau

Join Date: Oct 2014

Posts: 10195
#4

23 Jan 2023, 13:10

#2 asks for numbers.

a very large number

is ambiguous.
Comment
Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#5

23 Jan 2023, 13:44

Can you clarify why are you asking this?

Let's say I have 120001 levels.
Comment
Andrew Musau

Join Date: Oct 2014

Posts: 10195
#6

23 Jan 2023, 13:54

This is not a trick question and I am trying to be practical here. You state that you have 2 fixed effects. If this is panel data with an \(N\) and \(T\) dimension, you can be specific with \(N=xx\) and \(T=xx\).
1 like
Comment
Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#7

25 Jan 2023, 15:31

N = 5058 and T = 20891
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2168
#8

25 Jan 2023, 16:46

I'll assume you have explanatory variables x1 and x2 and you want their coefficients to sum to unit. To impose this, plug in the restriction: b2 = 1 - b1. So

y = b1*x1 + b2*x2 + ...

y = b1*x1 + (1 - b1)*x2 + ...

or

y - x2 = b1*(x1 - x2) + ...

So apply any econometric you want after defining a new dependent variable, y - x2, and a new explanatory variable, x1 - x2. For example:

Code:

gen y_x2 = y - x2 gen x1_x2 = x1 - x2 reghdfe y_x2 x1_x2, absorb(id time)

Of course, include any other explanatory variables, or use IV if that's what you want to do.
1 like
Comment
Joro Kolev

Join Date: Aug 2018

Posts: 3050
#9

25 Jan 2023, 23:22

What Prof. Wooldridge explains -- manually imposing the constraint -- is the best policy here.

However just out of curiosity, what did the comment in #3 about "standard errors all over the place" mean?

If the panel is balanced, another way to go would be to double demean all variables, and then impose the constraint through [R] cnsreg -- Constrained linear regression.
1 like
Comment
Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#10

26 Jan 2023, 13:50

Originally posted by Jeff Wooldridge View Post

I'll assume you have explanatory variables x1 and x2 and you want their coefficients to sum to unit.

Thanks for the reply! Unfortunately the trick does not work here, as the restriction is more complicated.

I've been working around the problem using reghdfe with just the FE to demean all the variables separately. Then used GMM to impose the restrictions directly on the Baltagi regression, and block bootstrap for the SE.

With the correct sample for all the steps, this replicates reghdfe in simulated data.

Let me please know if you spot something here that does not work conceptually.

Originally posted by Joro Kolev View Post

However just out of curiosity, what did the comment in #3 about "standard errors all over the place" mean?

I meant that I could not replicate the SE from reghdfe using the Baltagi or Mundlak regressions. But this is of course a know issue.
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2168
#11

27 Jan 2023, 08:22

I'm now confused. What do you mean "the restriction is more complicated"? Is it a nonlinear restriction, or involves many parameters? If it's linear you can always solve out for one parameter as a function of the others.
Comment
Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#12

27 Jan 2023, 09:16

Originally posted by Jeff Wooldridge View Post

I'm now confused. What do you mean "the restriction is more complicated"? Is it a nonlinear restriction, or involves many parameters? If it's linear you can always solve out for one parameter as a function of the others.

It's nonlinear and involves many parameters. The regression is

y = (1-a_0)*(1-a_1)*x_1 + (1-a_0)*a_1*x_2 + a_0*l1.y

where a_0 and a_1 are parameters to be estimated, x_1 and x_2 are regressors.

Sorry for the confusion, I was looking for a general approach to tackle flexibly this type of issues. Should have clarified that to begin with.

Originally posted by Joro Kolev View Post

If the panel is balanced, another way to go would be to double demean all variables, and then impose the constraint through [R] cnsreg -- Constrained linear regression.

Double demeaning and then running the GMM with the demeaned variables should work as well, if I'm not mistaken. Even in the case of unbalanced panel, provided that I'm selecting the correct sample in all the steps.

Last edited by Luca Gagliardone; 27 Jan 2023, 09:35.
Comment
Joro Kolev

Join Date: Aug 2018

Posts: 3050
#13

27 Jan 2023, 10:34

The double demeaning in balanced panel data is a trivial scalar operation explained in Baltagi's book on panel data, which is given in eq(2.2) in the reference I am going to give momentarily. The double demeaning in the unbalanced case is a mess, the formula is known but the operation is matrix based, and when once upon the time I tried to program this in Stata my head started spinning and I aborted the project. The reference for the double demeaning in the unbalanced case is
Wansbeek, T., & Kapteyn, A. (1989). Estimation of the error-components model with incomplete panels. Journal of Econometrics, 41(3), 341-361.
the procedure for double demeaning in unbalanced panels is eq.(2.9) to eq.(2.13).

If I were you I would do the following:

1. I would appeal to Frisch–Waugh–Lovell and residualise y, x1 and x2 in your equation with respect the two dimensional fixed effects, say using Correia's -reghdfe-.

2. Then I would use nonlinear least squares (-nl-) to fit the nonlinear in the parameters equation you display in #12, where I use the residualised y, x1, and x2.
1 like
Comment
Luca Gagliardone

Join Date: Oct 2016

Posts: 54
#14

27 Jan 2023, 10:57

Originally posted by Joro Kolev View Post

The double demeaning in balanced panel data is a trivial scalar operation explained in Baltagi's book on panel data, which is given in eq(2.2) in the reference I am going to give momentarily. The double demeaning in the unbalanced case is a mess, the formula is known but the operation is matrix based, and when once upon the time I tried to program this in Stata my head started spinning and I aborted the project. The reference for the double demeaning in the unbalanced case is
Wansbeek, T., & Kapteyn, A. (1989). Estimation of the error-components model with incomplete panels. Journal of Econometrics, 41(3), 341-361.
the procedure for double demeaning in unbalanced panels is eq.(2.9) to eq.(2.13).

If I were you I would do the following:

1. I would appeal to Frisch–Waugh–Lovell and residualise y, x1 and x2 in your equation with respect the two dimensional fixed effects, say using Correia's -reghdfe-.

2. Then I would use nonlinear least squares (-nl-) to fit the nonlinear in the parameters equation you display in #12, where I use the residualised y, x1, and x2.

Perfect, thank you!
Comment

Announcement

Restrict parameters with two-way fixed effects

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment