  • Sebastian,

    I am using xtdpdgmm for system GMM models having ten independent and control variables. I have the following questions:

    1. I find that I can sometimes improve model fit in terms of overidentification, underidentification, and AIC and BIC if I use (a) different instrument lag ranges across variables (e.g., x1 uses lag(1 1) while x2 uses lag(1 3)), and (b) different lag ranges between the fod equation and the level equation for a single variable (e.g., x4 uses lag(1 2) in the fod equation and lag(1 3) in the level equation). Is there anything improper in doing this? I hadn't thought there was until reading your response to a question in #395 about "cherry picking" (which I realize related to comparing system GMM results with difference GMM results, a different issue from mine).

    2. In my project, I am proposing that my dependent variable not only is acted upon by my main independent variable of interest but also acts in the opposite direction on that independent variable. There is strong theoretical support for the former; the theory for the latter is novel, but I believe I can support it. In using system GMM for the latter, is there anything special I can do to make a stronger argument that the reverse-direction effects are not spurious? One thing I have tried is starting the instrument lags for the formerly dependent variable, now independent variable, at lag 3 rather than lag 1 (this produces good results, as does starting at lag 4).

    Thanks.



    • 1. From a theoretical perspective, as long as your instruments are valid, your estimator will be consistent. However, none of the statistics you mentioned (overidentification, underidentification, AIC/BIC) is a qualified procedure for selecting, among instruments that are all valid, a subset that improves the fit. These tests have no asymptotic power for discriminating among the models you are comparing. Nevertheless, limiting the maximum lag order can be beneficial to reduce weak-instruments problems. Appropriate instrument selection procedures for this purpose are unfortunately not implemented in Stata. From an applied perspective, a reviewer or reader might wonder about the rationale behind your choices. Picking models in the way you have done could be interpreted as cherry picking or data mining unless you have a good justification, say, that higher-order lags of x1 are relatively weaker than higher-order lags of x2. For the level model in a system GMM approach, it is quite uncommon to use any higher-order lags. It is not necessarily wrong, but it can be hard to justify, especially when you are not consistent with your choice across variables and/or specifications.

      2. Instead of starting with higher-order lags (which might be relatively weak instruments), my suggestion would be to include lags of that variable as additional regressors. This way, you can check whether there are contemporaneous and/or lagged effects.
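
      As a minimal sketch of this suggestion (the variables y and x are placeholders, not from the model above), including the first lag of the regressor alongside its contemporaneous value might look like:

      Code:
      xtdpdgmm L(0/1).y L(0/1).x, model(fod) collapse gmm(y, lag(1 3)) gmm(x, lag(1 3)) two vce(r)

      The coefficients on x and L.x then separate the contemporaneous effect from the lagged effect.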
      https://twitter.com/Kripfganz



      • New week, new update, new feature: Version 2.6.1 of xtdpdgmm is shipped with the new postestimation command estat serialpm, which computes the Jochmans (2020) portmanteau test for absence of serial correlation in the idiosyncratic error component. Unlike the Arellano and Bond (1991) test - implemented in estat serial - the portmanteau test does not test for autocorrelation of the first-differenced residuals at a specific order (e.g. second order), but jointly tests for the absence of autocorrelation of the idiosyncratic level errors at any order. (Technical comment: Because the portmanteau test involves level residuals, it is not invariant to the exclusion of an intercept.) The Jochmans portmanteau test can be a more powerful alternative to the Arellano-Bond test, especially if T is relatively small or if there is very strong serial correlation (close to a unit root).

        In this new version, the option ar() of estat serial has been renamed to order(). The former name was borrowed from xtdpd and xtabond2, but it is confusing because this is not actually a test for autoregressive residuals. For backward compatibility, the ar() option continues to work, but it is no longer documented.
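
        For illustration, both tests can be run after estimation; the model below is a hypothetical sketch (y and x are placeholder variables, not a recommended specification):

        Code:
        xtdpdgmm L(0/1).y x, model(fod) collapse gmm(y, lag(1 3)) gmm(x, lag(0 2)) two vce(r)
        estat serialpm
        estat serial, order(2)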

        To install the latest version of the command, type the following in Stata's command window:
        Code:
        net install xtdpdgmm, from(http://www.kripfganz.de/stata) replace
        References:
        • Arellano, M., and S. R. Bond (1991). Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations. Review of Economic Studies 58, 277-297.
        • Jochmans, K. (2020). Testing for correlation in error-component models. Journal of Applied Econometrics 35, 860-878.
        https://twitter.com/Kripfganz



        • As always, thanks for your help. I have two additional questions that relate to the answers you provided for my two previous questions.

          1. I would like clarification of this part of your answer: "For the level model in a system GMM approach, it is quite uncommon to use any higher-order lags." Do you mean by this that when structuring the instrument lags, the level lags should remain fixed at a low maximum (e.g., 1) no matter what the maximum instrument lag might be for the fod model (e.g., 2)? Or do you mean instead that the maximum instrument lag for the level model should normally be the same as provided in the fod model?

          2. For my reverse-direction model, I have been using one lag of the independent variable of interest as a regressor. I obtain very substantial and statistically significant coefficients for each of the contemporaneous and lagged variables, but the coefficient for the former is positive and for the latter negative. I don't obtain anything satisfactory when I use a second lag. I would have confidence in my model (it passes overidentification and underidentification tests) were it not for the fact that I also have constructed a model where the effects go in the opposite direction. Other than convincing a reader or reviewer that the reverse direction is supported by theory, is there anything else I need to worry about in constructing models that work in opposite directions?



          • Sebastian Kripfganz,
            There seems to be an error when trying to run the Jochmans portmanteau test. Here is the error message:
            Code:
              xtdpdgmm_serial_pm():  3021  class compiled at different times
                             <istmt>:     -  function returned error
            r(3021);



            • Nicu Sprincean
              This happens if the previous version of the command is still in memory when you perform the update. There are two solutions:
              (a) restart Stata, or (b) run the command clear mata.

              Joseph L. Staats
              1. If you were using all available lags for the FOD model, then additional lags (beyond the first lag) of instruments in levels are redundant. (Essentially, you would be introducing perfect collinearity among the instruments.) Technically, this is no longer the case when you restrict the maximum lag order for the FOD model. However, in empirical practice there is hardly ever more than one lag used for the model in levels.
              2. It is quite common to observe opposite signs for the coefficients of the contemporaneous and the lagged variables. This often indicates that a large initial effect is dampened (or sometimes entirely counteracted) in the following period. It is perfectly fine to have effects going in both directions; this is the very nature of simultaneity as a form of endogeneity. When you construct models with effects that go in opposite directions, you achieve identification by using lags of the respective regressors as instruments.
              https://twitter.com/Kripfganz



              • Thanks to Kit Baum, the latest version 2.6.2 of xtdpdgmm with all the updates from the recent weeks is now also available on SSC.
                https://twitter.com/Kripfganz



                • Dear Professor Sebastian,

                  I am using Stata 14. My data set is an unbalanced panel covering 22 years and 5,084 firms. My model includes 10 explanatory variables (L.x1, L.x2, L.x3, L.x4, L.x5, L.x6, L.x7, L.x8, L.x9, x10). The dependent variable y is a limited dependent variable, truncated between zero and one.

                  I have a dynamic model (my regression model includes the lagged dependent variable L.y as a regressor).

                  1) I will apply the Difference GMM estimator using your command ‘xtdpdgmm’. I will consider the lagged dependent variable L.y as endogenous, the independent variable L.x1 as endogenous, while the variables L.x2, L.x3, L.x4, L.x5, L.x6, L.x7, L.x8, L.x9 as predetermined, and the variable x10 (firm age) as exogenous. Thus, my first question is: which of the following codes is/are correct and I can use to implement the Difference GMM estimator?


                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(. .)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 2)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(0 2)) gmm(x10, lag(0 0)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(0 2)) gmm(x10, lag(0 0)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(0 2)) gmm(x10, lag(. .)) ///
                  > nocons two vce(r)

                  . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(2 .)) gmm(L.x1, lag(2 .)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(0 .)) gmm(x10, lag(. .)) ///
                  > nocons two vce(r)


                  2) If none of the previous codes is correct, what is the correct code I have to use in order to implement the Difference GMM estimator?

                  3) What is the contemporaneous term of the lagged control variable? For instance, is x5 (i.e., the variable x5 at time t) the contemporaneous value of the lagged control variable L.x5, or is it L.x5 itself (i.e., the variable x5 at time t minus 1)?

                  Sorry for the long message.
                  Thank you in advance.



                  • One would normally consider the lagged dependent variable as predetermined, not endogenous. In this case, all codes would be "correct", although the first two codes would be preferable because they already use the second lag of y (i.e. the first lag of L.y) as an instrument, which is stronger than only the second lag of L.y (i.e. the third lag of y). As far as I can see, the first two codes are equivalent.

                    With the latest version of the xtdpdgmm package, you can use the xtdpdgmmfe command to specify predetermined and endogenous variables. It will also show you the appropriate xtdpdgmm code; see my post #450 above.

                    I would consider L.x5 itself to be the "contemporaneous value of the lagged control variable" L.x5; x5 is the "contemporaneous value of the control variable" x5.
                    https://twitter.com/Kripfganz



                    • Dear Professor Sebastian,

                      Thank you for your reply.

                      In the first gmm() option, do I have to put the dependent variable y itself?

                      . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                      > nocons two vce(r)

                      or do I have to put the lagged dependent variable L.y (the regressor)?

                      . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(1 3)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                      > nocons two vce(r)

                      Your cooperation is highly appreciated.



                      • Both codes are equivalent. You may choose whichever you find more intuitive.
                        https://twitter.com/Kripfganz



                        • Dear Professor Sebastian,

                          Thank you for your response.

                          1) To implement the Difference GMM estimator using your command ‘xtdpdgmm’, I wonder whether the following codes are different or equivalent.

                          . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(L.y, lag(1 3)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                          > nocons two vce(r)

                          . xtdpdgmm y L.y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                          > nocons two vce(r)

                          . xtdpdgmm y L.(y x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                          > nocons two vce(r)

                          Where:
                          y is the dependent variable;
                          L.y is the lagged dependent variable as a regressor;
                          L.x1 is the independent variable;
                          L.x2, L.x3, L.x4, L.x5, L.x6, L.x7, L.x8, L.x9, x10 are the control variables.

                          2) Is there any difference between the options: gmmiv( ), gmm( ), and iv( )?

                          I do appreciate your cooperation.
                          Last edited by Zainab Mariam; 15 Aug 2022, 08:06.



                          • All three codes appear to be equivalent.

                            gmm() is just an abbreviation of gmmiv(). iv() is a collapsed version of gmmiv(). The help file states:
                            gmmiv(varlist, lagrange(#_1 #_2) collapse) is equivalent to iv(varlist, lagrange(#_1 #_2)).
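                             As a small illustration (z is a hypothetical instrument variable), the following two specifications, per the help file, generate the same collapsed instrument set:

                             Code:
                             gmmiv(z, lagrange(1 2) collapse)
                             iv(z, lagrange(1 2))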
                            https://twitter.com/Kripfganz



                            • Dear Professor Sebastian,

                              Many thanks for your reply.

                              1) To implement the Difference GMM estimator using your command ‘xtdpdgmm’, do I have to mention the option model(diff) only once (specifically, after mentioning the model’s variables)?

                              . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                              > nocons two vce(r)

                              Or do I have to mention the option model(diff) within each gmm() option?

                              . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, collapse gmm(y, lag(2 4) model(diff)) gmm(L.x1, lag(2 4) model(diff)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3) model(diff)) gmm(x10, lag(0 0) model(diff)) ///
                              > nocons two vce(r)

                              2) To implement the Difference GMM estimator using your command ‘xtdpdgmm’, do I have to mention the option collapse only once (specifically, before the first gmm() option)?

                              . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) ///
                              > nocons two vce(r)

                              Or do I have to mention the option collapse within each gmm() option?

                              . xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) gmm(y, lag(2 4) collapse) gmm(L.x1, lag(2 4) collapse) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3) collapse) gmm(x10, lag(0 0) collapse) ///
                              > nocons two vce(r)


                              3) To implement the Difference GMM estimator using your command ‘xtdpdgmm’, can I include dummies (such as industry, country and year dummies) in my regression model? If so, what do I have to include in the code?

                              Your cooperation is highly appreciated.



                              • 1) For the difference-GMM estimator, the two versions are equivalent. In the first version, the "global" option model(diff) sets the default for all gmm() options.

                                2) Similarly, the "global" option collapse sets the default for all gmm() options. Thus, the two versions are again equivalent.

                                3) You would add the dummies as additional regressors and as instruments in an iv() option. For time dummies, you can alternatively combine the two options teffects and nolevel.
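
                                For instance, handling time dummies via teffects and nolevel could look like the following hedged sketch, which reuses the variable names from the question above (the lag ranges are illustrative only):

                                Code:
                                xtdpdgmm L(0/1).y L.(x1 x2 x3 x4 x5 x6 x7 x8 x9) x10, model(diff) collapse gmm(y, lag(2 4)) gmm(L.x1, lag(2 4)) gmm(L.x2 L.x3 L.x4 L.x5 L.x6 L.x7 L.x8 L.x9, lag(1 3)) gmm(x10, lag(0 0)) teffects nolevel two vce(r)

                                Here teffects adds the time dummies as regressors and instruments automatically, and nolevel drops the level model.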
                                https://twitter.com/Kripfganz

