XTDPDGMM: new Stata command for efficient GMM estimation of linear (dynamic) panel models with nonlinear moment conditions

Sebastian Kripfganz replied

24 Dec 2023, 06:23
Arkangel Cordero

There are 3 issues here:
1) When replicating the xtdpdgmm results with ivreg2, you need to specify the untransformed variables.
2) You need to specify all untransformed regressors as endogenous, including time dummies.
3) ivreg2 applies collinearity checks that drop some of the instruments, even though it should not. You need to specify the nocollin option with ivreg2.

Code:

webuse abdata xtdpdgmm L(0/1).n, model(diff) gmm(L.n, l(2 5)) nocons vce(cluster id) teffects nolevel twostep quietly predict iv*, iv ivreg2 n (L1.n i.year = iv*), noconstant gmm2s cluster(id) nocollin
Leave a comment:

Arkangel Cordero replied

19 Dec 2023, 12:44

Dear Professor @Sebastian Kripfganz,

To provide some more context. I am interested in estimating a "conventional" difference gmm model (as in your 1.d. in your Post # 481) . I have been able to replicate the results of your xtdpdgmm command with with "teffects" and "year-dummies" and xtabond2 with year-dummies as below.

Code:

webuse abdata
xtdpdgmm L(0/1).n ,                    model(diff) gmm(L.n , l(2 5))  nocons   vce(cluster id)  teffects            nolevel      twostep overid
estimate store m_xtdpdgmm_teffects
estat overid

xtdpdgmm L(0/1).n  (yr1978 - yr1984),  model(diff) gmm(L.n , l(2 5))  nocons   vce(cluster id)  iv(yr1978 - yr1984) nolevel      twostep overid
estimate store m_xtdpdgmm_timeDum
estat overid

xtabond2 L(0/1).n (yr1978 - yr1984), gmm(L.n , lag(2 5)) iv(yr1978 - yr1984) nocons nolevel cluster(id) twostep ar(6)
estimate store m_xtbound2_timeDum

esttab m_xtdpdgmm_teffects m_xtdpdgmm_timeDum m_xtbound2_timeDum

Results:

HTML Code:

------------------------------------------------------------
                      (1)             (2)             (3)  
                        n               n               n  
------------------------------------------------------------
L.n                 0.698**         0.698**         0.698**
                   (2.99)          (2.99)          (2.99)  

1978.year         -0.0103                                  
                  (-0.84)                                  

1979.year         -0.0107                                  
                  (-0.71)                                  

1980.year         -0.0528**                                
                  (-3.19)                                  

1981.year          -0.156***                                
                  (-7.87)                                  

1982.year          -0.171***                                
                  (-4.72)                                  

1983.year          -0.155*                                  
                  (-2.47)                                  

1984.year          -0.153*                                  
                  (-2.21)                                  

yr1978                            -0.0103         -0.0103  
                                  (-0.84)         (-0.84)  

yr1979                            -0.0107         -0.0107  
                                  (-0.71)         (-0.71)  

yr1980                            -0.0528**       -0.0528**
                                  (-3.19)         (-3.19)  

yr1981                             -0.156***       -0.156***
                                  (-7.87)         (-7.87)  

yr1982                             -0.171***       -0.171***
                                  (-4.72)         (-4.72)  

yr1983                             -0.155*         -0.155*  
                                  (-2.47)         (-2.47)  

yr1984                             -0.153*         -0.153*  
                                  (-2.21)         (-2.21)  
------------------------------------------------------------
N                     891             891             751  
------------------------------------------------------------
t statistics in parentheses
* p<0.05, ** p<0.01, *** p<0.001

What I am trying to do is replicate these results with "ivreg2" (as per your 2019 presentation in slides 39 -42) in order to obtain the under-identification and weak-identification tests available in the latter. I have tried the following, without success.

Code:

xtdpdgmm L(0/1).n ,   model(diff) gmm(L.n , l(2 5))  nocons   vce(cluster id)  teffects    nolevel   twostep overid
estimate store m_xtdpdgmm_teffects
estat overid
drop iv*
quietly predict iv*, iv

ivreg2 n iv19-iv25 (L1.n   = iv1 - iv18 iv19-iv25),  gmm2s noconstant  first  cluster(id) endog(L1.n)
estimate store m_ivreg2_lev1

ivreg2 n iv19-iv25 (L1.n   = iv1 - iv18), partial(iv19-iv25) gmm2s noconstant  first  cluster(id) endog(L1.n)
estimate store m_ivreg2_lev2

ivreg2 D1.n D1.(iv19-iv25) (D1.L1.n   = D1.(iv1 - iv18 iv19-iv25)),  gmm2s noconstant  first  cluster(id) endog(D1.L1.n)
estimate store m_ivreg2_dif1

ivreg2 D1.n D1.(iv19-iv25) (D1.L1.n   = D1.(iv1 - iv18)), partial(D1.(iv19-iv25)) gmm2s noconstant  first  cluster(id) endog(D1.L1.n)
estimate store m_ivreg2_dif2


esttab  m_ivreg2_lev1  m_ivreg2_dif1  m_ivreg2_lev2  m_ivreg2_dif2

With the following results:

HTML Code:


--------------------------------------------------------------------------------------------
                      (1)             (2)             (3)             (4)             (5)  
                        n               n             D.n               n             D.n  
--------------------------------------------------------------------------------------------
L.n                 0.698**         0.953***                        0.953***                
                   (2.99)          (9.75)                          (9.75)                  

LD.n                                                0.172                           0.172  
                                                   (1.88)                          (1.88)  

1978.year         -0.0103                                                                  
                  (-0.84)                                                                  

1979.year         -0.0107                                                                  
                  (-0.71)                                                                  

1980.year         -0.0528**                                                                
                  (-3.19)                                                                  

1981.year          -0.156***                                                                
                  (-7.87)                                                                  

1982.year          -0.171***                                                                
                  (-4.72)                                                                  

1983.year          -0.155*                                                                  
                  (-2.47)                                                                  

1984.year          -0.153*                                                                  
                  (-2.21)                                                                  

iv19                              -0.0488***                                                
                                  (-5.18)                                                  

iv20                              -0.0847***                                                
                                  (-8.13)                                                  

iv21                               -0.137***                                                
                                  (-9.34)                                                  

iv22                               -0.149***                                                
                                  (-7.47)                                                  

iv23                              -0.0684***                                                
                                  (-3.44)                                                  

iv24                             -0.00832                                                  
                                  (-0.46)                                                  

iv25                             0.000355                                                  
                                   (0.03)                                                  

D.iv19                                            -0.0348***                                
                                                  (-6.05)                                  

D.iv20                                            -0.0749***                                
                                                  (-7.22)                                  

D.iv21                                             -0.143***                                
                                                  (-8.58)                                  

D.iv22                                             -0.191***                                
                                                  (-9.15)                                  

D.iv23                                             -0.143***                                
                                                  (-7.71)                                  

D.iv24                                            -0.0736***                                
                                                  (-4.95)                                  

D.iv25                                            -0.0311**                                
                                                  (-3.07)                                  
--------------------------------------------------------------------------------------------
N                     891             891             751             891             751  
--------------------------------------------------------------------------------------------
t statistics in parentheses
* p<0.05, ** p<0.01, *** p<0.001

I have only been able to replicate the results of an "unconventional" difference gmm model ( as in 3. in your Post # 482), i.e. one in which the time dummies are instruments for the untransformed model, as shown below.

Code:

xtdpdgmm L(0/1).n ,                    model(diff) gmm(L.n , l(2 5))  nocons   vce(cluster id)  teffects                  twostep overid
estimate store m_xtdpdgmm_teffects
estat overid
drop iv*
quietly predict iv*, iv 

ivreg2 n iv19-iv26 (L1.n   = iv1 - iv18 iv19-iv26),  gmm2s noconstant  first  cluster(id) endog(L1.n) 
estimate store m_ivreg2_lev1

ivreg2 n iv19-iv26 (L1.n   = iv1 - iv18), partial(iv19-iv26) gmm2s noconstant  first  cluster(id) endog(L1.n) 
estimate store m_ivreg2_lev2

HTML Code:

esttab m_xtdpdgmm_teffects m_ivreg2_lev1  m_ivreg2_lev2  


------------------------------------------------------------
                      (1)             (2)             (3)  
                        n               n               n  
------------------------------------------------------------
L.n                 0.946***        0.946***        0.946***
                  (20.75)         (37.89)         (37.89)  

1977.year          0.0885                                  
                   (1.63)                                  

1978.year          0.0732                                  
                   (1.45)                                  

1979.year          0.0674                                  
                   (1.35)                                  

1980.year          0.0236                                  
                   (0.47)                                  

1981.year         -0.0704                                  
                  (-1.37)                                  

1982.year         -0.0563                                  
                  (-1.27)                                  

1983.year         -0.0122                                  
                  (-0.33)                                  

1984.year         -0.0172                                  
                  (-0.59)                                  

iv19                               0.0885**                
                                   (2.70)                  

iv20                               0.0732*                  
                                   (2.48)                  

iv21                               0.0674*                  
                                   (2.31)                  

iv22                               0.0236                  
                                   (0.81)                  

iv23                              -0.0704*                  
                                  (-2.33)                  

iv24                              -0.0563*                  
                                  (-2.15)                  

iv25                              -0.0122                  
                                  (-0.57)                  

iv26                              -0.0172                  
                                  (-0.87)                  
------------------------------------------------------------
N                     891             891             891  
------------------------------------------------------------
t statistics in parentheses
* p<0.05, ** p<0.01, *** p<0.001

I wonder if you could please provide some guidance in being able to replicate the results for a "conventional" (as in your 1.d. in your Post # 481, i.e., with the time-dummies as instruments in the transformed--not the level--model) difference gmm model with ivreg2?

Thank you in advance for any help in this regard.

Last edited by Arkangel Cordero; 19 Dec 2023, 12:49.

Leave a comment:

Sebastian Kripfganz replied

09 Dec 2023, 06:16
A substantial difference between those two test statistics suggests that the weighting matrix might be imprecisely estimated. A solution might be to use the iterated GMM estimator. But it could also be a symptom of a relatively small sample size or weak instruments.
Leave a comment:
Nursena Sagir replied

07 Dec 2023, 07:48
Sebastian Kripfganz Dear Sebastian, I have a question to you about Sargan-Hansen test. How should I interpret diverged 2-step and 3-step weighting matrix results like statistics below?

2-step moment functions, 2-step weighting matrix chi2(2) = 3.1408
Prob > chi2 = 0.2080

2-step moment functions, 3-step weighting matrix chi2(2) = 11.5937
Prob > chi2 = 0.0030

Thanks in advance!

Best regards,
Nursena
Leave a comment:
Sebastian Kripfganz replied

27 Oct 2023, 04:00
Originally posted by Jupp Peters View Post

Also, I struggle to find an initial candidate model that passes the specification tests. I assume this is due to the very large N and that the specification tests can already detect relatively small deviations from the null hypotheses.

This would normally be my answer, yes.

If statistical tests do not seem to be helpful, you might have to resort to economic theory as a guide for model specification / variable classification.

Another possibility might be to consider smaller subsets of the data where the Hansen test has less power. This might sound odd, but if the Hansen test does not detect small deviations from the null hypothesis anymore, it might be helpful to single out models with more severe model specifications.

Forward-orthogonal deviations indeed seem to be appropriate given the unbalanced nature of the panel.

With such a large sample size, the two-step estimator might deliver substantial efficiency gains and is therefore highly recommended.
Leave a comment:
Jupp Peters replied

27 Oct 2023, 01:32
Dear Sebastian Kripfganz,

thank you so much for your help. I have one more question that I hope may add to the collection of advice you have already given in this thread.

I struggle to find a model specification. This is a summary of my sample:
N = ~25.000 households

T = 12 years

~150.000 observations

Unbalanced panel:
Minimum observations per household: 1

Maximum observations per household: 12

The goal is to estimate the relationship between consumption of a good (dependent variable) and (1) price of this good, (2) household income, and (3) various household covariates such as household size, employment status of household head, dwelling type, ... + federal state and time dummies. Some of the household covariates are binary or categorical variables. I am also interested in the interaction effect of price and household income.

I tested fixed and random effects models as benchmarks with the following specifications:

Code:

L(0/1)y c.x_price##c.x_income X_covariates time_effects

I would like to have the same combinations of regressors in the GMM model to compare the results with the fixed and random effects models.

I started the model selection process. However, it seems impractical in my case as a single estimation run lasts ~20 minutes. Also, I struggle to find an initial candidate model that passes the specification tests. I assume this is due to the very large N and that the specification tests can already detect relatively small deviations from the null hypotheses.

My questions are:
Do you have any advice on how to configure IV lags and how to specify the variable types (endogenous/predetermined/exogenous) for the different variables?

Would you agree that I should use forward orthogonal deviations because of the unbalanced panel?

Would you agree that I should use the two-step estimator because of the large sample size?

I really appreciate any help you can provide.

Jupp
Leave a comment:
Sebastian Kripfganz replied

21 Sep 2023, 12:35
Zeenat, Rupali: Apologies; I do not currently have the time to respond to longer queries.

Jupp: If this regressor is not one of your main regressors of interest, you could remove it entirely if all coefficients have high p-values. In principle, you could retain the respective instruments if you believe that they are strong instruments for the other regressors; otherwise, you can remove them as well.
Leave a comment:
Jupp Peters replied

21 Sep 2023, 11:50
Dear Sebastian Kripfganz,

I have a question regarding the Sequential model selection process you presented at the 2019 London Stata Conference.

In step 3, you recommend to "remove lags or interaction effects with (very) high p-values in individual or joint significance tests". What if, after removing all lags of one regressor due to their high individual p-values, the remaining contemporaneous effect of that regressor still has a high p-value? Should it then be entirely removed? If yes, what should be done with the respective IVs?

I really appreciate any help you can provide.
Leave a comment:

Rupali Vashisht replied

04 Sep 2023, 06:40

Dear Prof. Sebastian Kripfganz,

Thank you very much for your previous response. I am using xtdpdgmm and the gmm technique for the first time and I would really appreciate your help in understanding if I am doing it correctly.

In my model, I consider EPS and ROAA as endogenous. The predetermined variables are - l.ROAA l.EPS TDtoTA SalesGrowth lnTA CFOtoSales CapitalIntensity l.EnvDisclScore. And, the exogenous variables include the time dummies.

I intend to incorporate fixed effects by first differences transformation. And, I wish to take care of the correlation between the error and the lagged EPS by considering their lags as their instruments.

I am using the following code:

xtdpdgmm L(0/1).EPS L(0/1).ROAA TDtoTA SalesGrowth lnTA l.EnvDisclScore i.t, model(diff) collapse gmm(EPS ROAA, lag(2 3)) gmm(EPS ROAA, lag(1 1) diff model(level)) gmm(l.ROAA l.EPS TDtoTA SalesGrowth lnTA CFOtoSales CapitalIntensity l.EnvDisclScore, lag(1 3)) gmm(l.ROAA l.EPS TDtoTA SalesGrowth lnTA CFOtoSales CapitalIntensity l.EnvDisclScore i.t, lag(0 0) diff model(level)) iv(i.t) two vce(cluster id) small overid

And I obtain the following output:

		WC-Robust
EPS	Coefficient	std. err.	t	P>t	[95% conf.	interval]

EPS
L1.	1.009836	0.0749899	13.47	0	0.8620847	1.157588


ROAA	0.2411051	0.1097758	2.2	0.029	0.0248154	0.4573949
L1.ROAA	0.0063018	0.0585874	0.11	0.914	-0.1091322	0.1217357

TDtoTA	0.0151328	0.0461458	0.33	0.743	-0.0757876	0.1060532
SalesGrowth	-0.0158857	0.0163982	-0.97	0.334	-0.0481948	0.0164234
lnTA	0.9796653	1.435696	0.68	0.496	-1.849068	3.808398

EnvDisclScore
L1.	0.0485014	0.0524756	0.92	0.356	-0.0548905	0.1518933

t
2011	0	(empty)
2012	1.378857	1.268998	1.09	0.278	-1.121434	3.879147
2013	-0.3298332	1.159062	-0.28	0.776	-2.613518	1.953851
2014	-1.774592	1.015847	-1.75	0.082	-3.776101	0.2269177
2015	0.5190836	1.136322	0.46	0.648	-1.719796	2.757963
2016	0.6554844	0.9437869	0.69	0.488	-1.204046	2.515015
2017	-0.2927053	0.8384453	-0.35	0.727	-1.944683	1.359272
2018	-2.61427	1.03919	-2.52	0.013	-4.661772	-0.5667682
2019	-0.5594183	0.8990458	-0.62	0.534	-2.330796	1.21196
2020	0	(omitted)

_cons	-11.50282	14.16942	-0.81	0.418	-39.42064	16.415

Instruments corresponding to the linear moment conditions:
1, model(diff):
L2.EPS L3.EPS L2.ROAA L3.ROAA
2, model(level):
L1.D.EPS L1.D.ROAA
3, model(diff):
L3.L.ROAA L3.L.EPS L1.TDtoTA L2.TDtoTA L3.TDtoTA L1.SalesGrowth
L2.SalesGrowth L3.SalesGrowth L1.lnTA L2.lnTA L3.lnTA L1.CFOtoSales
L2.CFOtoSales L3.CFOtoSales L1.CapitalIntensity L2.CapitalIntensity
L3.CapitalIntensity L1.L.EnvDisclScore L2.L.EnvDisclScore
L3.L.EnvDisclScore
4, model(level):
D.TDtoTA D.SalesGrowth D.lnTA D.CFOtoSales D.CapitalIntensity
D.L.EnvDisclScore
5, model(diff):
2013bn.t 2014.t 2015.t 2016.t 2017.t 2018.t 2019.t 2020.t
6, model(level):
_cons

I just want to make sure if the code above does what I want to do. I have the following doubts, I would greatly appreciate your help on these:

Does the model above specify the endogenous, predetermined and exogeneous variables correctly?
I have introduced ‘CFOtoSales’ and ‘CapitalIntensity’ variables in my set of instruments as external instruments as I wish to instrument for ROAA (which I consider endogenous). Am I correct in doing that?
When I use iv(i.t) – am I right in thinking that i.t acts as an instrument for all variables?
Is my model first differenced?
The text below the table shows that the lagged values of the endogenous variables act as instruments of the differenced model and that the level model has differenced variables as instruments (which I have specified) - is that the correct way to go about it?

Apart from the above questions, Could you please help me understand what is the benefit of specifying instruments for both level and difference models?

Thank you very much in advance!

Kind regards,
Rupali

Announcement

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment: