XTDPDGMM: new Stata command for efficient GMM estimation of linear (dynamic) panel models with nonlinear moment conditions

Sebastian Kripfganz replied

08 Nov 2024, 03:03
I am not sure why you would even care about this R². It gives you the fraction of the variance of the dependent variable that is explained by the variance of the linear prediction from the explanatory variables. This linear projection does not have a causal interpretation; thus, the notion of endogeneity is not meaningful in this context.
1 like
Leave a comment:
Sarah Magd replied

07 Nov 2024, 08:33
Dear Prof. Sebastian Kripfganz

In the regression where I have the simultaneity issue (i.e., before addressing the endogeneity), can we rely on the value of the resulting R2 from this regression? or would the simultaenity issue and the associated endogeneity can distort the value of the R2?

Could you please guide me on this?
Leave a comment:
Sebastian Kripfganz replied

07 Nov 2024, 04:24
These GMM estimators are intended for panels data sets with a relatively small number of time periods (T) and a relatively large number of cross sections (N). For such panel data, stationarity tests are usually not of much interest.
Leave a comment:
streley paul replied

07 Nov 2024, 04:21
Faut-il effectuer des tests de stationnarité avant d’effectuer la modélisation GMM ?
Leave a comment:
Arkangel Cordero replied

02 Nov 2024, 17:16
Dear Professor @Sebastian Kripfganz

Thank you for clarifying!
Leave a comment:
Sebastian Kripfganz replied

02 Nov 2024, 11:07
The Windmeijer correction is a finite-sample variance correction. Asymptotically, standard errors and any tests based on them are still valid without the correction.

No, the "difference GMM" estimator does not se lagged differences as instruments for the model in level. If you look again at your iv2 instrument in post #654, you will see that the lagged level (which is the value for year 1979) is repeated with a negative sign in the previous year. This is not a difference by itself, but it mechanically ensures that the sum over time multiplied by a constant (such as the fixed effect) equates to zero. In other words, the instrument is orthogonal to any time-invariant variable. This does not require any additional assumption about uncorrelatedness.
Leave a comment:
Arkangel Cordero replied

01 Nov 2024, 12:43
Dear Professor @Sebastian Kripfganz
Wow! Thank you so much for your help. Really helpful. I have two follow up questions:

1) Does the fact that ivreg2 cannot do the Windmeijer correction invalidate the diagnostics (under, weak and over identification tests) generated by this command after "xtdpdgmm, predict iv" ? From your 2019 London Stata Conference presentation, I infer this is not the case, but I just want to double check. Why is this not the case?

2) In your last response above, it seems that when estimating the difference gmm, xtdpdgmm estimates the model in levels and uses lagged differences as instruments. Is that correct? If so, I recall a previous discussion about this making the assumption that the fixed-effects are uncorrelated with the differences (changes in the levels). Is this only problematic in the "system" gmm? Why is it not a problem in the difference gmm? I apologize in advance if I am confusing things.

Thank you again.
Leave a comment:

Sebastian Kripfganz replied

01 Nov 2024, 04:53

We might be using different versions of ivreg2.

Code:

. which ivreg2
*! ivreg2 4.1.12  14aug2024

With your code, I actually cannot replicate the first xtdpdgmm results. This only works if I add the option nocollin to the ivreg2 command line:

Code:

xtdpdgmm L(0/1).n (yr1978-yr1984) w k, model(diff) twostep nolevel nocons gmm(L.n, l(1 1)) iv(yr1978-yr1984 w k) vce(cluster id)
estimate store m1_xtdpdgmm
capture drop iv*
quietly predict iv*, iv

ivreg2 n (L.n w k yr1978-yr1984 = iv*), gmm2s cluster(id) nocons nocollin

I am still not sure what you mean by "defaults". To replicate the results with a different weighting matrix, you can supply this weighting matrix with the ivreg2 option wmatrix():

Code:

xtdpdgmm L(0/1).n (yr1978-yr1984) w k, model(diff) twostep nolevel nocons gmm(L.n, l(1 1)) iv(yr1978- yr1984 w k) vce(cluster id) wmatrix(separate)
estimate store m2_xtdpdgmm
capture drop iv*
quietly predict iv*, iv
local iv "`r(iv)'"
matrix W = e(W)
matrix roweq W = ""
matrix coleq W = ""
matrix rownames W = `iv'
matrix colnames W = `iv'
ivreg2 n (L.n w k yr1978-yr1984 = iv*), cluster(id) nocons nocollin wmatrix(W)

Note that in this example, the matrix W actually contains the optimal weighting matrix for the second step, which is why I have dropped the gmm2s option from ivreg2. Alternatively, you can obtain the initial weighting matrix from the one-step estimator:

Code:

xtdpdgmm L(0/1).n (yr1978-yr1984) w k, model(diff) twostep nolevel nocons gmm(L.n, l(1 1)) iv(yr1978- yr1984 w k) vce(cluster id) wmatrix(separate)
estimate store m2_xtdpdgmm
capture drop iv*
quietly predict iv*, iv
local iv "`r(iv)'"
quietly xtdpdgmm L(0/1).n (yr1978-yr1984) w k, model(diff) nolevel nocons gmm(L.n, l(1 1)) iv(yr1978- yr1984 w k) vce(cluster id) wmatrix(separate) onestep
matrix W = e(W)
matrix roweq W = ""
matrix coleq W = ""
matrix rownames W = `iv'
matrix colnames W = `iv'
ivreg2 n (L.n w k yr1978-yr1984 = iv*), gmm2s cluster(id) nocons nocollin wmatrix(W)

ivreg2 does not have an option for the Windmeijer correction.

The instruments creates by predict with option iv are transformed such that they always refer to the level model. Notice that in your replication of the first model, you did not specify the variables in first differences when using ivreg2. In contrast, when you created the instruments by hand, you then called ivreg2 with first-differenced variables. Please see slide 33 of my 2019 London Stata Conference presentation.

Leave a comment:

Arkangel Cordero replied

31 Oct 2024, 16:19

Dear Professor @Sebastian Kripfganz

Thank you for your help! I was not clear in my third question. What I meant to ask is whether it is possible to replicate the point estimates for xtdpdgmm with its default options using ivreg2 when the gmm-style instruments for ivreg2 are generatred by hand. For example:

Code:


webuse abdata, clear

/* xtdpdgmm with its defaults */
xtdpdgmm L(0/1).n  (yr1978- yr1984) w k,  model(diff) twostep nolevel nocons  gmm(L.n , l(1 1))  iv(yr1978- yr1984 w k)  vce(cluster id)
estimate store m1_xtdpdgmm
capture drop iv*
quietly predict iv*, iv

ivreg2 n (L.n w k yr1978- yr1984 = iv*), gmm2s cluster(id) nocons
estimate store m1_ivreg2



/* xtdpdgmm with "wmatrix(separate)" */
xtdpdgmm L(0/1).n  (yr1978- yr1984) w k,   model(diff) twostep nolevel nocons  gmm(L.n , l(1 1))  iv(yr1978- yr1984 w k)  vce(cluster id) wmatrix(separate)
estimate store m2_xtdpdgmm

* Create "gmm-style" instruments by hand    
foreach var of varlist n {
    di "`var'"
    forvalues lag = 2(1)2 {
        display `lag'
         capture drop il`lag'`var'
         gen il`lag'`var' = L`lag'.`var'
         replace il`lag'`var' = 0 if il`lag'`var' ==.
 

        foreach year of varlist yr1978- yr1984 {
            capture drop i`year'l`lag'`var'
            gen i`year'l`lag'`var' = `year' * il`lag'`var'
            replace i`year'l`lag'`var' = 0 if i`year'l`lag'`var'  == .
            
}
    drop il`lag'`var'
}
}

sum iyr*


ivreg2  D.n yr1978 - yr1984 (D.L.n D.w D.k  = iyr* w k), gmm2s nocons cluster(id)
estimate store m2_ivreg2


esttab  m1_xtdpdgmm m1_ivreg2 m2_xtdpdgmm m2_ivreg2, order(L.n LD.n w D.w  k D.k) b(7) se(8)


-----------------------------------------------------------------------------
                      (1)             (2)             (3)             (4)  
                        n               n               n             D.n  
----------------------------------------------------------------------------
L.n             0.5506190       0.5506191**     0.4111800                  
             (0.30921166)    (0.20419585)    (0.26059471)                  

LD.n                                                            0.4111800*  
                                                             (0.18245773)  

w              -1.0048009**    -1.0048008***   -1.0111454**                
             (0.30828038)    (0.24825200)    (0.33363667)                  

D.w                                                            -1.0111454***
                                                             (0.26820416)  

k               0.2998862       0.2998862       0.3981729                  
             (0.21708074)    (0.15620364)    (0.22582202)                  

D.k                                                             0.3981729*  
                                                             (0.16615589)  

yr1978         -0.0309086*     -0.0309086*     -0.0351301*     -0.0351301**
             (0.01374235)    (0.01249613)    (0.01521336)    (0.01288389)  

yr1979         -0.0388346*     -0.0388346*     -0.0433236*     -0.0081935  
             (0.01775684)    (0.01648030)    (0.02045487)    (0.00863804)  

yr1980         -0.0585632***   -0.0585632***   -0.0630416***   -0.0197180  
             (0.01731379)    (0.01627237)    (0.01833015)    (0.01054987)  

yr1981         -0.0950787***   -0.0950787***   -0.0925442***   -0.0295026  
             (0.02820044)    (0.02412458)    (0.02471889)    (0.01662232)  

yr1982         -0.0621763*     -0.0621763*     -0.0631975*      0.0293467*  
             (0.02868215)    (0.02467994)    (0.02990375)    (0.01438922)  

yr1983         -0.0079399      -0.0079399      -0.0160096       0.0471880**
             (0.02561728)    (0.02466424)    (0.03242727)    (0.01520720)  

yr1984          0.0218494       0.0218494       0.0080898       0.0240994  
             (0.03371198)    (0.03195472)    (0.04064090)    (0.01903547)  
----------------------------------------------------------------------------
N                     891             891             891             751  
----------------------------------------------------------------------------
Standard errors in parentheses
* p<0.05, ** p<0.01, *** p<0.001

So, the first ivreg2 (m1_ivreg2) replicates xtdpdgmm with its defaults, while the second ivreg2 (m2_ivreg2) replicates xtdpdgmm with "wmatrix(separate)". My questions are:

1) Are there any options that can be added to the second "ivreg2" command (m2_ivreg2)--which has the gmm-style instruments generated by hand--so that it replicates the point estimates for "xtdpdgmm" with its defaults (m1_xtdpdgmm)?

2) Does "ivreg2" have any options that would allow it to make the Windmeijer correction in SE?

3) I noticed that the instruments generated by "predict, iv" after "xtdpdgmm" are similar, but not exactly the same as those generated by hand (please see below). The column "iv2" corresponds to an instrument created by "predict, iv" after "xtdpdgmm" whereas "iyr1979l2n" is the equivalent instrument created by the loop above. How can one make sense of a negative version of the lag in the preceding year in the "iv2" column?

Code:

clear
input float(id year iv2 iyr1979l2n)
 1 1977          .          0
 1 1978 -1.6176045          0
 1 1979  1.6176045  1.6176045
 1 1980          0          0
 1 1981          0          0
 1 1982          0          0
 1 1983          0          0
 2 1977          .          0
 2 1978  -4.267163          0
 2 1979   4.267163   4.267163
 2 1980          0          0
 2 1981          0          0
 2 1982          0          0
 2 1983          0          0
 3 1977          .          0
 3 1978  -2.952616          0
 3 1979   2.952616   2.952616
 3 1980          0          0
 3 1981          0          0
 3 1982          0          0
 3 1983          0          0
 4 1977          .          0
 4 1978 -3.2642314          0
 4 1979  3.2642314  3.2642314
 4 1980          0          0
 4 1981          0          0
 4 1982          0          0
 4 1983          0          0

Thank you for your always kind help!

Last edited by Arkangel Cordero; 31 Oct 2024, 16:25.

Leave a comment:

Sebastian Kripfganz replied

31 Oct 2024, 04:56
The following xtdpdgmm specifications exactly replicates your estimates (the first with Windmeijer correction as in xtabond2, and the second without as in ivreg2):

Code:

xtdpdgmm L(0/1).n w k, teffects model(diff) gmm(L.n, lag(1 2)) iv(w k) nocons nolevel vce(cluster id) twostep wmatrix(separate) xtdpdgmm L(0/1).n w k, teffects model(diff) gmm(L.n, lag(1 2)) iv(w k) nocons nolevel twostep wmatrix(separate)

The following is the replication command line for gmm (although I am not sure whether it is possible to replicate standard errors):

Code:

gmm (D.n - {xb:DL.n D.w D.k yr1978-yr1984}), xtinstruments(L.n, lags(1 2)) instruments(w k yr1978-yr1984, nocons) winitial(xt L)

I am not sure what you are aiming for in your last question beyond the above replication. You will always have to generate the GMM-style instruments manually for ivreg2. The main reason is that you need to replace missing values by zeros to avoid those observations being dropped. xtdpdgmm produces those instruments as new variables with the postestimation command predict (with option iv), which can then be fed into ivreg2.
Leave a comment:

Arkangel Cordero replied

30 Oct 2024, 15:16

Dear Professor @Sebastian Kripfganz

I have some follow-up questions. For pedagogical purposes, I am trying to understand how to replicate various specifications of xtdpdgmm with both xtabond2 and ivereg2, and vice versa. Specifically, I would like to know how to replicate the specifications below with xtdpdgmm and Stata’s native “gmm” command. I am aware that the following specifications make the incorrect assumption of homoskedasticity:

Code:

webuse abdata, clear

xtabond2 L(0/1).n (yr1978 - yr1984) w k, ///
 gmm(L.n , lag(1 2) eq(diff) passthru)  iv(yr1978 - yr1984 w k, eq(diff) passthru) ///
 nocons nolevel cluster(id) twostep ar(4) h(1)
estimate store m_xtabond2

* Replicate above results using ivreg2 exclusively
* Create gmm-style instruments
foreach var of varlist n {
   forvalues lag = 2(1)3 {
      capture drop il`lag'`var'
      gen il`lag'`var' = L`lag'.`var'
      replace il`lag'`var' = 0 if il`lag'`var' ==.
 
      foreach year of varlist yr1978- yr1984 {
            capture drop i`year'l`lag'`var' 
            gen i`year'l`lag'`var' = `year' * il`lag'`var'
            replace i`year'l`lag'`var' = 0 if i`year'l`lag'`var'  == .          
}
    drop il`lag'`var' 
}
}

sum iyr*
* Drop all gmm-style instruments that are always "0"
findname, all(@==0)
drop `r(varlist)'


ivreg2  D.n yr1978 - yr1984 (D.L.n D.w D.k  = iyr* w k), gmm2s nocons cluster(id) 
estimate store m_ivreg2_gmm2s

esttab  m_xtabond2 m_ivreg2_gmm2s, order(L.n LD.n w D.w  k D.k) b(7) se(8)

With the results below

HTML Code:

--------------------------------------------
                      (1)             (2)   
                        n             D.n   
--------------------------------------------
L.n             0.1248564                   
             (0.26002198)                   

LD.n                            0.1248564   
                             (0.12566118)   

w              -1.0278240**                 
             (0.32938896)                   

D.w                            -1.0278240***
                             (0.19269549)   

k               0.6161085*                  
             (0.25809647)                   

D.k                             0.6161085***
                             (0.14412866)   

yr1978         -0.0490880**    -0.0490880***
             (0.01844949)    (0.01429634)   

yr1979         -0.0608093*     -0.0117213   
             (0.02464175)    (0.00994148)   

yr1980         -0.0716865***   -0.0108772   
             (0.02055941)    (0.01055265)   

yr1981         -0.0870178***   -0.0153313   
             (0.02482253)    (0.01516805)   

yr1982         -0.0650888*      0.0219290   
             (0.03236609)    (0.01593615)   

yr1983         -0.0343555       0.0307333*  
             (0.04061727)    (0.01495625)   

yr1984         -0.0190336       0.0153218   
             (0.04712421)    (0.01630749)   
--------------------------------------------
N                     751             751   
--------------------------------------------
Standard errors in parentheses
* p<0.05, ** p<0.01, *** p<0.001

In short, I am able to reproduce the point estimates (not the SEs because ivreg2 does not allow for the Windmeijer correction) for "xtabond2, h(1)" using "ivreg2" exclusively, i.e. by creating the gmm-style instruments by hand. Again, my interest in doing so is merely pedagogical and in order to understand the different specifications possible with each command. My questions are the following:

1) Is it possible to replicate the estimates above with xtdpdgmm? What options would I need to use?

2) Is it possible to replicate the estimates above with Stata’s native “gmm” command? What options would I need to use?

3) Do you know of any options for ivreg2 that would result in the correct point estimates as those generated by the default options in xtdpdgmm? I understand that the SEs will always be smaller in ivreg2 because it cannot do the Windmeijer correction, but is it possible to obtain the correct point estimates?

Thank you in advance for your guidance.

Announcement

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment: