Singleton dummy variable

Ariel Karlinsky

Join Date: Jun 2015

Posts: 491
#1

Singleton dummy variable

28 Dec 2015, 02:50

Hi all,

Running xtivreg2 I get the foloowing error message regaring the Hansen's J statstic:

Code:

Warning: estimated covariance matrix of moment conditions not of full rank. model tests should be interpreted with caution. Possible causes: singleton dummy variable (dummy with one 1 and N-1 0s or vice versa) partial option may address problem.

My question is: How can I check which variables in which observations are singleton dummies? I was unable to find anything useful by googling this... Any help would be greatly appreciated
Tags: None
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17708
#2

28 Dec 2015, 04:05

Ariel:
can't you simply -describe- or -xttab- your variables?

Kind regards,
Carlo
(Stata 19.0)
Comment
Ariel Karlinsky

Join Date: Jun 2015

Posts: 491
#3

04 Jan 2016, 06:54

I'll try that (my access to that station is limited) - but my question is: What do I expect to see in xttab when a variable is a singleton dummy? I'm not quite sure about that. of course I have variables that do not vary by panel unit such as gender and variables that do not vary by time such as year fixed effects.

Also, this error occues in xtivreg2\xtoverid only when the vce(robust) option is requested. I googled it and found several threads regarding the matter, but I still don't understand what can cause this...
Comment

Nick Cox

Join Date: Mar 2014
Posts: 35698

04 Jan 2016, 07:03

Singleton dummies have sum 1. That can be approached in several ways. In your case you should know which of your variables are dummies, but if all else fails you can also get Stata to find them. Here findname needs to be downloaded first from theStata Journal site. (You won't need to use it if you can specify your dummies directly.)

Code:

. sysuse auto, clear
(1978 Automobile Data)

. gen foo = _n == 42

. tabstat price-foo, s(sum) col(stat)

    variable |       sum
-------------+----------
       price |    456229
         mpg |      1576
       rep78 |       235
    headroom |     221.5
       trunk |      1018
      weight |    223440
      length |     13907
        turn |      2934
displacement |     14600
  gear_ratio |     223.1
     foreign |        22
         foo |         1
------------------------

. findname , all(inlist(@, 0, 1, .))
foreign  foo

. tabstat `r(varlist)' , s(sum) col(stat)

    variable |       sum
-------------+----------
     foreign |        22
         foo |         1
------------------------

Comment

Mark Schaffer

Join Date: Mar 2014

Posts: 324
#5

04 Jan 2016, 10:57

The best discussion about this is an oldie but a goodie by Vince Wiggins:

http://www.stata.com/statalist/archi.../msg00594.html
Comment
Ariel Karlinsky

Join Date: Jun 2015

Posts: 491
#6

06 Jan 2016, 01:11

Thanks Nick! I do know which variables are dummies (most of them are virtual dummies from factor variables), so I used tabstat, sum(on e(sample)) like you show and I think I found the culprit.

Thanks for the link Mark, it explains brilliantly why this is happening.
Of course I especially liked the ending:

You might wonder why I always use the word "indicator" to describe binary
regressors, even when the original poster called them "dummy" variables. I was once briefing a group of mainly military personal about the implications of some model. About the third time I referred to the "colonel dummy" everyone at the table broke out in laughter, everyone that is except the older gentleman at the head of the table with lots of bars on his label and little bird emblems on his shoulders. I long ago forgot the subject of the talk, but I never forgot the lesson.

since the culprit is part of a FE and also on just 1 observation I will re-estimate by omitting the observation and also by omitting the indicator, things should stay pretty much the same.

Mark also suggested to partial out the variables i'm not interested in their coefficients. unfortunately xtoverid does not have a partial() option like ivreg2/xtivreg2 and so I might need to manaully partial these out. the issue is that I'm not really sure how to partial these out in a Random Effects model. Say my model is:
xtreg y (x1 = z1 z2) x2 x3 i.year, re
and I wish to partial out the time FE of i.year . in a non IV, non RE model I would just regress all of the variables (including y) on i.year, get residuals and then run a regression on all residuals.
I tried experimenting with this on my data but was unable to get results that seem to be correct, I suspect it's because the first stage in a RE model is some combination of BE and FE and this makes the usual way of partialling out somehow different.
Comment

Announcement

Singleton dummy variable

Comment

Comment

Comment

Comment

Comment