Hi,
We have an unbalanced panel for which we want to run logit regressions (Stata 17). We use register data and can unfortunately not share any data, and haven't been able to see a similar problem in any of the Stata datasets, so we sill try to describe the problem. The panel is basically cross-sections of households between ages 18-35 between 2012-2020, meaning that households disappear from and enter the datasets over the years because of the ages.
We have a model (the real model contains other variables, but the problem is with this single variable) in which we regress mobility (dichotomous) on housing tenure (dichotomous; 1 for rental housing). It is well established from previous studies that our coefficient should be positive, and our results seem to confirm that when we run logit by year, or as a pooled sample, using the following commands:
,
the sign is the expected. We have also double-checked with
which confirms that mobility rates are higher under rental tenure. So, the dataset quite clearly indicates a positive relationship.
But when we move into panel estimations we have problems. We did Hausmann tests which indicate we should use FE. We use:
and get the messages:
This means we lose around half of the observations. Also, with FE we consistently get the 'wrong' sign (and we have tried a lot of specifications of multiple regressions). No other coefficients seem to be affected when running multiple regressions. RE gives us the expected sign. Could someone explain why this is?
Thanks in advance for comments!
We have an unbalanced panel for which we want to run logit regressions (Stata 17). We use register data and can unfortunately not share any data, and haven't been able to see a similar problem in any of the Stata datasets, so we sill try to describe the problem. The panel is basically cross-sections of households between ages 18-35 between 2012-2020, meaning that households disappear from and enter the datasets over the years because of the ages.
We have a model (the real model contains other variables, but the problem is with this single variable) in which we regress mobility (dichotomous) on housing tenure (dichotomous; 1 for rental housing). It is well established from previous studies that our coefficient should be positive, and our results seem to confirm that when we run logit by year, or as a pooled sample, using the following commands:
Code:
logit FlyttLok_D rentalDummy
Code:
bysort year: logit FlyttLok_D rentalDummy
the sign is the expected. We have also double-checked with
Code:
tabstat FlyttLok_D, statistics(mean) by(rentalDummy)
But when we move into panel estimations we have problems. We did Hausmann tests which indicate we should use FE. We use:
Code:
xtset ID year xtlogit FlyttLok_D rentalDummy, fe
note: multiple positive outcomes within groups encountered.
note: 546,209 groups (1,291,930 obs) omitted because of all positive or
all negative outcomes.
note: 546,209 groups (1,291,930 obs) omitted because of all positive or
all negative outcomes.
Thanks in advance for comments!
Comment