I have been trying to estimate a binary outcome variable on a large, unbalanced panel dataset. I have Stata MP and good hardware (SSD, 60GB RAM, 8 core). And yet, it is almost impossible to estimate logit or probit models (takes several days before Stata became non-responsive, then crashed). Any suggestions?
I have tried the following, and neither has worked (i.e. finishing the estimation task) --
1. xi:logit y x1 x2 x3 ... x10 i.userid (also tried probit)
2. xtlogit y x1 x2 ... x10, fe (also tried probit)
My sample:
N= 40810
t=11215
Highly unbalanced with some units having very long series, whereas some just a few periods.
So far it seems a linear probability model is the only thing feasible (I tried -areg y x1 x2 ... x10, absorb(userid)-), but I was hoping to see if the estimates are similar to those from a logit or probit model. Yet, I have not been able to finish running either logit or probit model, even once. Other than taking random samples of units, are there anything else I can try?
Thanks to anyone who can share some suggestions.
I have tried the following, and neither has worked (i.e. finishing the estimation task) --
1. xi:logit y x1 x2 x3 ... x10 i.userid (also tried probit)
2. xtlogit y x1 x2 ... x10, fe (also tried probit)
My sample:
N= 40810
t=11215
Highly unbalanced with some units having very long series, whereas some just a few periods.
So far it seems a linear probability model is the only thing feasible (I tried -areg y x1 x2 ... x10, absorb(userid)-), but I was hoping to see if the estimates are similar to those from a logit or probit model. Yet, I have not been able to finish running either logit or probit model, even once. Other than taking random samples of units, are there anything else I can try?
Thanks to anyone who can share some suggestions.
Comment