Issues in convergence of fractional logit model

Jaya Jumrani

Join Date: Jul 2014

Posts: 12
#1

Issues in convergence of fractional logit model

19 Jul 2014, 03:12

Dear Stata users,

I'm working on a survey dataset in Stata 11.2 with around 50,000 households. My dependent variable is a proportion variable and as a result I'm using the fractional logit model i.e. the command glm Y X, family(binomial) link(logit) robust where Y is a proportion ranging between 0 and 1 with majority being zeroes. X consists of continuous, dummy variables and some variables which are interactions among those variables. The squared terms of some of these variables are also included.

The problem that I'm facing is that the model is not able to converge and the log pseudolikelihood gets backed up. The independent variables whose squared terms have been incorporated have also been centered. The problem persists even after changing the ml technique to bhhh.

Any help in this regard will be highly appreciated.

Thanks in anticipation.

Best,
Jaya
Tags: None
Richard Williams

Join Date: Apr 2014

Posts: 5011
#2

19 Jul 2014, 06:45

The -difficult- option sometimes works miracles. Other than that I would suggest starting with a very simple model and then building it up. Maybe you can identify a variable that is causing all the grief.

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://academicweb.nd.edu/~rwilliam/
Comment
Andrew Lover

Join Date: Apr 2014

Posts: 182
#3

19 Jul 2014, 07:14

Sometimes cycling through multiple ml algorithms helps: eg, -technique(bhhh nr dfp bfgs)- or adding the -difficult- option

Also, take a look at section 4 here: Cummings P (2009) Methods for estimating adjusted risk ratios. Stata Journal 9: 175–196.
However, this might be problematic with svy data....

__________________________________________________ __
Assistant Professor, Department of Biostatistics and Epidemiology
School of Public Health and Health Sciences
University of Massachusetts- Amherst
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2189
#4

19 Jul 2014, 10:32

This comment may be uninformed, but I wonder if the problem is due to perfectly predicting some zero outcomes based on certain combinations of the covariates. For binary responses, logit, probit, and glm handle this problem and drop the relevant observations. I don't know whether glm has a way of recognizing perfect predictions with fractional responses. I haven't encountered this problem in the examples I've done.
Comment
Jaya Jumrani

Join Date: Jul 2014

Posts: 12
#5

21 Jul 2014, 04:43

Thanks Dr. Williams. The -difficult- option unfortunately doesn't solve the problem. I did start from a simple model and then started building up on it and the grief causing variables are continuous and interaction (continuous*dummy) variables, which are also theoretically important variables for the model.
Comment
Jaya Jumrani

Join Date: Jul 2014

Posts: 12
#6

21 Jul 2014, 04:56

Thanks Mr. Andrew.. The command as suggested in "Cummings P (2009) Methods for estimating adjusted risk ratios. Stata Journal 9: 175–196" is applicable but is also leading to the iterations getting backed up. The cycling through multiple ml algorithms is in progress yet.
Comment
Jaya Jumrani

Join Date: Jul 2014

Posts: 12
#7

21 Jul 2014, 05:19

Thanks Dr. Woolridge. I don't get any such warning messages when I use the glm command. However, when I run the normal logit command I do get the error message of some of the independent variables perfectly predicting the data.
Comment
Maarten Buis

Join Date: Mar 2014

Posts: 3464
#8

21 Jul 2014, 05:43

Originally posted by Jaya Jumrani View Post

I don't get any such warning messages when I use the glm command. However, when I run the normal logit command I do get the error message of some of the independent variables perfectly predicting the data.

I suspect that glm won't check for perfect prediction, so I suspect that that is the cause of your problem. I have no idea how to solve it though.

---------------------------------
Maarten L. Buis
University of Konstanz
Department of history and sociology
box 40
78457 Konstanz
Germany
http://www.maartenbuis.nl
---------------------------------
Comment
Jaya Jumrani

Join Date: Jul 2014

Posts: 12
#9

22 Jul 2014, 04:04

Thanks Dr. Buis. If you happen to come across any solution for the same, please do share.
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2189
#10

22 Jul 2014, 06:32

As I wrote to Jaya in an email, one should not use the logit (or probit) commands in Stata for fractional response. Those commands automatically transform the dependent variable into a zero-one variable. One should always use the glm command for fractional response, which leaves the Y variable in its original form. Jaya's problem seems to go away when glm is used, which means there is sufficient variation in the outcome that cannot be perfectly predicted by Y.
Comment
Richard Williams

Join Date: Apr 2014

Posts: 5011
#11

22 Jul 2014, 06:44

Originally posted by Jeff Wooldridge View Post

Jaya's problem seems to go away when glm is used, which means there is sufficient variation in the outcome that cannot be perfectly predicted by Y.

However, Jaya said in her first post that she was using glm. So I assume there was also some other change that solved the problem? If so, what was it?

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://academicweb.nd.edu/~rwilliam/
Comment
Richard Williams

Join Date: Apr 2014

Posts: 5011
#12

22 Jul 2014, 06:45

Originally posted by Jeff Wooldridge View Post

Jaya's problem seems to go away when glm is used, which means there is sufficient variation in the outcome that cannot be perfectly predicted by Y.

However, Jaya said in her first post that she was using glm. So I assume there was also some other change that solved the problem? If so, what was it? (Just in case I ever need this solution myself!)

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://academicweb.nd.edu/~rwilliam/
Comment
Maarten Buis

Join Date: Mar 2014

Posts: 3464
#13

22 Jul 2014, 07:54

Originally posted by Jeff Wooldridge View Post

As I wrote to Jaya in an email, one should not use the logit (or probit) commands in Stata for fractional response.

The way I understand the problem is that with glm the fractional logit model does not converge. If logit is used there is an warning message that says there are 0s that are exactly predicted.

Jeff is (obviously) right that logit is not the correct way to estimate a fractional logit. However, the warning message that 0s are exactly predict can be useful: The 0s in logit match the 0s in glm (the 1s in logit don't, they match any value larger than 0 in glm). So the exact prediction of 0s in logit should match the exact prediction of 0s in glm, which could be the reason why glm does not converge.

---------------------------------
Maarten L. Buis
University of Konstanz
Department of history and sociology
box 40
78457 Konstanz
Germany
http://www.maartenbuis.nl
---------------------------------
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2189
#14

22 Jul 2014, 09:47

Here is what Jaya wrote in her recent email to me: "I don't get any such warning messages when I use the glm command. However, when I run the normal logit command I do get the error message of some of the independent variables perfectly predicting the data. " I assume she made a mistake earlier in describing the problem. I agree that using glm could still have led to a problem, but it doesn't seem to now in Jaya's particular application.
Comment
Richard Williams

Join Date: Apr 2014

Posts: 5011
#15

22 Jul 2014, 09:58

To me that sounds like glm doesn't give an error about perfect prediction, but whatever problem caused her to post in the first place is still there or has been solved some other way. But Jaya is the ultimate authority on this!

-------------------------------------------
Richard Williams, Notre Dame Dept of Sociology
StataNow Version: 19.5 MP (2 processor)
EMAIL: [email protected]
WWW: https://academicweb.nd.edu/~rwilliam/
Comment

Announcement

Issues in convergence of fractional logit model

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment