Should I use a Fractional outcome regression?

Dan Eran

Join Date: Oct 2021

Posts: 17
#1

Should I use a Fractional outcome regression?

16 Nov 2021, 08:01

I have panel data with an outcome variable that ranges from 0-0.6 and theoretically can get up to 1 (but never does). t=20 and n=130.
The outcome variable is the number of positive sentences a person says divided by the number of all sentences a person says. There are, of course, countless combinations. e.g., 0/100, 6/13, 1/128, 240/4578, and so forth.
Can I use the "classical"

HTML Code:

xtreg Y X i.year, fe vce(cluster unit)

or should I use a fractional outcome regression?

Thanks,
Dan

Last edited by Dan Eran; 16 Nov 2021, 08:04.
Tags: fixed effects, Fractional outcome, panel data
Fei Wang

Join Date: Oct 2021

Posts: 726
#2

16 Nov 2021, 09:27

Yes, -xtreg- is a good start for your case. You may further use -fracreg logit- or -fracreg probit- with Mundlak device to incorporate fixed effects. Jeff gave an example code in #4 of https://www.statalist.org/forums/for...for-panel-data. Replacing "oprobit" there with "fracreg logit" or "fracreg probit" will do.
Comment
Emmanuel De-Graft Quarshie

Join Date: Nov 2021

Posts: 1
#3

16 Nov 2021, 09:33

if my regression has low r squared and the variables insignificant, can i rely on the module given the hypothesis? im confused and need assistance.
Comment
Weiwen Ng

Join Date: Jun 2015

Posts: 1241
#4

16 Nov 2021, 13:23

Originally posted by Emmanuel De-Graft Quarshie View Post

if my regression has low r squared and the variables insignificant, can i rely on the module given the hypothesis? im confused and need assistance.

Is this related to the original question? If not, you are better off starting a new question - and it would help to provide some output and a quick description of your study and data. As a very general answer, low R^2 means that your independent variables aren't strongly related to the outcome. This is a separate problem from the p-values being over 0.05. It's quite possible to have a low R^2 and statistically significant and substantively meaningful associations (I work with some data like that quite regularly).

If your p-values are all > 0.05 and the magnitude of the associations is small, then assuming the data aren't erroneous and you're using a type of regression that's at least approximately correct for the outcome, then there's nothing to do about it per se. The question of can I rely on the result is the wrong one to ask. Your explanatory variables just aren't working. You can ask why they aren't, but there may not be anything to do about it except collect new data and measure things better (and put more thought into what is the correct thing to measure).

If you have a fractional outcome, then a fractional logit model is probably the best model to use. However, I think many researchers would accept the use of linear regression and its derivatives. It's unlikely that you would get non-significant effects with linear regression and significant ones with fractional regression (or vice versa), although you should feel free to try.

Be aware that it can be very hard to answer a question without sample data. You can use the dataex command for this. Type help dataex at the command line.

When presenting code or results, please use the code delimiters format them. Use the # button on the formatting toolbar, between the " (double quote) and <> buttons.
Comment

Announcement

Should I use a Fractional outcome regression?

Comment

Comment

Comment