Should i use fixed effects or random effects for my paneldata ??

Simone Skovshoved

Join Date: Mar 2020

Posts: 5
#1

Should i use fixed effects or random effects for my paneldata ??

26 Mar 2020, 05:26

Hello,
I am currently writing a thesis on democratization in Latin America and the Caribbean.
I have paneldata consisting of 22 countries over a period of time from 1975-2010 with 5 years intervals.
My dependent variable is "democracy" and my independent variable is an index called "economicdevelopment", consisting of GDP per capita, education, urbanization and industrialization.
We're basically testing the modernization theory, expecting that economic development has a positive effect on democracy.

We would like to know whether we should use random- or fixed-effects. Our Hausman test suggest RE due to a high p-value 0.996.
We're unsure about the results, because we cant really seem to figure out WHY we should use RE. We cant find any reason that supports our argumentation, besides the Hausman test, on why we should use RE. We have to write som eadvantages in our study, so we would like to ask you, what the advantages of RE is and why it is good for our thesis.
So far our OLS and RE regressions get the same results (which is good?) but we need another argument for using RE besides the Hausman test.
My reg looks like this:

xtreg democracy economicdevelopment i.year, re robust

We are not experts at stata, so if you could please try to explain it as simple as possible that would be fantastic! Thank you.

Hope you can help us.
Best regards Simone

Last edited by Simone Skovshoved; 26 Mar 2020, 05:50.
Tags: fixed effects, panel data, random effects
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17728
#2

26 Mar 2020, 08:30

Simone:
welcome to this forum.
I do not think that interested listers can tell you whther or not -re- specification is good for your thesis.
In addition, I cannot get wht -re- specification cannot be ok for your regression analysis.
However, other issues seem relevant:
- did you check whether your model is ill-specified?
- how could you have used -hausman- with non-default standard error?

Kind regards,
Carlo
(Stata 19.0)
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2192
#3

26 Mar 2020, 09:44

Simone: Following up on Carlo's helpful comments, we really need to see your output. There are several reasons the Hausman test can be leading you astray. For one, it gets the degrees-of-freedom wrong if you use the standard form and have included year dummy variables (as you have). Second, your comment that OLS and RE give the "same" results suggests that RE is using an estimate of zero for the unobserved effect. That's not good.

Even without seeing your results, it's likely you should be using fixed effects. With large aggregated units you need to control for differences that are correlated with your covariates. No one will really believe the results, especially if your RE estimates suffer the way I think they do. Plus, you should only use a robust version of the Hausman test that also gets the degrees-of-freedom right. But I doubt I'd even compute the test. I'd go with FE so you eliminate systematic cross-country differences.

JW
1 like
Comment
Simone Skovshoved

Join Date: Mar 2020

Posts: 5
#4

26 Mar 2020, 10:50

Hello. Thank you so much for your responses. I am not quite sure which output you would like to see. So here is the output from my regression:

xtreg demokrati økonomiskudvikling politiskeinstitutioner naturressourcer etniskfragmentering i.year, re robust

... Im not advanced in STATA, so i dont know what ill-specified or non-default standard error means. Please tell me if you would like to see another output, that can better visualize my problem.
Kind regards

Attached Files
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2192
#5

26 Mar 2020, 11:25

A good start. Now show the FE estimates and the output from the Hausman test. For the Hausman test, you will have to drop "robust" in both cases (which is the same as cluster(countryid)). I'm guessing it will show 11 degrees-of-freedom when the correct df is 4.
Comment
Simone Skovshoved

Join Date: Mar 2020

Posts: 5
#6

26 Mar 2020, 12:32

Thank you for your response. Here are the outputs from the FE regression and the Hausman test.
I have also read your chapter on simple panel data methods, and it has helped me along way! So another thank you for writing such a good book I hope I have the outputs right.
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17728
#7

27 Mar 2020, 02:05

Simone;
if the sky-rocketing -hausman- test p-value can be considered available, you shoul go -re-.
As an aside, ignoring something as misspecification test (see -linktest- entry in Stata .pdf manual for a procedure that. with some tweaks can be used for panel data regression, too) or non-default standard errors (ie, robust, cluster, bootstrap, jacknife) can make your analysis really difficult to manage: Stata commands imply that the researcher knows at least the theoretical building blocks of the statistical procedure(s) she/he is interested in.

Kind regards,
Carlo
(Stata 19.0)
Comment
Simone Skovshoved

Join Date: Mar 2020

Posts: 5
#8

27 Mar 2020, 05:46

Okay, thank you. I will read upon that. I did the estimates without i.year, and now the Hausman suggests fixed-effects (p=0.0174) So i will go with that. But thank you for your repsonses and help
Comment
Carlo Lazzaro

Join Date: Apr 2014

Posts: 17728
#9

27 Mar 2020, 06:06

Simone:
omitting variables trying to force -hausman- outcome to confirm you can go -fe- sounds not that scientific.
Besides, if -year- are actually part of the data generating process, your model is misspecified.
You should live with the data (and their results) as they are.

Kind regards,
Carlo
(Stata 19.0)
Comment
Jeff Wooldridge

Join Date: Apr 2014

Posts: 2192
#10

27 Mar 2020, 09:49

Simone: Here are some thoughts. As a general rule, do not be dictated too much by the outcome of these specification tests. It is known that the Hausman test -- even when properly done -- can have low power. Patrik Guggenberger has an article on that in the Journal of Econometrics. Therefore, failing to reject RE doesn't mean you should use it. Context matters, too. Since FE is clearly the more robust of the two estimators, I lean towards it unless the standard errors of FE are "too large." This is something we all need to remember: if you use FE, and the estimates and cluster-robust standard errors are acceptable, there's no reason to even consider RE. We can't become automatons and just go on the basis of the Hausman test.

What I'd like to see is the FE estimates with vce(cluster id) standard errors. It seems to me like these will be almost as precise as RE, which points to using FE no matter what. RE, conceptually, is weird with aggregated units.

More specific observations.

1. Two of your variables don't change over time, I guess, and so the fall out of FE. That's why Stata reports 9 df rather than the 11 I predicted. But the 9 is incorrect. It should only be two. One way to get the correct df is to use

Code:

hausman fixed random, sigmamore

In my experience, this forces Stata to compute the correct df. However, the test is still nonrobust, so I wouldn't really rely too much on it.

2. There is a simple way to obtain a robust test, and to also add back in the time-constant variables. This is the so-called Mundlak approach.

Code:

egen økonomiskudvikling_bar = mean(økonomiskudvikling), by(countryid) egen naturressourcer_bar = mean(naturressourcer), by(countryid) xtreg demokrati økonomiskudvikling politiskeinstitutioner naturressourcer etniskfragmentering økonomiskudvikling_bar naturressourcer_bar i.year, re vce(cluster countryid) test økonomiskudvikling_bar naturressourcer_bar

3. Be sure to include the year fixed effects. The analysis is not convincing without them. In these kinds of examples, including country and time fixed effects is imperative. My discussion here of the robust Hausman test is mostly for future reference.

So, use FE, inlcude i.year, and cluster your standard errors. That's my advice. You don't have to compute the Hausman test if you're not considering using RE.

JW
1 like
Comment
Simone Skovshoved

Join Date: Mar 2020

Posts: 5
#11

27 Mar 2020, 10:36

Carlo: Yes you are right. I just have to argue my way out of it and accept the dataset.
Woolridge: I understand your point and have used your suggestions for the further proces. After i used following code:
hausman fixed random, sigmamore the test shows 2 df I will go with FE since i have time constant variables.
Thank you both your time and adequate suggestions. You have been very helpful. Have a great weekend.
Comment

Announcement

Should i use fixed effects or random effects for my paneldata ??

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment