Hi all,
I am facing a model specification problem related to the Fixed-effects model and the Fixed-effects Poisson model. I am working with an unbalanced panel data set that contains 3,588 observations and 110 groups, spanning 45 years. The continuous outcome variable, which represents military aid from the U.S., contains 1,519 zeros. Upon examining both the outcome variable and the independent variables, I found that all of them are right-skewed. Therefore, as my first step, I took the natural logarithm of the dependent variable (DV) and all independent variables (IVs), except for the interaction term. I then tested the hypothesis using the Fixed-effects model. The relevant code is provided below:
I used predict and kdensity functions to plot the residuals. As below: 
However, even after taking the natural logarithm of the outcome variable, it still exhibits excess zeros and remains right-skewed. See below:
This leads me to question whether the Fixed-effects model is a good fit for the data, even though the distribution of residuals appears to be acceptable. This is my first question.
I began searching for solutions to address the excess zeros in the outcome variable. Dr. Wooldridge (1999) suggests that continuous variables can also be used in Fixed-effects Poisson models in "Distribution-free estimation of some nonlinear panel data models." Therefore, I tested my data using 'xtpoisson'. Below are two different models, the first one is logged DV and the second one is unlogged DV.
Things are becoming tricky for me. First, should/ can I take the log of the outcome variable in a FE Poisson model? Second, can I even take the log of any variable in a FE Poisson model? Third, can I even use FE Poisson to estimate my question since the outcome variable is continuous?
I am grateful for your time and consideration in reading this post. Any help you can provide is appreciated.
I am facing a model specification problem related to the Fixed-effects model and the Fixed-effects Poisson model. I am working with an unbalanced panel data set that contains 3,588 observations and 110 groups, spanning 45 years. The continuous outcome variable, which represents military aid from the U.S., contains 1,519 zeros. Upon examining both the outcome variable and the independent variables, I found that all of them are right-skewed. Therefore, as my first step, I took the natural logarithm of the dependent variable (DV) and all independent variables (IVs), except for the interaction term. I then tested the hypothesis using the Fixed-effects model. The relevant code is provided below:
Code:
xtreg log_US_militiary_aidit i.Dummy1##i.Dummy2 lag_log_X1t lag_log_X2t lag_log_X3t lag_log_X4t lag_log_X5t, fe vce(cluster cow_code)
However, even after taking the natural logarithm of the outcome variable, it still exhibits excess zeros and remains right-skewed. See below:
This leads me to question whether the Fixed-effects model is a good fit for the data, even though the distribution of residuals appears to be acceptable. This is my first question.
I began searching for solutions to address the excess zeros in the outcome variable. Dr. Wooldridge (1999) suggests that continuous variables can also be used in Fixed-effects Poisson models in "Distribution-free estimation of some nonlinear panel data models." Therefore, I tested my data using 'xtpoisson'. Below are two different models, the first one is logged DV and the second one is unlogged DV.
Code:
xtpoisson log_US_militiary_aidit i.Dummy1##i.Dummy2 lag_log_X1t lag_log_X2t lag_log_X3t lag_log_X4t lag_log_X5t, fe vce(robust)
Code:
xtpoisson US_militiary_aidit i.Dummy1##i.Dummy2 lag_log_X1t lag_log_X2t lag_log_X3t lag_log_X4t lag_log_X5t, fe vce(robust)
I am grateful for your time and consideration in reading this post. Any help you can provide is appreciated.
Comment