Hello all,
my Name ist Marcus Fiedler. I am writing a dissertation (PhD) about the human resources director.
UP to now i am working with Stata 14. The structure will be set with "xtset ID Year".
The literature suggests that there are three different methods for getting "same" results:
Panel FE regression (FE procedure involves group-mean-centering all iV, but I have time invariant variables...) or
Panel Quasi-RE Regression or
RE with Controls (controls are mean variables from iv)
What I want to do:
In my data there are time-invariant variables which means they don´t get an coefficient with FE regression and omitted. That means, I have to do a Quasi-RE (a), including group-mean-centering variables, or RE with controls (b), iv and mean iv.
Example (simple):
(a) x= (y1-y1m)+(y2-y2m)...
(b) x = y1+y1m+y2+y2m...
Both methods produce the same results like FE. The first method is also called "hybrid".
What my problem is:
I will produce RE with controls. My problems is that I have missing data and it isn´t possible calculating and including MEANS iv before the regression is running.
Example:
t a b
2005 12 40
2006 17 30
2007 13 x
2008 x 20
Calculating before running: mean a = 14
Calculation before running: mean b = 30
BUT calculation means before running the RE regression is misleading because my calculation doesn´t involve missing data. Missing data would "erase" two lines and technical means (right or real means) would be:
a = 14,5 (2007 and 2008 deleted)
b = 35 (2007 and 2008 deleted)
Manual calucalting is very tricky and so my question:
Is it possible that I can save mean variables, produced by Stata (under investigation of missing), after running a regression?
That would I mean that I don´t have to calculate means before.
I hope you can understand my problem and let me know a practical solution
Have a nice day,
Marcus
my Name ist Marcus Fiedler. I am writing a dissertation (PhD) about the human resources director.
UP to now i am working with Stata 14. The structure will be set with "xtset ID Year".
The literature suggests that there are three different methods for getting "same" results:
Panel FE regression (FE procedure involves group-mean-centering all iV, but I have time invariant variables...) or
Panel Quasi-RE Regression or
RE with Controls (controls are mean variables from iv)
What I want to do:
In my data there are time-invariant variables which means they don´t get an coefficient with FE regression and omitted. That means, I have to do a Quasi-RE (a), including group-mean-centering variables, or RE with controls (b), iv and mean iv.
Example (simple):
(a) x= (y1-y1m)+(y2-y2m)...
(b) x = y1+y1m+y2+y2m...
Both methods produce the same results like FE. The first method is also called "hybrid".
What my problem is:
I will produce RE with controls. My problems is that I have missing data and it isn´t possible calculating and including MEANS iv before the regression is running.
Example:
t a b
2005 12 40
2006 17 30
2007 13 x
2008 x 20
Calculating before running: mean a = 14
Calculation before running: mean b = 30
BUT calculation means before running the RE regression is misleading because my calculation doesn´t involve missing data. Missing data would "erase" two lines and technical means (right or real means) would be:
a = 14,5 (2007 and 2008 deleted)
b = 35 (2007 and 2008 deleted)
Manual calucalting is very tricky and so my question:
Is it possible that I can save mean variables, produced by Stata (under investigation of missing), after running a regression?
That would I mean that I don´t have to calculate means before.
I hope you can understand my problem and let me know a practical solution

Have a nice day,
Marcus
Comment