Hi Statalisters,
I am currently working on my Thesis and I am having problems on STATA - which I am using for the first time.
I have a panel data which spans from 2008 through 2015 and covers 181 Italian listed family firms. My main interest is the relation between founding-family ownership and firm performance. The analysis also incorporates variables that identify CEOs as firm founders, descendants of the firm's founder, or outsiders. I would like to use a two-way fixed effects model for my regression analysis.
The paper I have read that does something similar describes the fixed effects to be dummy variables for each year of the sample and dummy variables for each two-digit SIC code (I would like to use ATECO 2007 Code since I am talking about Italy), and the regression they employ is the following:
Firm Performance= δ0 + δ1 (Family Firm) + δ3 (control Variables) + δ3 + δ54 (Two digit ATECO Code) + δ'93-'99 (Year Dummy Variables) + 𝛆
where
Firm Performance = ROA based on EBITDA and net income, and Tobin's q;
Family Firm = binary variable that equals one when the founding family is pre- sent in the firm, and zero otherwise; Control Variables = officer and director holdings less family holdings, fraction of independent directors serving on the board, research and development expenses divided by total sales, long-term debt divided by total as- sets, stock return volatility, natural log of total assets, and the natural log of firm age;
Two-Digit ATECO Code = 1.0 for each two-digit SIC code in our sample;
Year Dummy Variables = 1.0 for each year of our sample period."
Now, my Professor has suggested me to do this in STATA:
- encode Company, gen (Company1)
- xtset Company1
- xtset Company1 Year
- encode ATECOCode2007, gen(ATECO)
- xtset ATECO
- egen id = group(Year ATECO)
*I hereby created a joined variable of Year and ATECO in order to replicate via an OLS the two-way fixed effects model - then I performed the areg code*
- generate young = (Firmageyears<50)
- generate old = (Firmageyears>50)
- generate CEOhire = (CEOfam<0.5)
- generate CEOfounder = (CEOfound>0.5)
- generate CEOdescendant = (CEOdesc>0.5)
*To specify, in my excel the variable FamilyFirm is already a dummy - do I have to generate one on STATA as well?*
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA Returnperemployee Salesgrowth CapexPPE1 LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb (id)
- areg ROE FamilyFirm Officersanddirectorsownless DE DEBITDA Returnperemployee Salesgrowth CapexPPE1 LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb (id)
- areg Tobinsq FamilyFirm Officersanddirectorsownless DE DEBITDA Returnperemployee Salesgrowth CapexPPE1 LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb (id)
*Removed Returnperemployee, CapexPPE1 - First set: FF/NFF*
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
*Second set - young/old*
- areg ROA young old Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROE young old Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg Tobinsq young old Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
*Third set: CEOhire/CEOfounder/CEOdescendant*
- areg ROA CEOhire CEOfounder CEOdescendant Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROE CEOhire CEOfounder CEOdescendant Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg Tobinsq CEOhire CEOfounder CEOdescendant Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
** PROBLEMS **
0. Is first of all, right the areg code, or is there a better way to create a two-way fixed effects model?
1. I have NO INTERCEPT in my regression results
2. If I were to add the variable x Outsidedirectors, my variable FamilyFirm is omitted from the regression because of Collinearity problems.... how come? This would be an important variable for my model and I did not have this kind of problem in my old database...is there a problem on Excel maybe?
Thank you A LOT in advance for your help!
I am currently working on my Thesis and I am having problems on STATA - which I am using for the first time.
I have a panel data which spans from 2008 through 2015 and covers 181 Italian listed family firms. My main interest is the relation between founding-family ownership and firm performance. The analysis also incorporates variables that identify CEOs as firm founders, descendants of the firm's founder, or outsiders. I would like to use a two-way fixed effects model for my regression analysis.
The paper I have read that does something similar describes the fixed effects to be dummy variables for each year of the sample and dummy variables for each two-digit SIC code (I would like to use ATECO 2007 Code since I am talking about Italy), and the regression they employ is the following:
Firm Performance= δ0 + δ1 (Family Firm) + δ3 (control Variables) + δ3 + δ54 (Two digit ATECO Code) + δ'93-'99 (Year Dummy Variables) + 𝛆
where
Firm Performance = ROA based on EBITDA and net income, and Tobin's q;
Family Firm = binary variable that equals one when the founding family is pre- sent in the firm, and zero otherwise; Control Variables = officer and director holdings less family holdings, fraction of independent directors serving on the board, research and development expenses divided by total sales, long-term debt divided by total as- sets, stock return volatility, natural log of total assets, and the natural log of firm age;
Two-Digit ATECO Code = 1.0 for each two-digit SIC code in our sample;
Year Dummy Variables = 1.0 for each year of our sample period."
Now, my Professor has suggested me to do this in STATA:
- encode Company, gen (Company1)
- xtset Company1
- xtset Company1 Year
- encode ATECOCode2007, gen(ATECO)
- xtset ATECO
- egen id = group(Year ATECO)
*I hereby created a joined variable of Year and ATECO in order to replicate via an OLS the two-way fixed effects model - then I performed the areg code*
- generate young = (Firmageyears<50)
- generate old = (Firmageyears>50)
- generate CEOhire = (CEOfam<0.5)
- generate CEOfounder = (CEOfound>0.5)
- generate CEOdescendant = (CEOdesc>0.5)
*To specify, in my excel the variable FamilyFirm is already a dummy - do I have to generate one on STATA as well?*
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA Returnperemployee Salesgrowth CapexPPE1 LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb (id)
- areg ROE FamilyFirm Officersanddirectorsownless DE DEBITDA Returnperemployee Salesgrowth CapexPPE1 LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb (id)
- areg Tobinsq FamilyFirm Officersanddirectorsownless DE DEBITDA Returnperemployee Salesgrowth CapexPPE1 LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb (id)
*Removed Returnperemployee, CapexPPE1 - First set: FF/NFF*
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROA FamilyFirm Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
*Second set - young/old*
- areg ROA young old Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROE young old Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg Tobinsq young old Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
*Third set: CEOhire/CEOfounder/CEOdescendant*
- areg ROA CEOhire CEOfounder CEOdescendant Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg ROE CEOhire CEOfounder CEOdescendant Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
- areg Tobinsq CEOhire CEOfounder CEOdescendant Officersanddirectorsownless DE DEBITDA LTDTA1 Returnvolatility1 lnTotalassets Lnfirmage, absorb(id)
** PROBLEMS **
0. Is first of all, right the areg code, or is there a better way to create a two-way fixed effects model?
1. I have NO INTERCEPT in my regression results
2. If I were to add the variable x Outsidedirectors, my variable FamilyFirm is omitted from the regression because of Collinearity problems.... how come? This would be an important variable for my model and I did not have this kind of problem in my old database...is there a problem on Excel maybe?
Thank you A LOT in advance for your help!
Comment