Dear all,
I have a question regarding how to perform a specific linear regression in Stata 14.2. I would like to compare the change in adjusted R squared over a number of consecutive years for two groups, in order to be able to say something about the value-relevance of accounting information for these two groups. I know that this might be not the best statistical method to do so, though it seems to be an accepted method in the field ofaccounting research.
The regression equation which I want to use is from this paper: Francis, J., & Schipper, K. (1999). Have Financial Statements Lost Their Relevance? Journal of Accounting Research, 37(2), 319-352. doi:10.2307/2491412. The equation for the linear OLS regression is as follows: ADJUSTEDR2i,t = k0 + k1HIGHi,t * t + k2LOWi,t * t + εi,t . In this equation, the variable HIGH has the value 1 if a company is classified as a hightech company and has the value 0 if a company is classiefied as a lowtech company. The opposite applies for the variable LOW. The variable t is an indicator for the year, which starts at 1 for the first year and ends in this paper at 43 for the final year. Please let me know if this description is too elaborate.
As I am unfortunately unable to share my real data on the internet, here is some code to generate some 'fake' data.
Based on this data I tried to to regress the equation in Stata using the following code:
As the output is quite long I won't post it here, as my post is already a long one. However, after regressing Stata shows several notes that the i.hightech#i.yearid and i.lowtech#i.yearid terms have been omitted because of collinearity. As a result of that I only get values for the coefficients, but not for the standard error and t-values etc. Therefore I was wondering whether I am doing something wrong in Stata, or that this model is subject to the dummy variable trap (because of the collinearity). As both the hightech and lowtech indicators appear in the regression equation, the collinearity seems logical to me.
Thanks in advance!
I have a question regarding how to perform a specific linear regression in Stata 14.2. I would like to compare the change in adjusted R squared over a number of consecutive years for two groups, in order to be able to say something about the value-relevance of accounting information for these two groups. I know that this might be not the best statistical method to do so, though it seems to be an accepted method in the field ofaccounting research.
The regression equation which I want to use is from this paper: Francis, J., & Schipper, K. (1999). Have Financial Statements Lost Their Relevance? Journal of Accounting Research, 37(2), 319-352. doi:10.2307/2491412. The equation for the linear OLS regression is as follows: ADJUSTEDR2i,t = k0 + k1HIGHi,t * t + k2LOWi,t * t + εi,t . In this equation, the variable HIGH has the value 1 if a company is classified as a hightech company and has the value 0 if a company is classiefied as a lowtech company. The opposite applies for the variable LOW. The variable t is an indicator for the year, which starts at 1 for the first year and ends in this paper at 43 for the final year. Please let me know if this description is too elaborate.
As I am unfortunately unable to share my real data on the internet, here is some code to generate some 'fake' data.
Code:
clear set obs 24 input fyear hightech lowtech 1993 0 1 1994 0 1 1995 0 1 1996 0 1 1997 0 1 1998 0 1 1999 0 1 2000 0 1 2001 0 1 2002 0 1 2003 0 1 2004 0 1 1993 1 0 1994 1 0 1995 1 0 1996 1 0 1997 1 0 1998 1 0 1999 1 0 2000 1 0 2001 1 0 2002 1 0 2003 1 0 2004 1 0 set seed 12345 gen adj_r2 = runiform() label variable fyear "Fiscal year" label variable hightech "Classified as a hightech company" label variable lowtech "Classified as a lowtech company" label variable adj_r2 "Adjusted R squared" gen yearid = fyear-1992 label variable yearid "ID for the fiscal year"
Code:
regress adj_r2 i.hightech#i.yearid i.lowtech#i.yearid
Thanks in advance!
Comment