Dear Statalist members
I am running a regression with Log stock returns on 1st quarter of 2020 as my dependent variable;
and industries classified into 48 groups (Fama French-48) as my independent variables. My intention is to check whether stock returns varied among industries during the outbreak of Corona(which is obvious logically). Since it is advised to cluster the standard errors at the aggregate level, I ran the following code with clustering at the industry level and I am attaching a subset of my results
As the results indicate all my standard errors are very big, bizarre (+ve &-ve) and F statistic is missing. My intention of clustering by industries is to account for correlation among firms in the same industry but I know that during this period correlation can exist amongst industries also. Hence I tried the following command by clustering at the company level and my results are attached
Now, which one should I consider for my interpretation? I doubt that the model which cluster at industry level is usable since all p values are significant there. Which one should I use for interpretation purposes? If further clarification is required, I am happy to provide it.
Thanks in advance
I am running a regression with Log stock returns on 1st quarter of 2020 as my dependent variable;
Code:
Ln (Stock price on 31-03-2020/ Stock price on 01-01-2020)
Code:
reg quart_ret i.ff48,vce(cluster ff48)
Code:
. reg quart_ret i.ff48,vce(cluster ff48)
Linear regression Number of obs = 1924
F( 0, 29) = .
Prob > F = .
R-squared = 0.0586
Root MSE = .30878
(Std. Err. adjusted for 30 clusters in ff48)
------------------------------------------------------------------------------
| Robust
quart_ret | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
ff48 |
5 | -.0006924 1.29e-15 -5.4e+11 0.000 -.0006924 -.0006924
7 | .0184726 1.29e-15 1.4e+13 0.000 .0184726 .0184726
8 | -.0564544 1.29e-15 -4.4e+13 0.000 -.0564544 -.0564544
9 | -.0447615 1.29e-15 -3.5e+13 0.000 -.0447615 -.0447615
10 | .1450836 1.29e-15 1.1e+14 0.000 .1450836 .1450836
11 | .1361723 1.29e-15 1.1e+14 0.000 .1361723 .1361723
13 | .1527319 1.43e-15 1.1e+14 0.000 .1527319 .1527319
14 | .0309398 1.30e-15 2.4e+13 0.000 .0309398 .0309398
16 | .0442209 1.29e-15 3.4e+13 0.000 .0442209 .0442209
As the results indicate all my standard errors are very big, bizarre (+ve &-ve) and F statistic is missing. My intention of clustering by industries is to account for correlation among firms in the same industry but I know that during this period correlation can exist amongst industries also. Hence I tried the following command by clustering at the company level and my results are attached
Code:
reg quart_ret i.ff48,vce(cluster companyname)
Linear regression Number of obs = 1924
F( 29, 1923) = 4.70
Prob > F = 0.0000
R-squared = 0.0586
Root MSE = .30878
(Std. Err. adjusted for 1924 clusters in companyname)
------------------------------------------------------------------------------
| Robust
quart_ret | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
ff48 |
5 | -.0006924 .0914865 -0.01 0.994 -.1801155 .1787308
7 | .0184726 .0776587 0.24 0.812 -.1338315 .1707766
8 | -.0564544 .0526913 -1.07 0.284 -.1597924 .0468836
9 | -.0447615 .0512153 -0.87 0.382 -.1452049 .0556818
10 | .1450836 .0650334 2.23 0.026 .0175402 .2726271
11 | .1361723 .0814712 1.67 0.095 -.0236089 .2959536
13 | .1527319 .0408991 3.73 0.000 .0725207 .2329432
14 | .0309398 .0304394 1.02 0.310 -.0287578 .0906375
16 | .0442209 .0357533 1.24 0.216 -.0258985 .1143403
17 | -.0232826 .0417664 -0.56 0.577 -.1051948 .0586297
18 | -.1212549 .0403227 -3.01 0.003 -.2003357 -.042174
19 | -.0859658 .036367 -2.36 0.018 -.1572888 -.0146428
21 | -.0131784 .0371308 -0.35 0.723 -.0859993 .059642
Thanks in advance

Comment