Dear Statalist community,
I have a problem regarding my bond panel. I’d like to run a fixed and random effects regression on the bond panel, with “YieldatIssuance” as the dependent variable and the Dummy “Green” as explanatory variable. I would also like to control the fixed effects of the following variables: Ticker (stands for the Issuer), Currency, PaymentRank, Maturity, AmountIssue, YearMonth(Issue Date), Rating. I converted the string variables into numeric variables using the encode command and dropped all variables I don’t need as controls from the panel.
However, whenever I run the panel regression the variable “Green” is omitted due to collinearity. Therefore, I ran a regression of Green with every control variable of my panel to check for high R-squared values as I have read in a thread of a similar problem. However, the R-squared of each regression was below 1%.
That’s the point where I am irritated now. I could not find any high correlation among my explanatory and control variables. Note that the variable “Green” displays relatively few “true” observations compared to the observations that do not fall under that category (Green==1 has 828 observations vs. 50,106 observations, if Green=0). Do you guys have any alternative ideas that might cause this collinearity? And do you have any suggestions to solve this problem, so I can use "Green" as explanatory variable? Any help will be highly appreciated!
Thank you very much,
Hans
This is my code and output:
I have a problem regarding my bond panel. I’d like to run a fixed and random effects regression on the bond panel, with “YieldatIssuance” as the dependent variable and the Dummy “Green” as explanatory variable. I would also like to control the fixed effects of the following variables: Ticker (stands for the Issuer), Currency, PaymentRank, Maturity, AmountIssue, YearMonth(Issue Date), Rating. I converted the string variables into numeric variables using the encode command and dropped all variables I don’t need as controls from the panel.
However, whenever I run the panel regression the variable “Green” is omitted due to collinearity. Therefore, I ran a regression of Green with every control variable of my panel to check for high R-squared values as I have read in a thread of a similar problem. However, the R-squared of each regression was below 1%.
That’s the point where I am irritated now. I could not find any high correlation among my explanatory and control variables. Note that the variable “Green” displays relatively few “true” observations compared to the observations that do not fall under that category (Green==1 has 828 observations vs. 50,106 observations, if Green=0). Do you guys have any alternative ideas that might cause this collinearity? And do you have any suggestions to solve this problem, so I can use "Green" as explanatory variable? Any help will be highly appreciated!
Thank you very much,
Hans
This is my code and output:
Code:
. duplicates drop ISIN_num, force Duplicates in terms of ISIN_num (36 observations deleted) . xtset ISIN_num IssueDate panel variable: ISIN_num (weakly balanced) time variable: IssueDate, 1/11/2007 to 12/6/2020 delta: 1 day . xtreg YieldatIssuance Green, fe note: Green omitted because of collinearity Fixed-effects (within) regression Number of obs = 50,934 Group variable: ISIN_num Number of groups = 50,934 R-sq: Obs per group: within = . min = 1 between = . avg = 1.0 overall = . max = 1 F(0,0) = 0.00 corr(u_i, Xb) = . Prob > F = . ------------------------------------------------------------------------------ YieldatIss~e | Coef. Std. Err. t P>|t| [95% Conf. Interval] -------------+---------------------------------------------------------------- Green | 0 (omitted) _cons | 3.118114 . . . . . -------------+---------------------------------------------------------------- sigma_u | 2.5257234 sigma_e | . rho | . (fraction of variance due to u_i) ------------------------------------------------------------------------------ F test that all u_i=0: F(50933, 0) = . Prob > F = . . http://www.statalist.org/auth.key command http is unrecognized r(199); . type http://www.statalist.org/auth.key hx2AytymrtzbuzkbhomjtijpOslldynzgutsazy2 . sum YieldatIssuance if Green==1 Variable | Obs Mean Std. Dev. Min Max -------------+--------------------------------------------------------- YieldatIss~e | 828 3.291529 2.469753 .001 12.875 . sum YieldatIssuance if Green==0 Variable | Obs Mean Std. Dev. Min Max -------------+--------------------------------------------------------- YieldatIss~e | 50,106 3.115248 2.526562 .001 28
Comment