Hello Stata Listers,
I am using a quarterly panel dataset of U.S. commercial banks from 2000 to 2015 to predict bank performance. One of my dependent variables, failure, is 1 when a bank fails in a given quarter and 0 otherwise. Because bank failures do not occur often, only 0.15% of observations out of 355,844 observations are a "1" for failure while the other 99.85% of observations are "0". My independent variables are a standard capital ratio, year dummies, and quarter dummies. I have attempted running the following commands:
xi: xtlogit failure capital_ratio i.year i.quarter, fe vce(bootstrap)
xi: logit failure capital_ratio i.year i.quarter, cluster(bank_id)
xi: probit failure capital_ratio i.year i.quarter, cluster(bank_id)
xi: cloglog failure capital_ratio i.year i.quarter, cluster(bank_id)
I was not able to obtain convergence with any of the above estimators. I believe my highly skewed dependent variable is causing the problem. I am able to obtain linear probability model estimates using the following command:
xi: xtreg failure capital_ratio i.year i.quarter, fe vce(robust)
I believe my highly skewed dependent variable is causing the problem with convergence using logit, probit, or cloglog. If it is and I can obtain results from the linear probability model that are flawed (e.g. predicted negative values for failure) but useful for my research question (in my case, I only care about the sign of the capital ratio coefficient and its magnitude does not matter much), should I use the linear probability model results? Is there another estimator or technique for handling a dependent variable that is as heavily skewed as the failure variable?
Thank you for any help you can provide.
Sincerely,
Neil
I am using a quarterly panel dataset of U.S. commercial banks from 2000 to 2015 to predict bank performance. One of my dependent variables, failure, is 1 when a bank fails in a given quarter and 0 otherwise. Because bank failures do not occur often, only 0.15% of observations out of 355,844 observations are a "1" for failure while the other 99.85% of observations are "0". My independent variables are a standard capital ratio, year dummies, and quarter dummies. I have attempted running the following commands:
xi: xtlogit failure capital_ratio i.year i.quarter, fe vce(bootstrap)
xi: logit failure capital_ratio i.year i.quarter, cluster(bank_id)
xi: probit failure capital_ratio i.year i.quarter, cluster(bank_id)
xi: cloglog failure capital_ratio i.year i.quarter, cluster(bank_id)
I was not able to obtain convergence with any of the above estimators. I believe my highly skewed dependent variable is causing the problem. I am able to obtain linear probability model estimates using the following command:
xi: xtreg failure capital_ratio i.year i.quarter, fe vce(robust)
I believe my highly skewed dependent variable is causing the problem with convergence using logit, probit, or cloglog. If it is and I can obtain results from the linear probability model that are flawed (e.g. predicted negative values for failure) but useful for my research question (in my case, I only care about the sign of the capital ratio coefficient and its magnitude does not matter much), should I use the linear probability model results? Is there another estimator or technique for handling a dependent variable that is as heavily skewed as the failure variable?
Thank you for any help you can provide.
Sincerely,
Neil
Comment