Dear all,
I am writing my masters thesis on the effect of market competition on risk-taking behavior of banks. My main explanatory variable will be a measure of market concentration, HHI index or the Lerner Index.
I am working with the Call Report data for US banks which is a panel data set that has all the banks in the US per year (2000 - 2020) and I have generated the HHI Index using assets with the following code:
entropyetc BankID [w=Assets], by(qdate) gen(4=HHI_Assets)
qdate is a quarterly time variable indicating the year and the respective quarter
After looking at similar literature on HHI values for deposits and assets, the calculated HHI values seem too low to be correct. For example here is the histogram of HHI for the 588,888 observations I have in the dataset:
HHI_Assets.gph
Here is also a little part of the data:

I don't see the mistake I've done with the formula I used above. I appreciate any input on this matter.
Thank you,
Viktor
I am writing my masters thesis on the effect of market competition on risk-taking behavior of banks. My main explanatory variable will be a measure of market concentration, HHI index or the Lerner Index.
I am working with the Call Report data for US banks which is a panel data set that has all the banks in the US per year (2000 - 2020) and I have generated the HHI Index using assets with the following code:
entropyetc BankID [w=Assets], by(qdate) gen(4=HHI_Assets)
qdate is a quarterly time variable indicating the year and the respective quarter
After looking at similar literature on HHI values for deposits and assets, the calculated HHI values seem too low to be correct. For example here is the histogram of HHI for the 588,888 observations I have in the dataset:
HHI_Assets.gph
Here is also a little part of the data:
I don't see the mistake I've done with the formula I used above. I appreciate any input on this matter.
Thank you,
Viktor
Comment