Hi there!
I'm carrying out (wards, using calinski stopping rule) cluster analysis on a basket of bank balance sheet variables to identify different types of bank behaviour and change overtime in Europe. However, when I run the clusters for a year, it might give me 3 clusters as the preferred (according to the pseudo-F) but one of the clusters just has 1 bank in it, whilst the other 2 have hundreds. I can exclude it, but then the next time run the cluster there will be a cluster with just 1 or 2 banks compared to the others.
Does anyone here know if these cases would likely be clearly outliers I need to deal with prior to running the cluster commands (and what might be some recommended outlier identifying methods?), or this always likely to occur when running cluster analysis?
General details - sample size is off approx.3000 banks, for the years 2010-2016 inclusive being clustered against a basket off 8 balance sheet variables - data is in long format, no missing data).
Many thanks for any insight you can offer, it's much appreciated,
Olly
I'm carrying out (wards, using calinski stopping rule) cluster analysis on a basket of bank balance sheet variables to identify different types of bank behaviour and change overtime in Europe. However, when I run the clusters for a year, it might give me 3 clusters as the preferred (according to the pseudo-F) but one of the clusters just has 1 bank in it, whilst the other 2 have hundreds. I can exclude it, but then the next time run the cluster there will be a cluster with just 1 or 2 banks compared to the others.
Does anyone here know if these cases would likely be clearly outliers I need to deal with prior to running the cluster commands (and what might be some recommended outlier identifying methods?), or this always likely to occur when running cluster analysis?
General details - sample size is off approx.3000 banks, for the years 2010-2016 inclusive being clustered against a basket off 8 balance sheet variables - data is in long format, no missing data).
Many thanks for any insight you can offer, it's much appreciated,
Olly
Comment