I am performing a cluster analysis and want to combine hierarchical and non-hierarchical techniques.
I want to proceed as follow:
I know that I can obtain the means of all clustering variables for the different clusters with the following code:
tabstat varlist, by (clusterXY)
Is this vector of means then the cluster centroid? And how can I employ these centroids in kmeans?
I found the random [(seed#)] and prandom [(seed#)] option, where one can define a random number seed. Is that the option I need? And if yes, how do I proceed? As far as I understood it, I can only type in one number as a “seed”, but I have 12 variable means as my cluster centroids.
I want to proceed as follow:
- Hierarchical Cluster analysis using Wards method to determine the appropriate number of clusters and the cluster centroids
- Kmeans cluster analysis, by employing the number of groups and their centroids generated by the solution of wards method.
I know that I can obtain the means of all clustering variables for the different clusters with the following code:
tabstat varlist, by (clusterXY)
Is this vector of means then the cluster centroid? And how can I employ these centroids in kmeans?
I found the random [(seed#)] and prandom [(seed#)] option, where one can define a random number seed. Is that the option I need? And if yes, how do I proceed? As far as I understood it, I can only type in one number as a “seed”, but I have 12 variable means as my cluster centroids.
Comment