Determining number of clusters
one common method is the silhouette score
s = \frac{b-a}{max(a,b)}
where a
is averaged within-cluster distance, and b
is average distance to all non-current clusters.
Higher values of s
suggest good clustering.
https://towardsdatascience.com/silhouette-coefficient-validating-clustering-techniques-e976bb81d10c