Sklearn k-means clustering (weighted), determining optimum sample weight for each feature?

K-means clustering in sklearn, where the number of clusters is known in advance (it is 2) and there are multiple features. The feature values initially have no weights assigned, i.e. all features are treated as equally weighted. The task, however, is to assign a custom weight to each feature in order to get the best possible separation of the two clusters. How can I determine the optimum sample weights (sample_weight) for each feature so that the separation of the two clusters is maximized? If this is not possible with k-means, or with sklearn, I am interested in any alternative clustering solution; the point is that I need a method that automatically determines appropriate weights for multivariate features in order to maximize cluster separation.
Asked by zlatko
There are 2 best solutions below.

Answer by Ebo:
From what I understand of the sklearn docs, sample_weight is used to assign weights to observations (samples), not features.

If you want to give weights to your features, you can refer to this post: How can I change feature's weight for K-Means clustering?
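A minimal sketch of the usual feature-weighting trick (not shown in the original answer): because k-means minimizes squared Euclidean distances, multiplying each feature column by the square root of its weight scales that feature's squared contribution to the distance by the weight. The data and the weights array below are hypothetical, chosen only for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans

# Hypothetical example data: 100 samples, 3 features
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))

# Hypothetical per-feature weights (for illustration only)
weights = np.array([1.0, 0.5, 2.0])

# k-means minimizes squared Euclidean distances, so scaling each column
# by sqrt(weight) multiplies that feature's squared contribution by weight
X_weighted = X * np.sqrt(weights)

labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X_weighted)
```

With this transformation a plain KMeans fit on X_weighted behaves as a feature-weighted k-means on X, so no sklearn internals need to be modified.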
Answer by zlatko (the asker):

In the meantime, I have implemented the following: clustering on each feature separately, then calculating the silhouette score, Calinski-Harabasz score, Dunn score, and inverse Davies-Bouldin score for each feature separately. Those scores are then scaled to the same magnitude and reduced with PCA to a single component, which yields a weight for each feature. This approach seems to produce reasonable results. A better approach would presumably be a full factorial experiment (DOE) over the weights, but this simple approach produces satisfactory results as well.
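A minimal sketch of that pipeline, under a few assumptions the answer does not spell out: the Dunn index is not in sklearn, so dunn_index below is a simple hand-rolled version; MinMaxScaler stands in for "scaling to the same magnitude"; and because the sign of a PCA component is arbitrary, the sketch flips it to correlate positively with the raw scores and then shifts/normalizes to non-negative weights.

```python
import numpy as np
from scipy.spatial.distance import cdist, pdist
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.metrics import (calinski_harabasz_score, davies_bouldin_score,
                             silhouette_score)
from sklearn.preprocessing import MinMaxScaler

def dunn_index(X, labels):
    # Dunn index: smallest inter-cluster distance divided by largest
    # intra-cluster diameter (simple O(n^2) version; not part of sklearn)
    clusters = [X[labels == k] for k in np.unique(labels)]
    min_inter = min(cdist(a, b).min()
                    for i, a in enumerate(clusters)
                    for b in clusters[i + 1:])
    max_intra = max(pdist(c).max() for c in clusters if len(c) > 1)
    return min_inter / max_intra

def feature_weights(X, n_clusters=2, random_state=0):
    # One row of cluster-validity scores per feature,
    # clustering on each feature alone
    scores = []
    for j in range(X.shape[1]):
        xj = X[:, [j]]
        labels = KMeans(n_clusters=n_clusters, n_init=10,
                        random_state=random_state).fit_predict(xj)
        scores.append([
            silhouette_score(xj, labels),
            calinski_harabasz_score(xj, labels),
            dunn_index(xj, labels),
            1.0 / davies_bouldin_score(xj, labels),  # inverse: higher = better
        ])
    # "Scale to the same magnitude" -- MinMaxScaler is one way to do it
    scores = MinMaxScaler().fit_transform(np.array(scores))
    # Collapse the four scores per feature into a single PCA component
    w = PCA(n_components=1).fit_transform(scores).ravel()
    # PCA component signs are arbitrary; flip so that higher raw scores
    # map to higher weights
    if np.corrcoef(w, scores.mean(axis=1))[0, 1] < 0:
        w = -w
    w -= w.min()            # shift to non-negative (an assumption)
    return w / w.sum()      # normalize so the weights sum to 1
```

The resulting weights could then be applied as in the earlier sketch, e.g. X * np.sqrt(feature_weights(X)), before running k-means on the full feature set.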