
























Abstract:The Minkowski weighted $k$-means ($mwk$-means) algorithm extends classical $k$-means by incorporating feature weights and a Minkowski distance. We first show that the $mwk$-means objective can be expressed as a power-mean aggregation of within-cluster dispersions, with the order determined by the Minkowski exponent $p$. This formulation reveals how $p$ controls the transition between selective and uniform use of features. Using this representation, we derive bounds for the objective function and characterise the structure of the feature weights, showing that they depend only on relative dispersion and follow a power-law relationship with dispersion ratios. This leads to explicit guarantees on the suppression of high-dispersion features, and we establish convergence of the algorithm. Building on these theoretical results, we introduce Cluster-Adaptive Feature Extraction (CAFE), a method that uses the $mwk$-means feature weights to rescale the data prior to unsupervised feature extraction. We prove that this rescaling reverses the within-cluster dispersion ordering, suppressing noisy features and amplifying informative ones. Numerous experiments conducted under controlled within-cluster noise show that CAFE consistently improves the results of traditional feature extraction methods.
| Subjects: | Machine Learning (cs.LG) |
| Cite as: | arXiv:2603.25958 [cs.LG] |
| (or arXiv:2603.25958v2 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2603.25958 arXiv-issued DOI via DataCite |
From: Renato Cordeiro de Amorim [view email]
[v1]
Thu, 26 Mar 2026 22:57:47 UTC (152 KB)
[v2]
Thu, 21 May 2026 13:16:00 UTC (160 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。