惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

math updates on arXiv.org

Riemannian Archetypal Analysis: Interpretable non-linear data analysis on deformed star distributions Variational formulation of hyperbolic conservation laws TriVAL: A Tri-Validation Framework for Faithful Automatic Optimization Modeling A computational phase transition for learning-to-sample from Ising models Minimax Limits of k-Fold Cross-Validation via Majority Chaotic and periodic behavior of jeu de taquin on infinite Young tableaux On the complexity of extensions of non-archimedean Polish groups admitting a compatible complete left-invariant metric Implicit Binarization via Complex Phase Dynamics in Combinatorial Optimization Autoregression-Free Neural Operators for Time-Dependent PDEs On the binary digits of the Erdős-Borwein constant Topological analysis in $\mathcal{R}(p,q)-$anisotropic sector and nuclear space on $\mathcal{R}(p,q)-$quantum deformed algebra On the Differential-Geometric Equivalence of Hellinger-Kantorovich and Cone-Wasserstein Spaces Quantitative Stability of Generalized $p$-Area Minimizing Surfaces Geometry-Preserving Nudged Elastic Band and Dimer Methods under Anisotropic Force Uncertainty SinFormer: A Tailored Transformer for Robust Radio Frequency Fingerprint Identification Global linear convergence of entropy-regularized softmax policy gradient beyond tabular MDPs Polynomial Context-Truncation Sensitivity in Autoregressive Language Models: Sequential Wyner-Ziv Bounds for KV Cache Compression Lifted Schrödinger Bridges for Gaussian Mixture Endpoints: Projection Gaps and Path-Space Obstructions Complement Submodular Information Measures for Balanced and Robust Data Selection Long Memory in Intrinsically Dynamic Factor Models On potentials of distributions in Orlicz-Hardy type spaces on the Heisenberg group Betti numbers of inductively pierced codes Convergence and non-asymptotic error analysis for kinetic Langevin samplers using the exact harmonic Langevin integrator An Exact Continuous Conductance Formulation of the Hamiltonian Path Problem Universality theorems for generalized splines Improving the Accuracy of the Exponentially Fitted Scheme on Piecewise Uniform Meshes The complexity of frugal digraph homomorphisms Fibrations, the First Betti Number, and Almost Nonnegative Ricci Curvature On the Shirshov--Cohn theorem for JB-algebras On higher order Morrey's inequalities over RCD{\boldmath$(K,N)$}-spaces The $θ= \infty$ Conjecture and the Riemann Hypothesis for Automorphic $L$-functions Optimal and Order-optimal Gated Priority-based Greedy Policies for Two-layer Multi-item Order Fulfillment Fundamental Limitation in Explaining AI Feature Learning in Wide Neural Networks under $μ$P: Identifiability and Sparse-Dictionary Decomposition of the Mean-Field Limit WINO: A Weak-Form Physics Informed Neural Operator for Hyperelasticity on Variable Domains Data-Specific Hyper-Parameter Design: A Paradigm Shift in Reservoir Computing RECTOR: Priority-Aware Rule-Based Reranking for Compliance-Aware Autonomous Driving Trajectory Selection Learning dynamical systems with biochemically informed neural ordinary differential equations Leveraging Gauge Freedom for Learning Non-Gradient Population Dynamics of Stochastic Systems Decoupling for AD-regular sets on the parabola From Halin's Edge Removability to Matching Removability in $k$-Connected Graphs Branched Signature Kernel Solvers for ODEs with rough Single-Trajectory signals Embedding more than 8 symplectic balls in $\mathbb{C}\mathrm{P}^2$ Associative Structures in Pseudo-Riemannian Lie Algebras Refined asymptotics of the steady Navier Stokes equation around small Landau solutions Bayesian Conformal-Projective Prediction Variational Contraction Conditions for Iterative Algorithms in Multi-Population Discrete-Time Regularized Mean-Field Games A Quasi Maximum Likelihood Estimation Method for Bergomi-Type Volatility Models Arakelov inequality for families of pairs Crossing limit cycles of discontinuous piecewise differential systems with nilpotent saddles separated by a nonregular line Framing Triangulations for Arbitrary Integer Flow Polytopes The limits of Schur multipliers in Pólya conversion problems for the $q$-permanent function Precise asymptotic estimates and non-degeneracy of solutions to a biharmonic problem with large exponents in dimension four On the Common Generalization of Gentle Algebras and Framed Directed Acyclic Graphs Geometry of Neighborhoods of Minimal Rational Curves Hydrodynamic Limit of the Boltzmann Equation toward Generic Riemann Solutions with Shocks Spectral analysis and sine transform based preconditioning for a structure preserving stabilized scheme approximating the space-fractional Allen Cahn equation with logarithmic potential Strichartz estimates for Schrödinger equations with the multipole Aharonov--Bohm Hamiltonian Exponential mixing properties of nonlinear functional autoregressive models On the fractional parts of certain sequences of $\xi \alpha^{n}$ Comparison Geometry on Manifolds with Density via Modified Hessians Non-property (t) for SO(n,1) Designs, linear codes, plateaued functions, and their interconnections Efficient DP-SGD for LLMs with Randomized Clipping Deep ZakaiJ: Structured Filtering for Jump-Diffusion Time Series Forecasting EMA-Nesterov: Stabilizing Nesterov's Lookahead for Accelerated Deep Learning Optimization Fuzzy PyTorch: Rapid Numerical Variability Evaluation for Deep Learning Models On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits Constraint-Anchored Attribution: Feasibility-Certified Counterfactuals and Bonferroni-PAC Sufficient Subsets for Neural CO Policies The Normalized Maximum Likelihood for Regular Non-Smooth Models: Measure-Theoretic Foundations and Geometric Sampling IV-Net: A neural network for elliptic PDEs with random and highly varying coefficients On the Sample Complexity of Robust Binary Hypothesis Testing Optimal Non-Asymptotic Edgeworth Expansions for Multivariate Neural Network Outputs Random Neural Network Expressivity for Non-Linear Partial Differential Equations Algorithms with Polynomially-Improved Approximation Factors for the $2 \rightarrow q$ Norm, and Applications Nyström Kernel Stein Discrepancy Tests Estimating Mixture Distributions via Stochastic Mirror Descent BoxLitE: A Faithful Knowledge Base Embedding Based on Convex Optimization Repeated Sequences Reveal Gaps between Large Language Models and Natural Language Accuracy Analysis of the Proxy Point Method with Applications to Some Toeplitz Matrices PAC Learning with Bandit Feedback: Sharp Sample Complexity in the Realizable Setting Irremediably singular quantum branched covers Equivariant homotopy dense subsets in the realm of uniform G-ANR spaces Enumerating Pattern Avoiding Parking Functions From dual connections to gravitational field equations -- the curvature and Einstein tensors of the $α$ - connection of a quasi-statistical manifold Learning manifold diffusion semigroups from graph transition matrices Threshold-Safe Shock Absorption in a Compartmental Voter-Flow Model:\ A Conservative Impulse-Control Benchmark Consistent Identification of Top-$K$ Nodes in Noisy Networks On the Harris-Viehmann conjecture for Hodge-Newton reducible local Shimura data of abelian type New inverse problems for a time-switched system of wave and diffusion equations Odd Koschorke classes Distributional Conformal Prediction for Markov Processes A magnetic eigenvalue bound in the disk On some arithmetic conditions of recurrent sequences modulo prime p Parallel Differentiable Reachability for Learning and Planning with Certified Neural Dynamics and Controllers Kernel Embedding for Operator-Valued Measures and Its Application to Quantum Tomography The petrol station game: the regional average price. A mean field analysis Boundary stabilization of an Euler-Bernoulli beam with axial force and internal delay On Permutation Groups of Cyclic Codes over Finite Fields A Parameterized Algorithm for Testing whether the Limit of a Diagram is Empty
Computing k-means in mixed precision
Erin Carson, · 2026-05-26 · via math updates on arXiv.org

View PDF HTML (experimental)

Abstract:Motivated by the increasing availability of low- and mixed-precision arithmetic on modern hardware, we develop mixed-precision variants of Lloyd's algorithm for k-means clustering. The main ingredient is a family of mixed-precision kernels for Euclidean distance computation. These kernels are guided by rounding-error analysis and use a simple reliability test to decide whether the expanded distance formula can be evaluated safely with low precision or a higher-precision correction by the direct distance formula is required. Thus, most distance computations can be carried out with low precision, while high-precision arithmetic is used selectively when cancellation may lead to a loss of accuracy. We evaluate the proposed methods on large-scale distance-computation benchmarks, synthetic clustering problems, and image-segmentation tasks. The experiments verify that the mixed-precision kernels on GPUs can substantially improve performance while retaining the accuracy and convergence behavior of higher-precision baselines. In particular, our CUDA implementations achieve orders-of-magnitude speedups over the CPU implementation in \texttt{scikit-learn} and up to $4\times$ faster than the IEEE double-precision \texttt{cdist} routine of \texttt{PyTorch} on NVIDIA A100 GPU, while providing improved numerical robustness in cancellation-prone regimes. The resulting mixed-precision k-means methods are effective for clustering and image segmentation, although the observed gains depend on the dataset, feature dimension, and number of clusters. These results demonstrate that mixed-precision distance kernels can offer a useful trade-off between performance and accuracy for k-means clustering and suggest that similar ideas may be beneficial for other distance-based machine learning methods.
Subjects: Numerical Analysis (math.NA)
MSC classes: 65G50, 62H30, 68T05, 68W10
Cite as: arXiv:2407.12208 [math.NA]
  (or arXiv:2407.12208v3 [math.NA] for this version)
  https://doi.org/10.48550/arXiv.2407.12208

arXiv-issued DOI via DataCite

Submission history

From: Xiaobo Liu [view email]
[v1] Tue, 16 Jul 2024 22:48:35 UTC (5,736 KB)
[v2] Sat, 18 Apr 2026 15:51:30 UTC (5,267 KB)
[v3] Mon, 25 May 2026 12:19:39 UTC (59 KB)