惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

P
Proofpoint News Feed
The Last Watchdog
The Last Watchdog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Know Your Adversary
Know Your Adversary
P
Privacy & Cybersecurity Law Blog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
T
Threatpost
www.infosecurity-magazine.com
www.infosecurity-magazine.com
W
WeLiveSecurity
Scott Helme
Scott Helme
Google DeepMind News
Google DeepMind News
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
G
GRAHAM CLULEY
M
MIT News - Artificial intelligence
博客园 - 【当耐特】
V
Visual Studio Blog
Apple Machine Learning Research
Apple Machine Learning Research
Attack and Defense Labs
Attack and Defense Labs
Google Online Security Blog
Google Online Security Blog
S
Security @ Cisco Blogs
博客园_首页
J
Java Code Geeks
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
H
Hacker News: Front Page
雷峰网
雷峰网
K
Kaspersky official blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 司徒正美
T
Tor Project blog
阮一峰的网络日志
阮一峰的网络日志
L
LangChain Blog
I
Intezer
C
CXSECURITY Database RSS Feed - CXSecurity.com
G
Google Developers Blog
Help Net Security
Help Net Security
博客园 - Franky
U
Unit 42
P
Proofpoint News Feed
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
量子位
L
LINUX DO - 热门话题
N
News and Events Feed by Topic
MyScale Blog
MyScale Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
N
News and Events Feed by Topic
H
Help Net Security
Blog — PlanetScale
Blog — PlanetScale
T
Threat Research - Cisco Blogs
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
TaoSecurity Blog
TaoSecurity Blog

cs.LG updates on arXiv.org

Portfolio Optimization Proxies under Label Scarcity and Regime Shifts via Bayesian and Deterministic Students under Semi-Supervised Sandwich Training TOPCELL: Topology Optimization of Standard Cell via LLMs Metric-Aware Principal Component Analysis (MAPCA):A Unified Framework for Scale-Invariant Representation Learning Calibrate-Then-Delegate: Safety Monitoring with Risk and Budget Guarantees via Model Cascades Heat and Matérn Kernels on Matchings When Missing Becomes Structure: Intent-Preserving Policy Completion from Financial KOL Discourse Path-Sampled Integrated Gradients Non-intrusive Learning of Physics-Informed Spatio-temporal Surrogate for Accelerating Design Asynchronous Probability Ensembling for Federated Disaster Detection Scouting By Reward: VLM-TO-IRL-Driven Player Selection For Esports Quantization of Spiking Neural Networks Beyond Accuracy An unsupervised decision-support framework for multivariate biomarker analysis in athlete monitoring Predicting Post-Traumatic Epilepsy from Clinical Records using Large Language Model Embeddings Material-Agnostic Zero-Shot Thermal Inference for Metal Additive Manufacturing via a Parametric PINN Framework Physics-Informed Machine Learning for Pouch Cell Temperature Estimation From Risk to Rescue: An Agentic Survival Analysis Framework for Liquidation Prevention CLion: Efficient Cautious Lion Optimizer with Enhanced Generalization Zeroth-Order Optimization at the Edge of Stability Mean Flow Policy Optimization Gating Enables Curvature: A Geometric Expressivity Gap in Attention A Mechanistic Account of Attention Sinks in GPT-2: One Circuit, Broader Implications for Mitigation Expressivity of Transformers: A Tropical Geometry Perspective Assessing the Performance-Efficiency Trade-off of Foundation Models in Probabilistic Electricity Price Forecasting Wasserstein Formulation of Reinforcement Learning. An Optimal Transport Perspective on Policy Optimization Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization Learning Ad Hoc Network Dynamics via Graph-Structured World Models Adaptive Test-Time Compute Allocation for Reasoning LLMs via Constrained Policy Optimization Curvature-Aligned Probing for Local Loss-Landscape Stabilization Does RL Expand the Capability Boundary of LLM Agents? A PASS@(k,T) Analysis xFODE+: Explainable Type-2 Fuzzy Additive ODEs for Uncertainty Quantification xFODE: An Explainable Fuzzy Additive ODE Framework for System Identification Multi-User mmWave Beam and Rate Adaptation via Combinatorial Satisficing Bandits SegWithU: Uncertainty as Perturbation Energy for Single-Forward-Pass Risk-Aware Medical Image Segmentation An Analysis of Regularization and Fokker-Planck Residuals in Diffusion Models for Image Generation Class Unlearning via Depth-Aware Removal of Forget-Specific Directions Beyond Independent Frames: Latent Attention Masked Autoencoders for Multi-View Echocardiography When Fairness Metrics Disagree: Evaluating the Reliability of Demographic Fairness Assessment in Machine Learning STEP-Parts: Geometric Partitioning of Boundary Representations for Large-Scale CAD Processing Beyond Importance Sampling: Rejection-Gated Policy Optimization Can LLMs Score Medical Diagnoses and Clinical Reasoning as well as Expert Panels? Reasoning Dynamics and the Limits of Monitoring Modality Reliance in Vision-Language Models SOLIS: Physics-Informed Learning of Interpretable Neural Surrogates for Nonlinear Systems Catching Every Ripple: Enhanced Anomaly Awareness via Dynamic Concept Adaptation HAMSA: Scanning-Free Vision State Space Models via SpectralPulseNet Seen-to-Scene: Keep the Seen, Generate the Unseen for Video Outpainting Physically-Induced Atmospheric Adversarial Perturbations: Enhancing Transferability and Robustness in Remote Sensing Image Classification CMTM: Cross-Modal Token Modulation for Unsupervised Video Object Segmentation ELMoE-3D: Leveraging Intrinsic Elasticity of MoE for Hybrid-Bonding-Enabled Self-Speculative Decoding in On-Premises Serving ConfLayers: Adaptive Confidence-based Layer Skipping for Self-Speculative Decoding Synthetic Tabular Generators Fail to Preserve Behavioral Fraud Patterns: A Benchmark on Temporal, Velocity, and Multi-Account Signals Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates Automated co-design of high-performance thermodynamic cycles via graph-based hierarchical reinforcement learning Bias-Corrected Adaptive Conformal Inference for Multi-Horizon Time Series Forecasting Counterfactual Peptide Editing for Causal TCR--pMHC Binding Inference Binomial Gradient-Based Meta-Learning for Enhanced Meta-Gradient Estimation Enhancing Confidence Estimation in Telco LLMs via Twin-Pass CoT-Ensembling MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models Physics-informed reservoir characterization from bulk and extreme pressure events with a differentiable simulator Some Theoretical Limitations of t-SNE Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding Multi-Task LLM with LoRA Fine-Tuning for Automated Cancer Staging and Biomarker Extraction Text-Attributed Knowledge Graph Enrichment with Large Language Models for Medical Concept Representation Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models When Less Latent Leads to Better Relay: Information-Preserving Compression for Latent Multi-Agent LLM Collaboration BioTrain: Sub-MB, Sub-50mW On-Device Fine-Tuning for Edge-AI on Biosignals Linear Probe Accuracy Scales with Model Size and Benefits from Multi-Layer Ensembling Dataset-Level Metrics Attenuate Non-Determinism: A Fine-Grained Non-Determinism Evaluation in Diffusion Language Models WIN-U: Woodbury-Informed Newton-Unlearning as a retain-free Machine Unlearning Framework FAST: A Synergistic Framework of Attention and State-space Models for Spatiotemporal Traffic Prediction Adaptive Unknown Fault Detection and Few-Shot Continual Learning for Condition Monitoring in Ultrasonic Metal Welding Universality of Gaussian-Mixture Reverse Kernels in Conditional Diffusion Computational framework for multistep metabolic pathway design LEGO-MOF: Equivariant Latent Manipulation for Editable, Generative, and Optimizable MOF Design Learning Inference Concurrency in DynamicGate MLP Structural and Mathematical Justification Parameter-efficient Quantum Multi-task Learning Enhancing Reinforcement Learning for Radiology Report Generation with Evidence-aware Rewards and Self-correcting Preference Learning Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Self-Organizing Maps with Optimized Latent Positions A Bayesian Framework for Uncertainty-Aware Explanations in Power Quality Disturbance Classification Optimization with SpotOptim Physics-Informed Neural Networks for Solving Derivative-Constrained PDEs Spectral Thompson sampling Online learning with noisy side observations Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning Robust Ultra Low-Bit Post-Training Quantization via Stable Diagonal Curvature Estimate Composite Silhouette: A Subsampling-based Aggregation Strategy RPS: Information Elicitation with Reinforcement Prompt Selection UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization Beyond State Consistency: Behavior Consistency in Text-Based World Models Simulation-Based Optimisation of Batting Order and Bowling Plans in T20 Cricket Hardware-Efficient Neuro-Symbolic Networks with the Exp-Minus-Log Operator Drowsiness-Aware Adaptive Autonomous Braking System based on Deep Reinforcement Learning for Enhanced Road Safety MolCryst-MLIPs: A Machine-Learned Interatomic Potentials Database for Molecular Crystals DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Unsupervised Anomaly Detection in Process-Complex Industrial Time Series: A Real-World Case Study Quantum Machine Learning for Colorectal Cancer Data: Anastomotic Leak Classification and Risk Factors Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation BOAT: Navigating the Sea of In Silico Predictors for Antibody Design via Multi-Objective Bayesian Optimization PRiMeFlow: Capturing Complex Expression Heterogeneity in Perturbation Response Modelling Unsupervised domain transfer: Overcoming signal degradation in sleep monitoring by increasing scoring realism
Price of metric universality in vector quantization is at most 0.11 bit
Alina Harbuzova, Or Ordentlich, Yury Polyanskiy · 2026-02-05 · via cs.LG updates on arXiv.org

Fast computation of a matrix product $W^\top X$ is a workhorse of modern LLMs. To make their deployment more efficient, a popular approach is that of using a low-precision approximation $\widehat W$ in place of true $W$ (``weight-only quantization''). Information theory demonstrates that an optimal algorithm for reducing precision of $W$ depends on the (second order) statistics of $X$ and requires a careful alignment of vector quantization codebook with PCA directions of $X$ (a process known as ``waterfilling allocation''). Dependence of the codebook on statistics of $X$, however, is highly impractical. This paper proves that there exist a universal codebook that is simultaneously near-optimal for all possible statistics of $X$, in the sense of being at least as good as an $X$-adapted waterfilling codebook with rate reduced by 0.11 bit per dimension in the case when $W$ is Gaussian. Such universal codebook would be an ideal candidate for the low-precision storage format, a topic of active modern research, but alas the existence proof is non-constructive. Equivalently, our result shows existence of a net in $\mathbb{R}^n$ that is a nearly-optimal covering of a sphere simultaneously with respect to all Hilbert norms.