

Price of metric universality in vector quantization is at most 0.11 bit
[Submitted on 5 Feb 2026 (v1), last revised 15 Jun 2026 (this version, v2)] · 2026-06-17 · via cs.IT updates on arXiv.org


Abstract: Fast computation of a matrix product $W^\top X$ is a workhorse of modern LLMs. To make their deployment more efficient, a popular approach is to use a low-precision approximation $\widehat W$ in place of the true $W$ ("weight-only quantization"). Information theory shows that an optimal algorithm for reducing the precision of $W$ depends on the (second-order) statistics of $X$ and requires a careful alignment of the vector quantization codebook with the PCA directions of $X$ (a process known as "waterfilling allocation"). Dependence of the codebook on the statistics of $X$, however, is highly impractical. This paper proves that there exists a universal codebook that is simultaneously near-optimal for all possible statistics of $X$, in the sense of being at least as good as an $X$-adapted waterfilling codebook with rate reduced by 0.11 bit per dimension, in the case when $W$ is Gaussian. Such a universal codebook would be an ideal candidate for a low-precision storage format, a topic of active research, but alas the existence proof is non-constructive.
Equivalently, our result shows the existence of a net in $\mathbb{R}^n$ that is a nearly optimal covering of a sphere simultaneously with respect to all Hilbert norms.
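
To make the "waterfilling allocation" in the abstract concrete, here is a minimal sketch of the textbook reverse water-filling rule for independent Gaussian components. It is not the paper's universal codebook, whose existence proof is non-constructive. It only illustrates the $X$-adapted baseline the abstract compares against, under the assumption that the per-direction variances (for example, eigenvalues of the covariance of $X$ scaled by the variance of $W$) are known. The function name `reverse_waterfill` and the example spectrum are hypothetical.

```python
import math

def reverse_waterfill(variances, total_rate, iters=200):
    """Textbook reverse water-filling for independent Gaussian components.

    variances:  per-direction variances (assumed known, e.g. from PCA of X).
    total_rate: total bit budget summed over all directions.
    Returns (water_level, rates_in_bits, distortions).
    """
    def rate_at(theta):
        # A direction gets bits only if its variance exceeds the water level.
        return sum(0.5 * math.log2(v / theta) for v in variances if v > theta)

    # rate_at is non-increasing in theta, so bisect (geometrically) on theta.
    lo, hi = 1e-300, max(variances)
    for _ in range(iters):
        mid = math.sqrt(lo * hi)
        if rate_at(mid) > total_rate:
            lo = mid   # too many bits spent: raise the water level
        else:
            hi = mid   # within budget: try a lower water level
    theta = hi

    rates = [0.5 * math.log2(v / theta) if v > theta else 0.0 for v in variances]
    distortions = [min(v, theta) for v in variances]
    return theta, rates, distortions

# Hypothetical spectrum: three PCA directions, 2 bits in total.
theta, rates, distortions = reverse_waterfill([4.0, 1.0, 0.25], 2.0)
print(theta, rates, distortions)
```

With this spectrum the weakest direction falls below the water level and receives no bits, which is exactly the dependence on the statistics of $X$ that a universal codebook would avoid.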

Submission history

From: Alina Harbuzova
[v1] Thu, 5 Feb 2026 15:46:53 UTC (307 KB)
[v2] Mon, 15 Jun 2026 20:50:08 UTC (315 KB)