惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Apple Machine Learning Research
Apple Machine Learning Research
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
Jina AI
Jina AI
F
Fortinet All Blogs
有赞技术团队
有赞技术团队
月光博客
月光博客
爱范儿
爱范儿
U
Unit 42
B
Blog RSS Feed
aimingoo的专栏
aimingoo的专栏
P
Palo Alto Networks Blog
WordPress大学
WordPress大学
D
DataBreaches.Net
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
大猫的无限游戏
大猫的无限游戏
H
Hackread – Cybersecurity News, Data Breaches, AI and More
博客园 - Franky
T
Threatpost
W
WeLiveSecurity
S
SegmentFault 最新的问题
Scott Helme
Scott Helme
C
Cisco Blogs
Application and Cybersecurity Blog
Application and Cybersecurity Blog
Security Archives - TechRepublic
Security Archives - TechRepublic
G
Google Developers Blog
www.infosecurity-magazine.com
www.infosecurity-magazine.com
博客园 - 聂微东
Forbes - Security
Forbes - Security
L
LINUX DO - 最新话题
Simon Willison's Weblog
Simon Willison's Weblog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Hacker News - Newest:
Hacker News - Newest: "LLM"
I
InfoQ
T
Tor Project blog
S
Security @ Cisco Blogs
Know Your Adversary
Know Your Adversary
MongoDB | Blog
MongoDB | Blog
Google Online Security Blog
Google Online Security Blog
P
Privacy & Cybersecurity Law Blog
Hugging Face - Blog
Hugging Face - Blog
C
CERT Recently Published Vulnerability Notes
N
News and Events Feed by Topic
博客园 - 叶小钗
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
博客园 - 司徒正美
V2EX - 技术
V2EX - 技术
Cisco Talos Blog
Cisco Talos Blog
Cloudbric
Cloudbric
Google DeepMind News
Google DeepMind News

cs.LG updates on arXiv.org

Synthetic Tabular Generators Fail to Preserve Behavioral Fraud Patterns: A Benchmark on Temporal, Velocity, and Multi-Account Signals Automated co-design of high-performance thermodynamic cycles via graph-based hierarchical reinforcement learning Counterfactual Peptide Editing for Causal TCR--pMHC Binding Inference Binomial Gradient-Based Meta-Learning for Enhanced Meta-Gradient Estimation Enhancing Confidence Estimation in Telco LLMs via Twin-Pass CoT-Ensembling MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models Physics-informed reservoir characterization from bulk and extreme pressure events with a differentiable simulator Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding Multi-Task LLM with LoRA Fine-Tuning for Automated Cancer Staging and Biomarker Extraction Text-Attributed Knowledge Graph Enrichment with Large Language Models for Medical Concept Representation Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models When Less Latent Leads to Better Relay: Information-Preserving Compression for Latent Multi-Agent LLM Collaboration BioTrain: Sub-MB, Sub-50mW On-Device Fine-Tuning for Edge-AI on Biosignals Linear Probe Accuracy Scales with Model Size and Benefits from Multi-Layer Ensembling Dataset-Level Metrics Attenuate Non-Determinism: A Fine-Grained Non-Determinism Evaluation in Diffusion Language Models WIN-U: Woodbury-Informed Newton-Unlearning as a retain-free Machine Unlearning Framework FAST: A Synergistic Framework of Attention and State-space Models for Spatiotemporal Traffic Prediction Adaptive Unknown Fault Detection and Few-Shot Continual Learning for Condition Monitoring in Ultrasonic Metal Welding Computational framework for multistep metabolic pathway design LEGO-MOF: Equivariant Latent Manipulation for Editable, Generative, and Optimizable MOF Design Learning Inference Concurrency in DynamicGate MLP Structural and Mathematical Justification Parameter-efficient Quantum Multi-task Learning Enhancing Reinforcement Learning for Radiology Report Generation with Evidence-aware Rewards and Self-correcting Preference Learning Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Self-Organizing Maps with Optimized Latent Positions A Bayesian Framework for Uncertainty-Aware Explanations in Power Quality Disturbance Classification Optimization with SpotOptim Physics-Informed Neural Networks for Solving Derivative-Constrained PDEs Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning Robust Ultra Low-Bit Post-Training Quantization via Stable Diagonal Curvature Estimate Composite Silhouette: A Subsampling-based Aggregation Strategy RPS: Information Elicitation with Reinforcement Prompt Selection UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization Beyond State Consistency: Behavior Consistency in Text-Based World Models Simulation-Based Optimisation of Batting Order and Bowling Plans in T20 Cricket Hardware-Efficient Neuro-Symbolic Networks with the Exp-Minus-Log Operator Drowsiness-Aware Adaptive Autonomous Braking System based on Deep Reinforcement Learning for Enhanced Road Safety MolCryst-MLIPs: A Machine-Learned Interatomic Potentials Database for Molecular Crystals DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Unsupervised Anomaly Detection in Process-Complex Industrial Time Series: A Real-World Case Study Quantum Machine Learning for Colorectal Cancer Data: Anastomotic Leak Classification and Risk Factors Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation PRiMeFlow: Capturing Complex Expression Heterogeneity in Perturbation Response Modelling Unsupervised domain transfer: Overcoming signal degradation in sleep monitoring by increasing scoring realism Physics-Informed Neural Networks for Methane Sorption: Cross-Gas Transfer Learning, Ensemble Collapse Under Physics Constraints, and Monte Carlo Dropout Uncertainty Quantification Complex Interpolation of Matrices with an application to Multi-Manifold Learning HUANet: Hard-Constrained Unrolled ADMM for Constrained Convex Optimization Fast Voxelization and Level of Detail for Microgeometry Rendering From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning Momentum Further Constrains Sharpness at the Edge of Stochastic Stability TIP: Token Importance in On-Policy Distillation Neural architectures for resolving references in program code $π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data A Complete Symmetry Classification of Shallow ReLU Networks First-See-Then-Design: A Multi-Stakeholder View for Optimal Performance-Fairness Trade-Offs MAny: Merge Anything for Multimodal Continual Instruction Tuning Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning BOAT: Navigating the Sea of In Silico Predictors for Antibody Design via Multi-Objective Bayesian Optimization HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark ASTER: Latent Pseudo-Anomaly Generation for Unsupervised Time-Series Anomaly Detection Context Sensitivity Improves Human-Machine Visual Alignment Evaluating Supervised Machine Learning Models: Principles, Pitfalls, and Metric Selection SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention Artificial intelligence application in lymphoma diagnosis with Vision Transformer using weakly supervised training Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation Online learning with noisy side observations Spectral Thompson sampling Vision-Language-Action Jump-Starting for Reinforcement Learning Robotic Agents Automatic Charge State Tuning of 300 mm FDSOI Quantum Dots Using Neural Network Segmentation of Charge Stability Diagram Ordinary Least Squares is a Special Case of Transformer (How) Learning Rates Regulate Catastrophic Overtraining Golden Handcuffs make safer AI agents Design Space Exploration of Hybrid Quantum Neural Networks for Chronic Kidney Disease C-voting: Confidence-Based Test-Time Voting without Explicit Energy Functions From Alignment to Prediction: A Study of Self-Supervised Learning and Predictive Representation Learning Representation over Routing: Diagnosing Temporal Routing Pathologies in Multi-Timescale PPO SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning Monthly Diffusion v0.9: A Latent Diffusion Model for the First AI-MIP Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus Universality of Gaussian-Mixture Reverse Kernels in Conditional Diffusion From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning Asymmetric-Loss-Guided Hybrid CNN-BiLSTM-Attention Model for Industrial RUL Prediction with Interpretable Failure Heatmaps MyoVision: A Mobile Research Tool and NEATBoost-Attention Ensemble Framework for Real Time Chicken Breast Myopathy Detection Outperforming Self-Attention Mechanisms in Solar Irradiance Forecasting via Physics-Guided Neural Networks A KL Lens on Quantization: Fast, Forward-Only Sensitivity for Mixed-Precision SSM-Transformer Models Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence Diffusion Sequence Models for Generative In-Context Meta-Learning of Robot Dynamics Beyond Uniform Sampling: Synergistic Active Learning and Input Denoising for Robust Neural Operators The Spectrascapes Dataset: Street-view imagery beyond the visible captured using a mobile platform Deep Spatially-Regularized and Superpixel-Based Diffusion Learning for Unsupervised Hyperspectral Image Clustering Some Theoretical Limitations of t-SNE DroneScan-YOLO: Redundancy-Aware Lightweight Detection for Tiny Objects in UAV Imagery Rethinking Uncertainty in Segmentation: From Estimation to Decision Bias-Corrected Adaptive Conformal Inference for Multi-Horizon Time Series Forecasting Out of Context: Reliability in Multimodal Anomaly Detection Requires Contextual Inference Analog Optical Inference on Million-Record Mortgage Data A High-Resolution Landscape Dataset for Concept-Based XAI With Application to Species Distribution Models
Dimension-Free Convergence of Discrete Diffusion Models: Adjoint Equations Induce the Right Space
Kelvin Kan, Xingjian Li, Benjamin J. Zhang, Tuhin Sahai, Stanley · 2026-05-17 · via cs.LG updates on arXiv.org

Discrete diffusion has become a leading framework for generative modeling in various applications including language, vision, and biology. Existing convergence theory, however, exhibits fundamental limitations. KL-based analyses diverge under singular priors such as the masked distribution, while bounds in total variation (TV) depend on the state space size $S$ and become vacuous for modern language tasks, where vocabularies contain hundreds of thousands of tokens. We develop a unified adjoint-equation-based framework that establishes dimension-free convergence guarantees in any integral probability metric (IPM). To the best of our knowledge, our bounds are the first to be entirely free of $S$ and applicable to both masked and uniform priors. Importantly, our theory relies only on a single standard rate-matrix regularity assumption and is compatible with time-inhomogeneous schedules. Four novel techniques drive our improvements: working in the space of observables via adjoint equations rather than directly with probability measures, a regularity analysis that yields bounds on any IPM, a coupling argument that removes $S$-dependence under uniform transitions, and a score-marginal cancellation technique that removes $S$-dependence under masked transitions. Our framework thus sharply departs from prior analyses and avoids the shortcomings of pathspace-KL and existing TV-based approaches. Beyond convergence bounds, our framework provides a versatile toolkit for further theoretical study of discrete diffusion models.