惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Vercel News
Vercel News
SecWiki News
SecWiki News
WordPress大学
WordPress大学
小众软件
小众软件
博客园 - 司徒正美
酷 壳 – CoolShell
酷 壳 – CoolShell
V
Visual Studio Blog
Y
Y Combinator Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
云风的 BLOG
云风的 BLOG
MyScale Blog
MyScale Blog
K
Kaspersky official blog
T
The Exploit Database - CXSecurity.com
腾讯CDC
Scott Helme
Scott Helme
I
InfoQ
Cyberwarzone
Cyberwarzone
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Security Latest
Security Latest
The Register - Security
The Register - Security
Project Zero
Project Zero
F
Fortinet All Blogs
C
CERT Recently Published Vulnerability Notes
A
Arctic Wolf
C
Cisco Blogs
L
LINUX DO - 热门话题
P
Privacy International News Feed
IT之家
IT之家
U
Unit 42
P
Privacy & Cybersecurity Law Blog
H
Help Net Security
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
C
Cyber Attacks, Cyber Crime and Cyber Security
P
Palo Alto Networks Blog
F
Full Disclosure
宝玉的分享
宝玉的分享
Simon Willison's Weblog
Simon Willison's Weblog
L
Lohrmann on Cybersecurity
Google DeepMind News
Google DeepMind News
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
H
Hacker News: Front Page
Know Your Adversary
Know Your Adversary
PCI Perspectives
PCI Perspectives
Hugging Face - Blog
Hugging Face - Blog
AWS News Blog
AWS News Blog
MongoDB | Blog
MongoDB | Blog
S
Schneier on Security
Recent Announcements
Recent Announcements
Forbes - Security
Forbes - Security
Cisco Talos Blog
Cisco Talos Blog

math.PR updates on arXiv.org

Visibility in the Boolean Model on Harmonic Manifolds Global estimates on the Brenier map Geodesics and Wandering Exponents in Brochette First-Passage Percolation State-dependent inverse-subordinator time changes of regenerative processes: Excursion structure and multiscale occupation-time limits Randomly twisted transfer operators and singular values statistics Generalized Bessel-Dunkl diffusions An almost sure invariance principle for the Takagi-van der Waerden class functions Central limit theorems for high dimensional lattice polytopes: cosmological polytopes Convergence rate estimates for semigroups and heat kernels associated with resistance forms Second-order Poincaré inequalities and localization on the Poisson space Maximum Probability of Independence in Transitive Matroids On global solutions to the semidiscrete stochastic heat equation The Poisson Tail Conjecture for Primes in Short Intervals A Complete Spectral Analysis of the CEV Operator with Applications to Arbitrage Holographic functions and neural networks From Betting to Empirical Bernstein LIL Concentration of General Stochastic Approximation Under Heavy-Tailed Markovian Noise Pointwise Generalization in Deep Neural Networks Bayesian Latent Space Models for Graphs Are Misspecified: Toward Robust Inference via Generalized Posteriors Wasserstein bounds for denoising diffusion probabilistic models via the Föllmer process A note on connections between the Föllmer process and the denoising diffusion probabilistic model Simple Approximation and Derivative Free Inference-Time Scaling for Diffusion Models via Sequential Monte Carlo on Path Measures Diffusion-Based Stochastic Operator Networks for Uncertainty Quantification in Stochastic Partial Differential Equations A Fourier perspective on the learning dynamics of neural networks: from sample complexities to mechanistic insights Propagation of Chaos in Contextual Flow Maps Dimension-Uniform Discretization Analysis of Preconditioned Annealed Langevin Dynamics for Multimodal Gaussian Mixtures $α$-TCAV: A Unified Framework for Testing with Concept Activation Vectors Scaling Laws from Sequential Feature Recovery: A Solvable Hierarchical Model On the Limits of Latent Reuse in Diffusion Models State-of-art minibatches via novel DPP kernels: discretization, wavelets, and rough objectives A Unified Framework for Critical Scaling of Inverse Temperature in Self-Attention Expected Batch Optimal Transport Plans and Consequences for Flow Matching Partial Model Sharing Improves Byzantine Resilience in Federated Conformal Prediction GRAFT-ATHENA: Self-Improving Agentic Teams for Autonomous Discovery and Evolutionary Numerical Algorithms Uniform Scaling Limits in AdamW-Trained Transformers Constant-Target Energy Matching: A Unified Framework for Continuous and Discrete Density Estimation Scaling Limits of Long-Context Transformers Generalized Wasserstein Flow Matching: Transport Plans, Everywhere, All at Once Convergence Analysis of Newton's Method for Neural Networks in the Overparameterized Limit Convergent Stochastic Training of Attention and Understanding LoRA Universality of the fluctuations of the free energy in generalized Sherrington-Kirkpatrick models and the log likelihood ratio in spiked Wigner models Expressivity of Bi-Lipschitz Normalizing Flows: A Score-Based Diffusion Perspective Time-Inhomogeneous Preconditioned Langevin Dynamics Matrix-Decoupled Concentration for Autoregressive Sequences: Dimension-Free Guarantees for Sparse Long-Context Rewards Convex-Geometric Error Bounds for Positive-Weight Kernel Quadrature Variational Smoothing and Inference for SDEs from Sparse Data with Dynamic Neural Flows Grokability in five inequalities Almost-Orthogonality in Lp Spaces: A Case Study with Grok On Computing Total Variation Distance Between Mixtures of Product Distributions Universality in Deep Neural Networks: An approach via the Lindeberg exchange principle Soft-to-Hard Routing in Sparse Mixture-of-Experts Models Learning Discriminators for Resampling in the Ensemble Gaussian Mixture Filter through a Normalizing Flow Approach Decentralized Proximal Stochastic Gradient Langevin Dynamics A Review of the Receiver Operating Characteristic Curve and a Proof About the Area Beneath It Stochastic Scaling Limits and Synchronization by Noise in Deep Transformer Models Well-Conditioned Oblivious Perturbations in Linear Space Mathematical Foundations for Peer-to-Peer Lattice Computation Achieving the Kesten-Stigum bound in the non-uniform hypergraph stochastic block model Phase Transitions in the Fluctuations of Functionals of Random Neural Networks Ultrametric OGP - parametric RDT \emph{symmetric} binary perceptron connection Geometric regularization of autoencoders via observed stochastic dynamics A Wasserstein Geometric Framework for Hebbian Plasticity Neural Continuous-Time Markov Chain: Discrete Diffusion via Decoupled Jump Timing and Direction One-Shot Generative Flows: Existence and Obstructions Wasserstein Formulation of Reinforcement Learning. An Optimal Transport Perspective on Policy Optimization node2vec or triangle-biased random walks: stationarity, regularity & recurrence Some Theoretical Limitations of t-SNE Adaptive Learning via Off-Model Training and Importance Sampling for Fully Non-Markovian Optimal Stochastic Control. Complete version Tail-Aware Information-Theoretic Generalization for RLHF and SGLD Diffusion Processes on Implicit Manifolds Degrees, Levels, and Profiles of Contextuality High-accuracy log-concave sampling with stochastic queries Variational Optimality of Föllmer Processes in Generative Diffusions Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold Dimension-Free Multimodal Sampling via Preconditioned Annealed Langevin Dynamics A Review of Diffusion-based Simulation-Based Inference: Foundations and Applications in Non-Ideal Data Scenarios Feature Learning Dynamics in Infinite-Depth Neural Networks On The Hidden Biases of Flow Matching Samplers Fast and Robust Diffusion Posterior Sampling for MR Image Reconstruction Using the Preconditioned Unadjusted Langevin Algorithm Normalizing Flows on Quotient Manifolds via Boundary Quotients Differentiable Filtering for Learning Hidden Markov Models Limit Theorems for Stochastic Gradient Descent in High-Dimensional Single-Layer Networks Posterior Bayesian Neural Networks with Dependent Weights Exponentially Fading Memory Signature A decision-theoretic approach to dealing with uncertainty in quantum mechanics On Statistical Estimation of Edge-Reinforced Random Walks Efficiency of Parallel and Restart Exploration Strategies in Model Free Stochastic Simulations The feasibility of multi-graph alignment: a Bayesian approach Gaussian Approximation and Multiplier Bootstrap for Stochastic Gradient Descent Mean-field limit from general mixtures of experts to quantum neural networks On an $L^2$ norm for stationary ARMA processes Mirror Descent-Ascent for mean-field min-max problems Universal approximation property of Banach space-valued random feature models including random neural networks Deep neural networks with ReLU, leaky ReLU, and softplus activation provably overcome the curse of dimensionality for Kolmogorov partial differential equations with Lipschitz nonlinearities in the $L^p$-sense Conditional stochastic differential equations driven by fractional Brownian motion Large deviations for the mean-field limit of Hawkes processes Distribution-Free Stochastic Analysis and Robust Multilevel Vector Field Anomaly Detection Change of measure through the Legendre transform On quantitative Laplace-type convergence results for some exponential probability measures, with two applications Convergence rates for gradient descent in the training of overparameterized artificial neural networks with piecewise affine activation
The Stein-log-Sobolev inequality and the exponential rate of convergence for the continuous Stein variational gradient descent method
José A. Carrillo, Jakub Skrzeczkowski, Jethro Warnett · 2024-12-14 · via math.PR updates on arXiv.org

The Stein Variational Gradient Descent method is a variational inference method in statistics that has recently received a lot of attention. The method provides a deterministic approximation of the target distribution, by introducing a nonlocal interaction with a kernel. Despite the significant interest, the exponential rate of convergence for the continuous method has remained an open problem, due to the difficulty of establishing the related so-called Stein-log-Sobolev inequality. Here, we prove that the inequality is satisfied for each space dimension and every kernel whose Fourier transform has a quadratic decay at infinity and is locally bounded away from zero and infinity. Moreover, we construct weak solutions to the related PDE satisfying exponential rate of decay towards the equilibrium. The main novelty in our approach is to interpret the Stein-Fisher information, also called the squared Stein discrepancy, as a duality pairing between $H^{-1}(\mathbb{R}^d)$ and $H^{1}(\mathbb{R}^d)$, which allows us to employ the Fourier transform. We also provide several examples of kernels for which the Stein-log-Sobolev inequality fails, partially showing the necessity of our assumptions.