Spiked separable covariance matrices and principal components
Xiucai Ding, Fan Yang · 2019-05-30 · via math.ST updates on arXiv.org

We introduce a class of separable sample covariance matrices of the form $\widetilde{\mathcal{Q}}_1:=\widetilde{A}^{1/2} X \widetilde{B} X^* \widetilde{A}^{1/2}$. Here $\widetilde{A}$ and $\widetilde{B}$ are positive definite matrices whose spectra consist of bulk spectra plus several spikes, i.e. larger eigenvalues that are separated from the bulks. We call $\widetilde{\mathcal{Q}}_1$ a spiked separable covariance matrix model. On the one hand, this model includes the spiked covariance matrix as the special case $\widetilde{B}=I$. On the other hand, it allows for more general correlations within datasets. In particular, for a spatio-temporal dataset, $\widetilde{A}$ and $\widetilde{B}$ represent the spatial and temporal correlations, respectively. In this paper, we study the outlier eigenvalues and eigenvectors, i.e. the principal components, of the spiked separable covariance model $\widetilde{\mathcal{Q}}_1$. We prove the convergence of the outlier eigenvalues $\widetilde{\lambda}_i$ and of the generalized components of the outlier eigenvectors $\widetilde{\boldsymbol{\xi}}_i$ (i.e. $\langle \mathbf{v}, \widetilde{\boldsymbol{\xi}}_i \rangle$ for any deterministic vector $\mathbf{v}$), with optimal convergence rates. Moreover, we prove the delocalization of the non-outlier eigenvectors. We state our results in full generality, in the sense that they also hold near the so-called BBP transition and for degenerate outliers. Our results highlight both the similarities and the differences between the spiked separable covariance matrix model and the spiked covariance model. In particular, we show that the spikes of both $\widetilde{A}$ and $\widetilde{B}$ produce outliers in the eigenvalue spectrum, and that the eigenvectors can be used to select the outliers that correspond to the spikes of $\widetilde{A}$ (or $\widetilde{B}$).
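
The model above can be illustrated numerically with a minimal sketch. It is not taken from the paper and makes several simplifying assumptions: real Gaussian entries for $X$ with variance $1/n$, diagonal $\widetilde{A}$ and $\widetilde{B}$ whose bulks are the identity, and a single spike in each, placed at the first coordinate. The function name `simulate_spiked_separable` and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def simulate_spiked_separable(p=300, n=1200, a_spike=20.0, b_spike=20.0, seed=0):
    """Eigen-decompose Q1 = A^{1/2} X B X^T A^{1/2} for a toy spiked model.

    Assumptions (not from the paper): A and B are diagonal, equal to the
    identity except for one spike each at index 0, and X has i.i.d.
    N(0, 1/n) entries.
    """
    rng = np.random.default_rng(seed)
    a = np.ones(p)
    a[0] = a_spike                      # single spike of A
    b = np.ones(n)
    b[0] = b_spike                      # single spike of B
    X = rng.standard_normal((p, n)) / np.sqrt(n)
    # Y = A^{1/2} X B^{1/2}, so that Q1 = Y Y^T
    Y = np.sqrt(a)[:, None] * X * np.sqrt(b)[None, :]
    Q1 = Y @ Y.T
    evals, evecs = np.linalg.eigh(Q1)   # ascending order
    return evals[::-1], evecs[:, ::-1]  # reorder to descending


if __name__ == "__main__":
    p, n = 300, 1200
    evals, evecs = simulate_spiked_separable(p, n)
    # With identity bulks, the unspiked spectrum follows the
    # Marchenko-Pastur law with right edge (1 + sqrt(p/n))^2.
    bulk_edge = (1 + np.sqrt(p / n)) ** 2
    print("bulk right edge:", round(bulk_edge, 3))
    print("three largest eigenvalues:", np.round(evals[:3], 3))
    # Squared overlap of the two leading eigenvectors with e_0,
    # the direction of the spike of A.
    print("overlaps with e_0:", np.round(evecs[0, :2] ** 2, 3))
```

In this toy setting one would expect two eigenvalues well above the bulk edge, one caused by each spike. The outlier whose eigenvector concentrates on the first coordinate is the one attributable to the spike of $\widetilde{A}$, which is a simple instance of using eigenvectors to tell the two kinds of outliers apart.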