Causal Unlearning in Collaborative Optimization: Exact and Approximate Influence Reversal under Adversarial Contributions
Ali Mahdavi, · 2026-05-21 · via cs.AI updates on arXiv.org

Abstract: Federated learning systems must support data deletion requests to comply with privacy regulations, yet retraining from scratch after each deletion is computationally prohibitive. We present HF-KCU, a method that removes a client's contribution by approximating the influence function through conjugate gradient iterations in Krylov subspaces, reducing complexity from $O(d^3)$ to $O(kd)$, where $k \ll d$. A causal weighting mechanism ensures that only clients holding the deleted data receive parameter updates, preventing spurious changes to unaffected clients. Our method is designed to handle bounded adversarial perturbations to the Hessian and gradient, providing graceful degradation under realistic threat models. We validate HF-KCU across convolutional (ResNet-18, SimpleCNN) and transformer (ViT-Lite) architectures on CIFAR-10, MNIST, and Fashion-MNIST. On CIFAR-10 under Dirichlet ($\alpha = 0.5$) partitioning, HF-KCU achieves a $47.75\times$ speedup over retraining while keeping test accuracy within 0.60 percentage points of the retraining baseline (71.16% vs. 71.76%). Membership inference attacks on the forget set yield a success rate of 0.499, matching the retrained model and confirming effective privacy restoration. We provide convergence guarantees showing that the Krylov approximation error decreases as $O\big((\sqrt{\kappa} - 1)/(\sqrt{\kappa} + 1)\big)$, where $\kappa$ is the Hessian condition number. The causal weighting mechanism ensures surgical updates, where only clients holding deleted data are modified, preserving model quality for unaffected participants and avoiding the instability of gradient-based approaches in asynchronous federated settings. This design provides interpretability, as each update is directly traceable to the influence of the deleted data. The method's efficiency and precision make it suitable for production federated systems where deletion requests arrive asynchronously and computational budgets are constrained.
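
The abstract's core idea, replacing an explicit Hessian inverse with a few conjugate gradient (CG) iterations driven by Hessian-vector products, can be illustrated on a toy problem. The sketch below is not HF-KCU: it assumes a single-machine ridge regression instead of a federated model, omits the causal weighting and adversarial-robustness components, and every function name (`hvp`, `conjugate_gradient`, `fit`, `unlearn`) is hypothetical. Because the ridge loss is quadratic, one Newton-style correction computed with CG should reproduce retraining without the forgotten sample.

```python
# Toy sketch only: Hessian-free, influence-style unlearning on ridge regression.
# All names are hypothetical and are not taken from the paper.

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def axpy(alpha, x, y):
    """Return alpha * x + y for plain Python lists."""
    return [alpha * xi + yi for xi, yi in zip(x, y)]

def hvp(data, lam, v):
    """Hessian-vector product (sum_i x_i x_i^T + lam * I) v for the ridge loss,
    computed without ever forming the d x d Hessian."""
    out = [lam * vi for vi in v]
    for x, _ in data:
        out = axpy(dot(x, v), x, out)
    return out

def conjugate_gradient(matvec, b, k, tol=1e-24):
    """Approximately solve A z = b with at most k CG iterations
    (A must be symmetric positive definite)."""
    z = [0.0] * len(b)
    r = list(b)          # residual b - A z, with z = 0 initially
    p = list(r)          # first search direction
    rs = dot(r, r)
    for _ in range(k):
        if rs <= tol:    # squared residual norm is already negligible
            break
        Ap = matvec(p)
        alpha = rs / dot(p, Ap)
        z = axpy(alpha, p, z)
        r = axpy(-alpha, Ap, r)
        rs_new = dot(r, r)
        p = axpy(rs_new / rs, p, r)
        rs = rs_new
    return z

def fit(data, lam, d, k):
    """'Retrain from scratch': solve the ridge normal equations with CG."""
    b = [0.0] * d
    for x, y in data:
        b = axpy(y, x, b)
    return conjugate_gradient(lambda v: hvp(data, lam, v), b, k)

def unlearn(theta, data, forget_idx, lam, k):
    """Remove one sample's influence with a single Newton-style correction:
    theta' = theta + H_retained^{-1} * grad(loss of forgotten sample)."""
    x_f, y_f = data[forget_idx]
    retained = data[:forget_idx] + data[forget_idx + 1:]
    residual = dot(x_f, theta) - y_f
    grad_f = [residual * xi for xi in x_f]
    step = conjugate_gradient(lambda v: hvp(retained, lam, v), grad_f, k)
    return axpy(1.0, step, theta)

# Made-up data: five samples (x, y) in three dimensions.
data = [
    ([1.0, 0.0, 2.0], 1.0),
    ([0.0, 1.0, 1.0], 2.0),
    ([1.0, 1.0, 0.0], 3.0),
    ([2.0, 1.0, 1.0], 0.5),
    ([0.5, 2.0, 1.0], 1.5),
]
lam, d = 0.1, 3
theta_full = fit(data, lam, d, k=10)
theta_unlearned = unlearn(theta_full, data, forget_idx=2, lam=lam, k=10)
theta_retrained = fit(data[:2] + data[3:], lam, d, k=10)
gap = max(abs(a - b) for a, b in zip(theta_unlearned, theta_retrained))
print(f"max parameter gap to retraining: {gap:.2e}")
```

In this sketch the iteration cap `k` plays the role of the Krylov subspace dimension. For comparison with the rate quoted in the abstract, the textbook CG bound on the energy-norm error after $k$ iterations is $2\big((\sqrt{\kappa} - 1)/(\sqrt{\kappa} + 1)\big)^k$; whether the paper's guarantee takes exactly this form is not stated in the abstract.
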
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Performance (cs.PF)
Cite as: arXiv:2605.20341 [cs.LG]
  (or arXiv:2605.20341v1 [cs.LG] for this version)
  https://doi.org/10.48550/arXiv.2605.20341

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Azadeh Zamanifar
[v1] Tue, 19 May 2026 18:00:39 UTC (32 KB)