惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

P
Proofpoint News Feed
The Last Watchdog
The Last Watchdog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Know Your Adversary
Know Your Adversary
P
Privacy & Cybersecurity Law Blog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
T
Threatpost
www.infosecurity-magazine.com
www.infosecurity-magazine.com
W
WeLiveSecurity
Scott Helme
Scott Helme
Google DeepMind News
Google DeepMind News
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
G
GRAHAM CLULEY
M
MIT News - Artificial intelligence
博客园 - 【当耐特】
V
Visual Studio Blog
Apple Machine Learning Research
Apple Machine Learning Research
Attack and Defense Labs
Attack and Defense Labs
Google Online Security Blog
Google Online Security Blog
S
Security @ Cisco Blogs
博客园_首页
J
Java Code Geeks
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
H
Hacker News: Front Page
雷峰网
雷峰网
K
Kaspersky official blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 司徒正美
T
Tor Project blog
阮一峰的网络日志
阮一峰的网络日志
L
LangChain Blog
I
Intezer
C
CXSECURITY Database RSS Feed - CXSecurity.com
G
Google Developers Blog
Help Net Security
Help Net Security
博客园 - Franky
U
Unit 42
P
Proofpoint News Feed
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
量子位
L
LINUX DO - 热门话题
N
News and Events Feed by Topic
MyScale Blog
MyScale Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
N
News and Events Feed by Topic
H
Help Net Security
Blog — PlanetScale
Blog — PlanetScale
T
Threat Research - Cisco Blogs
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
TaoSecurity Blog
TaoSecurity Blog

cs.LG updates on arXiv.org

Synthetic Tabular Generators Fail to Preserve Behavioral Fraud Patterns: A Benchmark on Temporal, Velocity, and Multi-Account Signals Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates Automated co-design of high-performance thermodynamic cycles via graph-based hierarchical reinforcement learning Bias-Corrected Adaptive Conformal Inference for Multi-Horizon Time Series Forecasting Counterfactual Peptide Editing for Causal TCR--pMHC Binding Inference Binomial Gradient-Based Meta-Learning for Enhanced Meta-Gradient Estimation Enhancing Confidence Estimation in Telco LLMs via Twin-Pass CoT-Ensembling MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models Physics-informed reservoir characterization from bulk and extreme pressure events with a differentiable simulator Some Theoretical Limitations of t-SNE Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding Multi-Task LLM with LoRA Fine-Tuning for Automated Cancer Staging and Biomarker Extraction Text-Attributed Knowledge Graph Enrichment with Large Language Models for Medical Concept Representation Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models When Less Latent Leads to Better Relay: Information-Preserving Compression for Latent Multi-Agent LLM Collaboration BioTrain: Sub-MB, Sub-50mW On-Device Fine-Tuning for Edge-AI on Biosignals Linear Probe Accuracy Scales with Model Size and Benefits from Multi-Layer Ensembling Dataset-Level Metrics Attenuate Non-Determinism: A Fine-Grained Non-Determinism Evaluation in Diffusion Language Models WIN-U: Woodbury-Informed Newton-Unlearning as a retain-free Machine Unlearning Framework FAST: A Synergistic Framework of Attention and State-space Models for Spatiotemporal Traffic Prediction Adaptive Unknown Fault Detection and Few-Shot Continual Learning for Condition Monitoring in Ultrasonic Metal Welding Universality of Gaussian-Mixture Reverse Kernels in Conditional Diffusion Computational framework for multistep metabolic pathway design LEGO-MOF: Equivariant Latent Manipulation for Editable, Generative, and Optimizable MOF Design Learning Inference Concurrency in DynamicGate MLP Structural and Mathematical Justification Parameter-efficient Quantum Multi-task Learning Enhancing Reinforcement Learning for Radiology Report Generation with Evidence-aware Rewards and Self-correcting Preference Learning Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Self-Organizing Maps with Optimized Latent Positions A Bayesian Framework for Uncertainty-Aware Explanations in Power Quality Disturbance Classification Optimization with SpotOptim Physics-Informed Neural Networks for Solving Derivative-Constrained PDEs Spectral Thompson sampling Online learning with noisy side observations Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning Robust Ultra Low-Bit Post-Training Quantization via Stable Diagonal Curvature Estimate Composite Silhouette: A Subsampling-based Aggregation Strategy RPS: Information Elicitation with Reinforcement Prompt Selection UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization Beyond State Consistency: Behavior Consistency in Text-Based World Models Simulation-Based Optimisation of Batting Order and Bowling Plans in T20 Cricket Hardware-Efficient Neuro-Symbolic Networks with the Exp-Minus-Log Operator Drowsiness-Aware Adaptive Autonomous Braking System based on Deep Reinforcement Learning for Enhanced Road Safety MolCryst-MLIPs: A Machine-Learned Interatomic Potentials Database for Molecular Crystals DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Unsupervised Anomaly Detection in Process-Complex Industrial Time Series: A Real-World Case Study Quantum Machine Learning for Colorectal Cancer Data: Anastomotic Leak Classification and Risk Factors Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation BOAT: Navigating the Sea of In Silico Predictors for Antibody Design via Multi-Objective Bayesian Optimization PRiMeFlow: Capturing Complex Expression Heterogeneity in Perturbation Response Modelling Unsupervised domain transfer: Overcoming signal degradation in sleep monitoring by increasing scoring realism Physics-Informed Neural Networks for Methane Sorption: Cross-Gas Transfer Learning, Ensemble Collapse Under Physics Constraints, and Monte Carlo Dropout Uncertainty Quantification A Complete Symmetry Classification of Shallow ReLU Networks Momentum Further Constrains Sharpness at the Edge of Stochastic Stability Complex Interpolation of Matrices with an application to Multi-Manifold Learning Adaptive Learning via Off-Model Training and Importance Sampling for Fully Non-Markovian Optimal Stochastic Control. Complete version HUANet: Hard-Constrained Unrolled ADMM for Constrained Convex Optimization Fast Voxelization and Level of Detail for Microgeometry Rendering Rare Event Analysis via Stochastic Optimal Control From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning TIP: Token Importance in On-Policy Distillation Neural architectures for resolving references in program code $π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data First-See-Then-Design: A Multi-Stakeholder View for Optimal Performance-Fairness Trade-Offs MAny: Merge Anything for Multimodal Continual Instruction Tuning Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark ASTER: Latent Pseudo-Anomaly Generation for Unsupervised Time-Series Anomaly Detection Context Sensitivity Improves Human-Machine Visual Alignment Evaluating Supervised Machine Learning Models: Principles, Pitfalls, and Metric Selection SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention Artificial intelligence application in lymphoma diagnosis with Vision Transformer using weakly supervised training Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation Vision-Language-Action Jump-Starting for Reinforcement Learning Robotic Agents Automatic Charge State Tuning of 300 mm FDSOI Quantum Dots Using Neural Network Segmentation of Charge Stability Diagram Ordinary Least Squares is a Special Case of Transformer (How) Learning Rates Regulate Catastrophic Overtraining Golden Handcuffs make safer AI agents Design Space Exploration of Hybrid Quantum Neural Networks for Chronic Kidney Disease C-voting: Confidence-Based Test-Time Voting without Explicit Energy Functions From Alignment to Prediction: A Study of Self-Supervised Learning and Predictive Representation Learning Representation over Routing: Diagnosing Temporal Routing Pathologies in Multi-Timescale PPO SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning Monthly Diffusion v0.9: A Latent Diffusion Model for the First AI-MIP Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning Asymmetric-Loss-Guided Hybrid CNN-BiLSTM-Attention Model for Industrial RUL Prediction with Interpretable Failure Heatmaps MyoVision: A Mobile Research Tool and NEATBoost-Attention Ensemble Framework for Real Time Chicken Breast Myopathy Detection Outperforming Self-Attention Mechanisms in Solar Irradiance Forecasting via Physics-Guided Neural Networks A KL Lens on Quantization: Fast, Forward-Only Sensitivity for Mixed-Precision SSM-Transformer Models Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence Diffusion Sequence Models for Generative In-Context Meta-Learning of Robot Dynamics Beyond Uniform Sampling: Synergistic Active Learning and Input Denoising for Robust Neural Operators The Spectrascapes Dataset: Street-view imagery beyond the visible captured using a mobile platform Deep Spatially-Regularized and Superpixel-Based Diffusion Learning for Unsupervised Hyperspectral Image Clustering DroneScan-YOLO: Redundancy-Aware Lightweight Detection for Tiny Objects in UAV Imagery Rethinking Uncertainty in Segmentation: From Estimation to Decision
Maximin Relative Improvement: Fair Learning as a Bargaining Problem
Jiwoo Han, Moulinath Banerjee, Yuekai Sun · 2026-02-04 · via cs.LG updates on arXiv.org

When deploying a single predictor across multiple subpopulations, we propose a fundamentally different approach: interpreting group fairness as a bargaining problem among subpopulations. This game-theoretic perspective reveals that existing robust optimization methods such as minimizing worst-group loss or regret correspond to classical bargaining solutions and embody different fairness principles. We propose relative improvement, the ratio of actual risk reduction to potential reduction from a baseline predictor, which recovers the Kalai-Smorodinsky solution. Unlike absolute-scale methods that may not be comparable when groups have different potential predictability, relative improvement provides axiomatic justification including scale invariance and individual monotonicity. We establish finite-sample convergence guarantees under mild conditions.