惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

P
Proofpoint News Feed
The Last Watchdog
The Last Watchdog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Know Your Adversary
Know Your Adversary
P
Privacy & Cybersecurity Law Blog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
T
Threatpost
www.infosecurity-magazine.com
www.infosecurity-magazine.com
W
WeLiveSecurity
Scott Helme
Scott Helme
Google DeepMind News
Google DeepMind News
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
G
GRAHAM CLULEY
M
MIT News - Artificial intelligence
博客园 - 【当耐特】
V
Visual Studio Blog
Apple Machine Learning Research
Apple Machine Learning Research
Attack and Defense Labs
Attack and Defense Labs
Google Online Security Blog
Google Online Security Blog
S
Security @ Cisco Blogs
博客园_首页
J
Java Code Geeks
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
H
Hacker News: Front Page
雷峰网
雷峰网
K
Kaspersky official blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 司徒正美
T
Tor Project blog
阮一峰的网络日志
阮一峰的网络日志
L
LangChain Blog
I
Intezer
C
CXSECURITY Database RSS Feed - CXSecurity.com
G
Google Developers Blog
Help Net Security
Help Net Security
博客园 - Franky
U
Unit 42
P
Proofpoint News Feed
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
量子位
L
LINUX DO - 热门话题
N
News and Events Feed by Topic
MyScale Blog
MyScale Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
N
News and Events Feed by Topic
H
Help Net Security
Blog — PlanetScale
Blog — PlanetScale
T
Threat Research - Cisco Blogs
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
TaoSecurity Blog
TaoSecurity Blog

cs.LG updates on arXiv.org

Synthetic Tabular Generators Fail to Preserve Behavioral Fraud Patterns: A Benchmark on Temporal, Velocity, and Multi-Account Signals Generalization Guarantees on Data-Driven Tuning of Gradient Descent with Langevin Updates Automated co-design of high-performance thermodynamic cycles via graph-based hierarchical reinforcement learning Bias-Corrected Adaptive Conformal Inference for Multi-Horizon Time Series Forecasting Counterfactual Peptide Editing for Causal TCR--pMHC Binding Inference Binomial Gradient-Based Meta-Learning for Enhanced Meta-Gradient Estimation Enhancing Confidence Estimation in Telco LLMs via Twin-Pass CoT-Ensembling MOONSHOT : A Framework for Multi-Objective Pruning of Vision and Large Language Models Physics-informed reservoir characterization from bulk and extreme pressure events with a differentiable simulator Some Theoretical Limitations of t-SNE Concrete Jungle: Towards Concreteness Paved Contrastive Negative Mining for Compositional Understanding Multi-Task LLM with LoRA Fine-Tuning for Automated Cancer Staging and Biomarker Extraction Text-Attributed Knowledge Graph Enrichment with Large Language Models for Medical Concept Representation Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models When Less Latent Leads to Better Relay: Information-Preserving Compression for Latent Multi-Agent LLM Collaboration BioTrain: Sub-MB, Sub-50mW On-Device Fine-Tuning for Edge-AI on Biosignals Linear Probe Accuracy Scales with Model Size and Benefits from Multi-Layer Ensembling Dataset-Level Metrics Attenuate Non-Determinism: A Fine-Grained Non-Determinism Evaluation in Diffusion Language Models WIN-U: Woodbury-Informed Newton-Unlearning as a retain-free Machine Unlearning Framework FAST: A Synergistic Framework of Attention and State-space Models for Spatiotemporal Traffic Prediction Adaptive Unknown Fault Detection and Few-Shot Continual Learning for Condition Monitoring in Ultrasonic Metal Welding Universality of Gaussian-Mixture Reverse Kernels in Conditional Diffusion Computational framework for multistep metabolic pathway design LEGO-MOF: Equivariant Latent Manipulation for Editable, Generative, and Optimizable MOF Design Learning Inference Concurrency in DynamicGate MLP Structural and Mathematical Justification Parameter-efficient Quantum Multi-task Learning Enhancing Reinforcement Learning for Radiology Report Generation with Evidence-aware Rewards and Self-correcting Preference Learning Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges Self-Organizing Maps with Optimized Latent Positions A Bayesian Framework for Uncertainty-Aware Explanations in Power Quality Disturbance Classification Optimization with SpotOptim Physics-Informed Neural Networks for Solving Derivative-Constrained PDEs Spectral Thompson sampling Online learning with noisy side observations Character Beyond Speech: Leveraging Role-Playing Evaluation in Audio Large Language Models via Reinforcement Learning Robust Ultra Low-Bit Post-Training Quantization via Stable Diagonal Curvature Estimate Composite Silhouette: A Subsampling-based Aggregation Strategy RPS: Information Elicitation with Reinforcement Prompt Selection UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization Beyond State Consistency: Behavior Consistency in Text-Based World Models Simulation-Based Optimisation of Batting Order and Bowling Plans in T20 Cricket Hardware-Efficient Neuro-Symbolic Networks with the Exp-Minus-Log Operator Drowsiness-Aware Adaptive Autonomous Braking System based on Deep Reinforcement Learning for Enhanced Road Safety MolCryst-MLIPs: A Machine-Learned Interatomic Potentials Database for Molecular Crystals DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off Unsupervised Anomaly Detection in Process-Complex Industrial Time Series: A Real-World Case Study Quantum Machine Learning for Colorectal Cancer Data: Anastomotic Leak Classification and Risk Factors Provably Efficient Offline-to-Online Value Adaptation with General Function Approximation BOAT: Navigating the Sea of In Silico Predictors for Antibody Design via Multi-Objective Bayesian Optimization PRiMeFlow: Capturing Complex Expression Heterogeneity in Perturbation Response Modelling Unsupervised domain transfer: Overcoming signal degradation in sleep monitoring by increasing scoring realism Physics-Informed Neural Networks for Methane Sorption: Cross-Gas Transfer Learning, Ensemble Collapse Under Physics Constraints, and Monte Carlo Dropout Uncertainty Quantification A Complete Symmetry Classification of Shallow ReLU Networks Momentum Further Constrains Sharpness at the Edge of Stochastic Stability Complex Interpolation of Matrices with an application to Multi-Manifold Learning Adaptive Learning via Off-Model Training and Importance Sampling for Fully Non-Markovian Optimal Stochastic Control. Complete version HUANet: Hard-Constrained Unrolled ADMM for Constrained Convex Optimization Fast Voxelization and Level of Detail for Microgeometry Rendering Rare Event Analysis via Stochastic Optimal Control From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning TIP: Token Importance in On-Policy Distillation Neural architectures for resolving references in program code $π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data First-See-Then-Design: A Multi-Stakeholder View for Optimal Performance-Fairness Trade-Offs MAny: Merge Anything for Multimodal Continual Instruction Tuning Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning HINTBench: Horizon-agent Intrinsic Non-attack Trajectory Benchmark ASTER: Latent Pseudo-Anomaly Generation for Unsupervised Time-Series Anomaly Detection Context Sensitivity Improves Human-Machine Visual Alignment Evaluating Supervised Machine Learning Models: Principles, Pitfalls, and Metric Selection SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention Artificial intelligence application in lymphoma diagnosis with Vision Transformer using weakly supervised training Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces Design and Behavior of Sparse Mixture-of-Experts Layers in CNN-based Semantic Segmentation Vision-Language-Action Jump-Starting for Reinforcement Learning Robotic Agents Automatic Charge State Tuning of 300 mm FDSOI Quantum Dots Using Neural Network Segmentation of Charge Stability Diagram Ordinary Least Squares is a Special Case of Transformer (How) Learning Rates Regulate Catastrophic Overtraining Golden Handcuffs make safer AI agents Design Space Exploration of Hybrid Quantum Neural Networks for Chronic Kidney Disease C-voting: Confidence-Based Test-Time Voting without Explicit Energy Functions From Alignment to Prediction: A Study of Self-Supervised Learning and Predictive Representation Learning Representation over Routing: Diagnosing Temporal Routing Pathologies in Multi-Timescale PPO SFT-GRPO Data Overlap as a Post-Training Hyperparameter for Autoformalization Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning Monthly Diffusion v0.9: A Latent Diffusion Model for the First AI-MIP Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus From Order to Distribution: A Spectral Characterization of Forgetting in Continual Learning Asymmetric-Loss-Guided Hybrid CNN-BiLSTM-Attention Model for Industrial RUL Prediction with Interpretable Failure Heatmaps MyoVision: A Mobile Research Tool and NEATBoost-Attention Ensemble Framework for Real Time Chicken Breast Myopathy Detection Outperforming Self-Attention Mechanisms in Solar Irradiance Forecasting via Physics-Guided Neural Networks A KL Lens on Quantization: Fast, Forward-Only Sensitivity for Mixed-Precision SSM-Transformer Models Minimax Optimality and Spectral Routing for Majority-Vote Ensembles under Markov Dependence Diffusion Sequence Models for Generative In-Context Meta-Learning of Robot Dynamics Beyond Uniform Sampling: Synergistic Active Learning and Input Denoising for Robust Neural Operators The Spectrascapes Dataset: Street-view imagery beyond the visible captured using a mobile platform Deep Spatially-Regularized and Superpixel-Based Diffusion Learning for Unsupervised Hyperspectral Image Clustering DroneScan-YOLO: Redundancy-Aware Lightweight Detection for Tiny Objects in UAV Imagery Rethinking Uncertainty in Segmentation: From Estimation to Decision
Learning Credal Ensembles via Distributionally Robust Optimization
Kaizheng Wang, Ghifari Adam Faza, Fabio Cuzzolin, Siu Lun Chau, · 2026-02-09 · via cs.LG updates on arXiv.org

Credal predictors are models that are aware of epistemic uncertainty and produce a convex set of probabilistic predictions. They offer a principled way to quantify predictive epistemic uncertainty (EU) and have been shown to improve model robustness in various settings. However, most state-of-the-art methods mainly define EU as disagreement caused by random training initializations, which mostly reflects sensitivity to optimization randomness rather than uncertainty from deeper sources. To address this, we define EU as disagreement among models trained with varying relaxations of the i.i.d. assumption between training and test data. Based on this idea, we propose CreDRO, which learns an ensemble of plausible models through distributionally robust optimization. As a result, CreDRO captures EU not only from training randomness but also from meaningful disagreement due to potential distribution shifts between training and test data. Empirical results show that CreDRO consistently outperforms existing credal methods on tasks such as out-of-distribution detection across multiple benchmarks and selective classification in medical applications.