惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

P
Proofpoint News Feed
The Last Watchdog
The Last Watchdog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Know Your Adversary
Know Your Adversary
P
Privacy & Cybersecurity Law Blog
D
Darknet – Hacking Tools, Hacker News & Cyber Security
T
Threatpost
www.infosecurity-magazine.com
www.infosecurity-magazine.com
W
WeLiveSecurity
Scott Helme
Scott Helme
Google DeepMind News
Google DeepMind News
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
G
GRAHAM CLULEY
M
MIT News - Artificial intelligence
博客园 - 【当耐特】
V
Visual Studio Blog
Apple Machine Learning Research
Apple Machine Learning Research
Attack and Defense Labs
Attack and Defense Labs
Google Online Security Blog
Google Online Security Blog
S
Security @ Cisco Blogs
博客园_首页
J
Java Code Geeks
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
H
Hacker News: Front Page
雷峰网
雷峰网
K
Kaspersky official blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 司徒正美
T
Tor Project blog
阮一峰的网络日志
阮一峰的网络日志
L
LangChain Blog
I
Intezer
C
CXSECURITY Database RSS Feed - CXSecurity.com
G
Google Developers Blog
Help Net Security
Help Net Security
博客园 - Franky
U
Unit 42
P
Proofpoint News Feed
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
量子位
L
LINUX DO - 热门话题
N
News and Events Feed by Topic
MyScale Blog
MyScale Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
N
News and Events Feed by Topic
H
Help Net Security
Blog — PlanetScale
Blog — PlanetScale
T
Threat Research - Cisco Blogs
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
TaoSecurity Blog
TaoSecurity Blog

cs.AI updates on arXiv.org

Detecting Safety Violations Across Many Agent Traces C-ReD: A Comprehensive Chinese Benchmark for AI-Generated Text Detection Derived from Real-World Prompts GenTac: Generative Modeling and Forecasting of Soccer Tactics ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks Discourse Diversity in Multi-Turn Empathic Dialogue Evaluating Cooperation in LLM Social Groups through Elected Leadership SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context A Mamba-Based Multimodal Network for Multiscale Blast-Induced Rapid Structural Damage Assessment Agentic Driving Coach: Robustness and Determinism of Agentic AI-Powered Human-in-the-Loop Cyber-Physical Systems Legal2LogicICL: Improving Generalization in Transforming Legal Cases to Logical Formulas via Diverse Few-Shot Learning Playing Along: Learning a Double-Agent Defender for Belief Steering via Theory of Mind RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents A Triadic Suffix Tokenization Scheme for Numerical Reasoning Synthius-Mem: Brain-Inspired Hallucination-Resistant Persona Memory Achieving 94.4% Memory Accuracy and 99.6% Adversarial Robustness on LoCoMo Time is Not a Label: Continuous Phase Rotation for Temporal Knowledge Graphs and Agentic Memory NovBench: Evaluating Large Language Models on Academic Paper Novelty Assessment A collaborative agent with two lightweight synergistic models for autonomous crystal materials research Problem Reductions at Scale: Agentic Integration of Computationally Hard Problems Limited Perfect Monotonical Surrogates constructed using low-cost recursive linkage discovery with guaranteed output Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization Lectures on AI for Mathematics METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models Quantization Dominates Rank Reduction for KV-Cache Compression Anthropogenic Regional Adaptation in Multimodal Vision-Language Model Low-rank Optimization Trajectories Modeling for LLM RLVR Acceleration Think Before you Write: QA-Guided Reasoning for Character Descriptions in Books METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning From Agent Loops to Structured Graphs:A Scheduler-Theoretic Framework for LLM Agent Execution Learning from Contrasts: Synthesizing Reasoning Paths from Diverse Search Trajectories The Missing Knowledge Layer in Cognitive Architectures for AI Agents CoRe-ECG: Advancing Self-Supervised Representation Learning for 12-Lead ECG via Contrastive and Reconstructive Synergy Do LLMs Know Tool Irrelevance? Demystifying Structural Alignment Bias in Tool Invocations The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems BankerToolBench: Evaluating AI Agents in End-to-End Investment Banking Workflows Enhancing Multimodal Large Language Models for Ancient Chinese Character Evolution Analysis via Glyph-Driven Fine-Tuning The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping Consistency of AI-Generated Exercise Prescriptions: A Repeated Generation Study Using a Large Language Model RECIPER: A Dual-View Retrieval Pipeline for Procedure-Oriented Materials Question Answering Exploring Knowledge Conflicts for Faithful LLM Reasoning: Benchmark and Method CocoaBench: Evaluating Unified Digital Agents in the Wild MathAgent: Adversarial Evolution of Constraint Graphs for Mathematical Reasoning Data Synthesis Environmental Footprint of GenAI Research: Insights from the Moshi Foundation Model From Answers to Arguments: Toward Trustworthy Clinical Diagnostic Reasoning with Toulmin-Guided Curriculum Goal-Conditioned Learning Use of AI Tools: Guidelines to Maintain Academic Integrity in Computing Colleges Frugal Knowledge Graph Construction with Local LLMs: A Zero-Shot Pipeline, Self-Consistency and Wisdom of Artificial Crowds Efficient Training for Cross-lingual Speech Language Models Guardrails Beat Guidance: A Large-Scale Study of Rules, Skills, and Persistent Configuration for Coding Agents Towards Proactive Information Probing: Customer Service Chatbots Harvesting Value from Conversation Hodoscope: Unsupervised Monitoring for AI Misbehaviors PRISM Risk Signal Framework: Hierarchy-Based Red Lines for AI Behavioral Risk AI Integrity: A New Paradigm for Verifiable AI Governance Shared Emotion Geometry Across Small Language Models: A Cross-Architecture Study of Representation, Behavior, and Methodological Confounds A Systematic Analysis of the Impact of Persona Steering on LLM Capabilities Intelligent Approval of Access Control Flow in Office Automation Systems via Relational Modeling Uncertainty-Aware Web-Conditioned Scientific Fact-Checking Introspective Diffusion Language Models Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics When Valid Signals Fail: Regime Boundaries Between LLM Features and RL Trading Policies When Verification Fails: How Compositionally Infeasible Claims Escape Rejection Back to the Barn with LLAMAs: Evolving Pretrained LLM Backbones in Finetuning Vision Language Models CFMS: A Coarse-to-Fine Multimodal Synthesis Framework for Enhanced Tabular Reasoning Delving Aleatoric Uncertainty in Medical Image Segmentation via Vision Foundation Models RAG-KT: Cross-platform Explainable Knowledge Tracing with Multi-view Fusion Retrieval Generation A molecular clock for writing systems reveals the quantitative impact of imperial power on cultural evolution Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models QShield: Securing Neural Networks Against Adversarial Attacks using Quantum Circuits Mem$^2$Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation EvoNash-MARL: A Closed-Loop Multi-Agent Reinforcement Learning Framework for Medium-Horizon Equity Allocation Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music ZoomR: Memory Efficient Reasoning through Multi-Granularity Key Value Retrieval DIB-OD: Preserving the Invariant Core for Robust Heterogeneous Graph Adaptation via Decoupled Information Bottleneck and Online Distillation AOP-Smart: A RAG-Enhanced Large Language Model Framework for Adverse Outcome Pathway Analysis A Benchmark for Gap and Overlap Analysis as a Test of KG Task Readiness Task2vec Readiness: Diagnostics for Federated Learning from Pre-Training Embeddings Retinal Cyst Detection from Optical Coherence Tomography Images Resilient Write: A Six-Layer Durable Write Surface for LLM Coding Agents Speaking to No One: Ontological Dissonance and the Double Bind of Conversational AI MeloTune: On-Device Arousal Learning and Peer-to-Peer Mood Coupling for Proactive Music Curation Advancing Polish Language Modeling through Tokenizer Optimization in the Bielik v3 7B and 11B Series TInR: Exploring Tool-Internalized Reasoning in Large Language Models Do BERT Embeddings Encode Narrative Dimensions? A Token-Level Probing Analysis of Time, Space, Causality, and Character in Fiction Prosociality by Coupling, Not Mere Observation: Homeostatic Sharing in an Inspectable Recurrent Artificial Life Agent Generating Multiple-Choice Knowledge Questions with Interpretable Difficulty Estimation using Knowledge Graphs and Large Language Models Deep-Reporter: Deep Research for Grounded Multimodal Long-Form Generation When More Thinking Hurts: Overthinking in LLM Test-Time Compute Scaling Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models Teaching Language Models How to Code Like Learners: Conversational Serialization for Student Simulation Detecting RAG Extraction Attack via Dual-Path Runtime Integrity Game Bringing Value Models Back: Generative Critics for Value Modeling in LLM Reinforcement Learning FACT-E: Causality-Inspired Evaluation for Trustworthy Chain-of-Thought Reasoning Do LLMs Build Spatial World Models? Evidence from Grid-World Maze Tasks SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting FedRio: Personalized Federated Social Bot Detection via Cooperative Reinforced Contrastive Adversarial Distillation Skill-SD: Skill-Conditioned Self-Distillation for Multi-turn LLM Agents Principles Do Not Apply Themselves: A Hermeneutic Perspective on AI Alignment Learning and Enforcing Context-Sensitive Control for LLMs Preference-Agile Multi-Objective Optimization for Real-time Vehicle Dispatching Efficient Process Reward Modeling via Contrastive Mutual Information
Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training
Vin Bhaskara, Haicheng Wang · 2026-04-21 · via cs.AI updates on arXiv.org

Local prediction-error-based curiosity rewards focus on the current transition without considering the world model's cumulative prediction error across all visited transitions. We introduce Curiosity-Critic, which grounds its intrinsic reward in the improvement of this cumulative objective, and show that it admits a tractable per-step surrogate: the difference between the current prediction error and the asymptotic error baseline of the current state transition. We estimate this error baseline online with a learned critic co-trained alongside the world model; since the critic only has to learn how hard a transition is to predict, its estimate of the irreducible noise floor converges well before the world model saturates, redirecting exploration toward learnable transitions. The reward is higher for learnable transitions and collapses toward zero for stochastic ones, thereby separating epistemic (reducible) from aleatoric (irreducible) prediction error online. Prior prediction-error curiosity formulations, from Schmidhuber (1991) to learned-feature-space variants, emerge as special cases corresponding to specific approximations of this error baseline. Experiments on a stochastic grid world show that Curiosity-Critic outperforms prediction-error, visitation-count, and Random Network Distillation methods in training speed and final world model accuracy.