惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

cs updates on arXiv.org

End-to-End Intracortical Speech Decoding from Neural Activity Ant Backpressure Routing for Dynamic Wireless Multi-hop Networks with Mixed Traffic Patterns AvAtar: Learning to Align via Active Optimal Transport Faithfulness as Information Flow: Evaluating and Training Faithful Chain-of-Thought Reasoning AcroRL: Learning Aggressive Quadrotor Inversion using Bidirectional Thrust Learning regime-dependent governing equations: A symbolic decision tree approach Phonetic Modeling of Dialectal Variation in Vietnamese Speech Enhancing Reliability in LLM-Based Secure Code Generation Momentum Streams for Optimizer-Inspired Transformers Structure-Aware RAG: Structured Retrieval Augmented Generation from Noisy Data for Conversational Agents CRISP -- Clustering-Based Redundancy-Reduced Instance Sampling for Pathology Case Representation and Retrieval Reframing LLM Agent Security as an Agent-Human Interaction Problem CoDA: Color Distribution Probing for Efficient and Generalizable AI-Generated Image Detection Unlocking Apple's Private Cloud Compute: An Analysis of Privacy-Preserving Artificial Intelligence Interdomain Attention: Beyond Token-Level Key-Value Memory LLMs Show No Signs Of Individuated Metacognition Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes Synheart Capacity: A Theory-Driven Physiological Representation of Cognitive Capacity Dynamics from Wearable Signals Asymmetric Adaptation-based Real-time Fault Diagnosis Under Transitional Operating Conditions Poisoning the Watchtower: Prompt Injection Attacks Against LLM-Augmented Security Operations Through Adversarial Log Content Generative OOD-regularized Model-based Policy Optimization PrivFusion: A Privacy-preserving Multi-Agent Framework for Harmonizing Distributed Datasets Humans Cannot Detect AI-Generated Media But Communities May -- For Now: Collaborative AI Detection in r/RealOrAI on Reddit A Comprehensive Evaluation of Vertex Elimination Algorithms for Algorithmic Differentiation MeVer at CheckThat! 2026: Cluster-Aware Hard-Negative Mining for Multilingual Scientific-Source Retrieval Cross-Modal Action Recognition in Egocentric Video Using Mamba: Integrating RGB and Hand Skeleton Streams via CLS Token Fusion Strategies Five Queries Are Enough: Query-Efficient and Surrogate-Free Membership Inference Attacks on RAG via Entailment Designs, linear codes, plateaued functions, and their interconnections On Permutation Groups of Cyclic Codes over Finite Fields Program Synthesis for Non-Linear Real Arithmetic: Going Beyond Realizability Temporal Concept Drift in Legal Judgment Prediction: Neural Baselines Across Three Epochs of Ukrainian Court Decisions Discovering Lexical Gaps Using Embeddings from Multilingual LLMs TUBE: Tangent Upper Bound on Evidence for Discrete Diffusion Language Models Evolving Robustness--Exploration Trade-off in Online Reinforcement Learning via Quantile Bayesian Risk MDPs CurveRL: Principled Distribution-Aware Context Reweighting for LLM Reasoning Fourier Feature Pyramids for Physics-Informed Neural Networks Side-by-side Comparison Amplifies Dialect Bias in Language Models Balancing Fairness, Privacy, and Accuracy: A Multitask Adversarial Framework for Centralized Data-Driven Systems Representation-Guided Discrete Molecular Graph Retrosynthesis Learning Laplacian Eigenspace with Mass-Aware Neural Operators on Point Clouds Gaussian Rank-Based Neighborhood Degree for Graph Neural Networks in Image Classification Polar: Agentic RL on Any Harness at Scale Identifying and Mitigating Systemic Measurement Bias in Production LLM Inference Benchmarks An Empirical Evaluation of LLM-Generated Code Security Across Prompting Methods LLMTabBench: Evaluating LLMs on Binary Tabular Classification From Zero to Few Shots How Much Structure Do LLMs Need? Evaluating LLMs for Bibliometric Cluster Description Resident KV Claims: A Conformance Contract for Future Reuse under Active KV Pressure Bayesian Rational Search Engine User Plume Segmentation from MethaneSAT with Cross-Sensor Transfer Learning and Physics-Informed Postprocessing RxGS: Receiver-Generalizable 3D Gaussian Splatting for Radio-Frequency Data Synthesis Can Graph-Based Microservice Performance Detection Be Used for Microservice Intrusion Detection? Safety-Oriented Routing Analysis of Mixtral MoE Under Benign and Harmful Prompts GIBLy: Improving 3D Semantic Segmentation through an Architecture-Agnostic Lightweight Geometric Inductive Bias Layer Improving the Accuracy of the Exponentially Fitted Scheme on Piecewise Uniform Meshes ArtSplat: Feed-Forward Articulated 3D Gaussian Splatting from Sparse Multi-State Uncalibrated Views Benchmarking Patent Embeddings: A Multi-Task Evaluation of 22 Models Across Retrieval, Classification, and Clustering LEARNT: A Practical Estimator for Cardinality of LIKE Queries with Formal Accuracy Guarantees SAM: State-Adaptive Memory for Long-Horizon Reasoning Agent Terrain-Adaptive Grouser Wheel for Optimal Planetary Exploration: Design and Experimental Investigation Modernizing User Privacy Preference Measurement through GPPI: A GDPR-aligned Privacy Preference Item Bank When Does Synthetic Patent Data Help? Volume-Fidelity Trade-offs in Low-Resource Multi-Label Classification Toward Enactive Artificial Intelligence ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions Found in Conversation: LLMs Teach Themselves to Close the Multi-Turn Gap Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions Decompose-and-Refine: Structured Legal Question Answering with Parametric Retrieval SEAL: Synergistic Co-Evolution of Agents and Learning Environments DRInQ: Evaluating Conversational Implicature with Controlled Context Variation ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale Private Adaptive Covariance Estimation via Gaussian Graphical Models A lift for input-convex neural network training Omissive Bias in Religious Representation: Benchmarking LLM Answers to Everyday Ethical Decision-making Refined Analysis of Entropy-Regularized Actor-Critic ChainzRule: Sample-Efficient, Robust Deep Learning Across Tabular, NLP, and Vision Tasks From One-Pass SGD to Data Reuse: Mini-Batch Scaling Laws in Sketched Linear Regression Optimizing Digital Therapeutic Interventions: Online Learning under Endogenous Adherence An Interactive Paradigm for Deep Research A Unified Python Framework for Direct PPO-based Control of AHUs with Economizer Logic and CO2-Constrained Ventilation Assessing the Operational Viability of Foundation Models for Time Series Forecasting Batch Normalization Amplifies Memorization and Privacy Risks Rubato: Transcribing Piano Music with Timestamps CAffNet: Hard Constraint-Affine Neural Networks ChainLearn: A Blockchain-Based Capacity-Aware Framework for Federated Ensemble Learning GEESE: Genotype-aware End-to-End Spatio-temporal Embedding for Behavioral Phenotyping Smoother Action Chunking Flow Policy via Prior-Corrected Orthogonal Trust-Region Guidance The Model Is Not the Product: A Dual-Pillar Architecture for Local-First Psychological Coaching Concept Drift Adaptation Using Self-Supervised and Reinforcement Learning In Android Malware Detection Vision-Guided Outdoor Flight and Obstacle Evasion via Reinforcement Learning Analyzing the Effects of Two-Stage Peer Evaluation Beyond Final Answers: Auditing Trajectory-Level Hallucinations in Multi-Agent Industrial Workflows A Reinforcement Learning Inspired Latent Yield Based Adaptive Algorithm Switching Mechanism SliceWorld: A Predictive and Controllable World-State Model for CT Report Generation Treatment Effect Estimation with Differentiated Networked Effect on Graph Data Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation Accuracy Analysis of the Proxy Point Method with Applications to Some Toeplitz Matrices Attested Tool-Server Admission: A Security Extension to the Model Context Protocol Deep-Research Agents Can Be Poisoned via User-Generated Content How Well Do Models Follow Their Constitutions? Sketch Bug: Using Sketch-Based Input for Interactive Code Debugging ECo-MoE: Embodiment-Conditioned Mixture of Experts Increases the Evolvability of Robots
No Certificate, No Execution: Certified Traces as a Foundation for Trustworthy AI Agents
Xiao-Yang Li · 2026-05-26 · via cs updates on arXiv.org

View PDF HTML (experimental)

Abstract:We argue that trustworthy AI agents, especially in high-stakes and policy-governed domains, should make execution conditional on certified traces rather than rely only on stronger generative models, output-level guardrails, or post-hoc audits. A generative agent may propose recommendations, tool calls, reports, or actions, but generation is not permission: an action may be computable yet impermissible, and individually permissible actions may compose into an impermissible trace. We formalize trustworthy agency through a \textbf{Proposal--Certification--Execution (PCE)} architecture: a probabilistic generating machine $M_G$ proposes candidate execution traces, a \textbf{Permissibility Machine} $M_\Pi$ certifies proposed traces under a policy system $\Pi$, and execution proceeds only for certified traces. The executable trace language is $L_{\mathrm{exec}} = L_G \cap L_{\mathrm{cert}}(M_\Pi)$. Before execution, a trace is a structured pre-execution record submitted for certification: it specifies intended steps, evidence, proposed tool calls, approvals, replayable computations, credentials, and execution conditions. This perspective complements chain-of-thought monitorability: visible reasoning may help detect misbehavior, but monitorability is not certifiability, and reasoning is only one component of a broader execution trace. The formal principle is simple: an agent-generated trace should execute only when it carries a checkable certificate witnessing permissibility under $\Pi$: \textbf{no certificate, no execution}. We develop certified traces and Permissibility Machines as foundations for trustworthy AI agents, connect trace certification to proof-carrying execution, proof memory, privacy, and zero-knowledge certificates, and propose evaluating agents by what generated traces can be safely certified for execution, not by output accuracy alone.
Subjects: Computational Engineering, Finance, and Science (cs.CE)
Cite as: arXiv:2605.24462 [cs.CE]
  (or arXiv:2605.24462v1 [cs.CE] for this version)
  https://doi.org/10.48550/arXiv.2605.24462

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Xiao-Yang Liu [view email]
[v1] Sat, 23 May 2026 08:24:42 UTC (4,807 KB)