惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

cs.AI updates on arXiv.org

Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation Metropolis-Scale Resilient and Trustworthy Traffic Flow Inference Using Multi-Source Data Guarded Repair for Harm-Aware Post-hoc Replacement of LLM Mathematical Reasoning Mimir: Large-scale Multilingual Concept Modeling STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media Evidence-Linked Radiology Reporting: A Human-Supervised Reference Architecture for Structured Imaging Intelligence Overcoming "Physics Shock" in Earth Observation A Heteroscedastic Uncertainty Framework for PINN-based Flood Inference When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure Complement Submodular Information Measures for Balanced and Robust Data Selection A Large-Scale Dataset and Benchmark: Do Protein-Ligand Models Learn Binding Sites or Just Binding Likelihood? The Concept Allocation Zone: Tracking How Concepts Form Across Transformer Depth LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs Rethinking Federated Unlearning via the Lens of Memorization Language Bias in LVLMs: From In-Depth Analysis to Simple and Effective Mitigation Who judges the judges? Governance from metrics: a runtime framework for continuous LLM compliance monitoring Verified SHAP: Provable Bounds for Exact Shapley Values of Neural Networks Quaternion Self-Attention with Shared Scores CONF-KV: Confidence-Aware KV Cache Eviction with Mixed-Precision Storage for Long-Horizon LLM Cascade-KDE: Robust Time-Series Restoration under Out-of-Distribution Impulse Corruptions OSDTW: Optimal Shared Depth and Task Weighting for Long-Tailed Recognition Spectral Probe-Circuits: A Three-Step Recipe for Identifying Attention-Head Circuits in Pretrained Transformers PILOT: Policy-Informed Learned Optimization for Adaptive Deep Network Training Selective Test-Time Compute Scaling for Click-Through Rate Prediction via Uncertainty-Triggered Feature Path Exploration Catching The Correct Answer Trap: Characterising AI Tutor Blind Spots When Analysing Student Reasoning Assessing the Operational Viability of Foundation Models for Time Series Forecasting Generative OOD-regularized Model-based Policy Optimization Filtered Posterior Mean Collections: A Unified Framework for Analytical Models of Diffusion Generalization When Reasoning Hurts: Source-Aware Evaluation of Frontier LLMs for Clinical SOAP Note Generation Grammatically-Guided Sparse Attention for Efficient and Interpretable Transformers Knowledge Graph Modulated Deep Learning for Limited-Sample Clinical Data Analysis Towards a Universal Causal Reasoner Raon-Speech Technical Report RealBench: Benchmarking Data-Driven Numerical Weather Forecasting Under Operational Conditions and Extreme Event Challenges LAPLEX: The FFT of Learnable Laplace Kernels Parameter Efficient Multi-Class Intelligent Scheduling for Multimodal Online Distributed Industrial Anomaly Detection Bilevel Optimization of Synthetic Trajectories for Multi-Turn LLM Fine-Tuning Inference-Time Alignment of Diffusion Models via Trust-Region Iterative Twisted Sequential Monte Carlo Factorize to Generalize: Retrieval-Guided Invariant-Dynamic Decomposition for Time Series Forecasting Leveraging Gauge Freedom for Learning Non-Gradient Population Dynamics of Stochastic Systems Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning Measuring the Depth of LLM Unlearning via Activation Patching ASTRO: Adaptive Spatio-Temporal Reinforcement Optimization for GNN Powered Anomly Detection in Cyber Physical Systems The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models Feature Lottery? A Bifurcation Theory of Concept Emergence Abduction-Deduction Entanglement: Domain Generalization via Representation Transplants Balancing Fairness, Privacy, and Accuracy: A Multitask Adversarial Framework for Centralized Data-Driven Systems Extracting Training Data from Diffusion Language Models via Infilling Cross-Domain Energy-Guided Diffusion Generation for Off-Dynamics Reinforcement Learning ChaosBench-Logic v2: Evaluating LLM Logical Reasoning over Dynamical Systems at Scale An Interactive Paradigm for Deep Research Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction SemanticZip: A Pilot Framework for Lossy Text Compression with LLMs as Semantic Decompressors Improving Labeling Consistency with Detailed Constitutional Definitions and AI-Driven Evaluation Side-by-side Comparison Amplifies Dialect Bias in Language Models Momentum Streams for Optimizer-Inspired Transformers Mixture of Complementary Agents for Robust LLM Ensemble Treatment Effect Estimation with Differentiated Networked Effect on Graph Data JudgmentBench: Comparing Rubric and Preference Evaluation for Quality Assessment LLM Agent Based Renewable Energy Forecasting Using Edge and IoT Data A Review of Solar Wind Weather and Grid Aware Decision Support Federated Learning over Human-Body Communication for On-Body Edge Intelligence: A Survey, Taxonomy, and BODYFED-HBC Scheduling Vignette By Their Fruits You Will Know Them: Comparing Formalizations of Law by the Decisions They Encode READER: Reasoning-Enhanced AI-Generated Text Detection SEP-Attack: A Simple and Effective Paradigm for Transfer-Based Textual Adversarial Attack On the Stability and Realizability of Recurrent Polynomial Surrogate Ternary Logic Gate Networks SomaliBench Eval: Measuring English-to-Somali Refusal Gaps in Open-Weight Language Models Hidden-State Privacy Has an Empty Middle Beyond the Aggregation Dilemma: Prior-Retaining Decoupled Learning for Multimodal Graphs Tiny Brains, Giant Impact: Uncovering the Keystone Neurons of LLM with Just a Few Prompts Polymorphism Is Rotation: Operational Mechanistic Interpretability from a Two-Layer Transformer to Pythia-70m On the Impact of Class Imbalance on the Learning Dynamics of Deep Neural Networks:An Intuitive Insight Beyond Generative Priors: Minority Sampling with JEPA-Guided Diffusion TriVAL: A Tri-Validation Framework for Faithful Automatic Optimization Modeling Explainable Retinal Imaging for Prediction of Multi-Organ Dysfunction in Type 2 Diabetes Riemannian-Manifold Steering: Geometry-Aware Generative Autoencoders for Label-Free Steering TGFormer: Towards Temporal Graph Transformer with Auto-Correlation Mechanism Context: Proactive Goal-Directed Intelligence via Composable Sandboxed Programs, Declarative Wiring, and Structured Interaction Not All Transitions Matter: Evidence from PPO Theoretical Analysis of Sparse Optimization with Reparameterization, Weight Decay, and Adaptive Learning Rate Signs Beat Floats: Low-Rank Double-Binary Adaptation for On-Device Fine-Tuning Generative Representation Learning on Hyper-relational Knowledge Graphs via Masked Discrete Diffusion Courant: a State-Adaptive Perceiver-Based Neural Surrogate with Local Support and Interpretable Field Decomposition Truthful Online Preference Aggregation for LLM Fine-Tuning in Mobile Crowdsourcing GL-LFGNN:A Global-Local Dual-branch Causal Graph Neural Network Based on Liang-Kleeman Information Flow for EEG Emotion Recognition PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection World-State Transformations for Neuro-symbolic Interactive Storytelling Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization TS-Skill: A Benchmark for Evaluating Analytical Skills in Time-Series Question Answering EchoDistill:Alignment Noisy-to-Clean Self-Distillation for Robust Audio LLMs A general tensor-structured compression scheme for efficient large language models Temporal Concept Drift in Legal Judgment Prediction: Neural Baselines Across Three Epochs of Ukrainian Court Decisions Knowledge Graph-Driven Expert-Level Reasoning for Neuroscience Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models Iterative Refinement Neural Operators are Learned Fixed-Point Solvers: A Principled Approach to Spectral Bias Mitigation Scale When Needed: Adaptive Neuron-level Mixed Precision Quantization Aware Training AI-Associated Lexical Shifts Across 34 Languages: Cross-Lingual Convergence and Diachronic Uptake in News Writing Document Classification Pattern Recognition via Information Fusion: A Systematic Review of Multimodal and Multiview Representation Approaches Batch Normalization Amplifies Memorization and Privacy Risks Binding Visual Features Point by Point A Multi-Agent LLM Framework for Rating the Quality of Surgical Feedback
KYA: A Framework-Agnostic Trust Layer for Autonomous Systems with Verifiable Provenance and Hierarchical Policy Composition
Kolawole Qua · 2026-05-26 · via cs.AI updates on arXiv.org

View PDF HTML (experimental)

Abstract:Observability tells operators when an agent is slow. KYA tells operators when an agent is wrong, drifting, leaking, or quietly going rogue. We present KYA (Know Your Agents), an open-source trust and governance layer for autonomous systems composed of five primitives: (1) a four-gate inbound apply pipeline composing Ed25519 signature verification with multi-anchor pinning, persist-time expiry, only-tighten composition, and operator-approval-as-default; (2) an only-tighten composition algebra over a three-channel multi-tenant hierarchy (platform default,tenant override, signed external recommendation); (3) KYP -- Know Your Principal, a schema-level unification of trust scoring across human users, AI agents, and service accounts; (4) auditable interaction-multiplier amplification over an AIVSS-shaped additive baseline, with bounded asymmetric per-interaction multipliers carrying stable audit codes; and (5) two-axis delegation attribution combining a static observation-gated delegation-trust premium with zero-config runtime orchestrator-blame at three SDK hook surfaces. KYA is framework-agnostic across 22 agent frameworks. The pure-function scorer runs sub-millisecond at p99 and the system sustains ~1,800 ops/sec at 20 concurrent workers with HMAC chain integrity preserved end-to-end. The four-gate inbound apply pipeline rejects forged, expired, loosening, and unapproved recommendations on every trial (1,200 / 1,200) with sub-millisecond p99 latency on SQLite. KYA detects 89% of 1,200 adversarial probes from PyRIT and Garak, including the recently-published topology-guided multi-agent attack. The system is available under Apache 2.0 as the veldt-kya package on PyPI (release candidate at submission time; stable v0.1.0 forthcoming)
Comments: 26 pages including appendix. Code available under Apache 2.0 at this https URL (pip install veldt-kya). Two-domain worked examples (loan decisioning under NYDFS/ECOA/CFPB; clinical triage under HIPAA/21 CFR Part 11/FDA SaMD).Reproducibility artifacts in-tree
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Multiagent Systems (cs.MA); Software Engineering (cs.SE)
ACM classes: I.2.11; K.6.5; K.4.1; K.4.2; D.4.6
Cite as: arXiv:2605.25376 [cs.CR]
  (or arXiv:2605.25376v1 [cs.CR] for this version)
  https://doi.org/10.48550/arXiv.2605.25376

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Kolawole Quadri [view email]
[v1] Mon, 25 May 2026 02:59:54 UTC (179 KB)