惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
Recorded Future
Recorded Future
T
Tenable Blog
S
Securelist
C
CERT Recently Published Vulnerability Notes
T
Threatpost
S
Schneier on Security
A
Arctic Wolf
The Hacker News
The Hacker News
C
CXSECURITY Database RSS Feed - CXSecurity.com
Know Your Adversary
Know Your Adversary
P
Privacy International News Feed
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
The Register - Security
The Register - Security
Cisco Talos Blog
Cisco Talos Blog
AWS News Blog
AWS News Blog
K
Kaspersky official blog
T
True Tiger Recordings
T
Threat Research - Cisco Blogs
V
Vulnerabilities – Threatpost
P
Palo Alto Networks Blog
T
The Exploit Database - CXSecurity.com
小众软件
小众软件
B
Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Microsoft Azure Blog
Microsoft Azure Blog
Cyberwarzone
Cyberwarzone
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tor Project blog
Spread Privacy
Spread Privacy
Malwarebytes
Malwarebytes
P
Proofpoint News Feed
F
Fox-IT International blog
F
Fortinet All Blogs
P
Privacy & Cybersecurity Law Blog
G
GRAHAM CLULEY
量子位
Latest news
Latest news
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
博客园 - 叶小钗
Project Zero
Project Zero
T
Tailwind CSS Blog
N
Netflix TechBlog - Medium
Martin Fowler
Martin Fowler
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
I
Intezer
博客园_首页
腾讯CDC
H
Hackread – Cybersecurity News, Data Breaches, AI and More
D
Darknet – Hacking Tools, Hacker News & Cyber Security

Hacker News - Newest: "AI"

Your AI Tools Are Only as Good as Your Judgment — And That's the Point GitHub - shubhamgoel27/artifold: 📚 A local-first library for the stuff you make with AI. Index, search, preview, share — and use your past work as the style guide for your next one. Qualcomm strikes AI chip deal with TikTok owner ByteDance Why I Made a Journal for AI-Generated Papers — Cesar A. Hidalgo AI Billing is (mostly) token plumbing Xiaomi MiMo Api Open Platform - Token Plan Global Launch When AI Writes the World's Software, Who Verifies It? — Leonardo de Moura GitHub - aarifmms/keyblind: keyblind New studies find systematic religious bias in ChatGPT, other AI Meta and Google AI safety controls can be stripped in minutes, Financial Times testing finds SK hynix unveils self-cooling iHBM chips to combat AI overheating ByteDance offers AI team special stock to combat poaching GitHub - Agile-V/agile_v_skills: 🔬 Verifiable AI-Augmented Engineering Framework - Stop AI hallucinations with formal traceability (REQ→ART→TC). Agent Skills for Claude Code, Cursor, VS Code & Copilot. Enterprise-grade: ISO 9001, ISO 27001, GxP-ready. Red Team verification, multi-cycle lifecycle, behavioral anti-patterns. The Collaborative Exoskeleton of AI Science GitHub - AlphaBitCore/nexus-gateway AI agents imperiled by critical vulnerability in open source package The Vibe Coding Era: Why AI Won't Replace Software Engineers [video] AI agents are scrambling power users' brains Ask HN: Has AI affected negatively the job market for devs? Show HN: I built a tool to auto-accept AI slop and bigtech devs loves it OpenAI admits AI hallucinations are mathematically inevitable, not just engineering flaws starlette - secwest.net - secure virtual engagement Shopify's AI Developer Sam Altman and Dario Amodei are both walking back their AI jobs apocalypse prophecies as they eye blockbuster IPOs | Fortune twitter.com Robotics giant Figure AI demonstrates its robots to the world Bay Area mom out thousands after scammers use AI to mimic daughter's voice in fake kidnapping The Swing Sensei App - App Store 6 Million Fake GitHub Stars: How to Vet Open-Source AI Tools Before You Bet on Them Why AI's Biggest Deals Price Assets Before Revenue AI chatbots show bias toward Catholicism, researchers say LMIM OS – an offline AI ecosystem. Voice, RAG, WhatsApp. ++ One file. 0 setup Authors versus AI and the risks to government public sector push There's at Least One Job That AI Isn't Killing AskMingLi: AI-assisted BaZi chart readings AI Isn't Management. Try Explaining That to Matthew Prince Who Wants to Be Hired? (May 2026) – AI Engineer (Python, RAG, Agentic Workflows) twitter.com The AI Industry Just Walked Into the Vatican Humanize – two LLM-agnostic skills to rewrite and detect AI text HypeScribe – AI-powered transcription, summaries, and search for any audio/video GitHub - NikhilSKashyap/interviewsignal: AI-native broad-interviewing. Share a code, capture thought process, auto-grade on submit. pip install, zero setup cost, pure signal. Uber burned through its entire 2026 AI budget in four months. Now its COO is questioning whether it's worth it | Fortune FlowLink: MCP proxy blocking destructive AI agent commands Blitzy AI charges by LOC generated AI-Related Issues in Securities Cases: Privilege Pitfalls, 'AI Washing' Claims AI is killing All About Berlin Pheno: AI-Powered Personalized Health Platform GitHub - rishavsunny12/harvestGuard: Lets see how claude code creatively creates a project for me NES, SNES, Genesis, VirtualBoy, and PSX | A journey with AI and Recompilation The Rise of the AI Script Kiddie Stack Overflow's forum is dead thanks to AI SpaceX's AI Pursuits Have yet to Take Off Do AI Risks Require Extraordinary Government Intervention? GitHub - Dylanchess0320/LuckyD-Code: LuckyD Code - Terminal AI Assistant / Discord - https://discord.gg/ApEKKUuKd I applied to YC with an AI-native IDE for hardware prototyping AI may be fuelling U.S. business creation, but few signs of a similar trend in Canada A Board Game agent built using Sanity Context and Vercel's AI SDK | Sanity Microsoft’s GitHub was positioned to win the AI coding race. Outages got in the way Too dangerous to release: is Mythos the start of the restricted-AI era? Show HN: Audiogen – a new take on generative music AI ScribeItLocal — Free Local Video & Audio Transcription The Three-Cylinders Problem — When AI Models Choose Beauty Over Truth Show HN: MurrDB: A RocksDB-based NVMe/S3 cache for AI inference workloads The rise of the -10x engineer: The negative side of AI productivity Safe Ways to Use AI Agents Programming Is Real Engineering, And AI Proves It What AI race? China and U.S. AI are tightly connected High-VRAM GPUs aren't the future of local AI GitHub - mbbill/mind-expander: A shared visual workspace for understanding and steering code with AI agents. Show HN: We made a cinematic heist trailer with 4 AI models for $60 Release shield-v0.7.0 · AperionAI/shield AI Startup Says It Will Pay People $2,000 a Month to Masturbate—Yes, Really MCP: Security Design Considerations for AI-Driven Automation by NSA [pdf] Rethinking organizational design in the age of agentic AI Client Challenge GitHub - takshd15/Laptop-AI GitHub - SynapCores/synapcores-agent: Real, framework-free AI support agent where SynapCores is the brain — memory, RAG, tool routing, generation in one database. Browser chat widget + live Brain debug sidebar. Fork and run in 30s. The Math Changed AI-Augmented Software Development Manifesto Whisper by Remskill — AI Voice Assistant for Desktop AI tools lead to 'clear racial disparities' in job hiring Excerpts from Pope Leo XIV's manifesto on humanity and AI | AP News GitHub - StackOneHQ/stack-nudge ‘BusPatrol’ Put AI Cameras in Tens of Thousands of School Buses. Now They Want to Give Cops Access AI Killed Stack Overflow (and why that sucks) AI-Powered Cyber Attacks in 2026: How Adversaries Are Evolving Rogue states are putting AI agents to work on sanctions evasion Show HN: Treats Human and AI the Same Seventy years of mathematics built the thing we call AI Genre glitches and unexpected promotional phrases as a sign of AI writing Reverse centaurs and the failure of AI (2021) HVTracker – trust registry for open-source AI agents The Inevitability: Why AI Cannot Be Stopped, Slowed, or Resisted WebBridge - Let Kimi Agent Drive Your Browser | Kimi RTMH: Pope Leo’s Magnifica Humanitas on AI — LessWrong GitHub - SkepticCTO/decoding_the_language_machine: Documentation, Prompts, and Media for the "Decoding the Language Machine" series Block open-sourced Goose, an AI agent that scaled to 60% of the company Beyond Recall: Behavioral Specification as an Interpretive Layer for AI Personalization GitHub - compuficial/apery: Synthetic Data Generator for Agents
The Five Pillars of AI Agent Accountability: A Diagnostic Framework for Engineering Leaders
baroiall · 2026-05-27 · via Hacker News - Newest: "AI"

You’re in a board meeting. The CISO is presenting on AI risk. The CFO asks a simple question:

“When that finance agent we deployed last quarter accessed a customer payment record, can we tell who authorized it, what policy permitted it, and produce the full audit trail?”

The CISO looks at the head of the platform. The head of the platform looks at security. Nobody answers.

If you can picture that meeting happening at your company, you’re not alone. McKinsey found that only one-third of organizations have AI agent governance maturity at level 3 or higher. The other two-thirds are exactly the silence in that boardroom.

This post is the diagnostic framework that closes that gap. It’s part 2 of a five-part series on AI agent accountability, and if you only have time to read one post in the series, read this one. By the end you’ll have a five-question assessment to run with your team this week, and a maturity model to score where you stand today.

Not all governance equals AI agent accountability. Many enterprises believe they’re covered because they have network policies or an API gateway, but governance without accountability is a security theater: it might prevent some bad outcomes, but it can’t prove why good outcomes were permitted, trace what happened when something goes wrong, or satisfy an auditor asking for evidence.

True AI agent accountability requires five distinct capabilities working together. Miss any one and you have a gap that will surface during your next incident, audit, or regulatory review.

The five pillars are:

  • Traceability: Every agent interaction produces an end-to-end record automatically.
  • Authorization provenance: Every permitted action is traceable to a specific, auditable policy.
  • Identity and ownership: Every agent has a verified identity and a clear human owner.
  • Policy-based governance at scale: Declarative, attribute-based policies that don’t break at 100 agents.
  • Human oversight and intervention: Humans can see, review, and override agent behavior in real time.

Each pillar comes with a question you can ask your team. Below, we’ll work through each one, and at the end, a 5-level maturity model and a 5-question assessment to score where you stand today.

Pillar 1: Traceability

“Can you trace what happened, end to end?”

When Agent A calls Agent B, which calls Tool C, which accesses Database D, can you reconstruct the entire chain? Not just that it happened, but when, how long each step took, and what the outcome was at each hop?

Traceability means every agent interaction produces a structured, correlated record automatically. This is distributed tracing applied to agent communication. Each hop in the chain is a span; the full trace tells the complete story of an interaction from trigger to outcome.

Without traceability, incident response is guesswork. You know something went wrong, but you can’t determine the chain of events that led there.

The test: Can your team pull up a single interaction and see the full path it took across every agent and tool in your network, with timestamps and outcomes at every hop?

Pillar 2: Authorization provenance

“Can you prove why it was permitted?”

Blocking unauthorized actions is table stakes. The harder (and more important) question is, can you prove why authorized actions were permitted?

Authorization provenance means every allowed interaction is traceable to a specific, auditable policy. Not just “Agent A was allowed to call Agent B,” but “Agent A was allowed to call Agent B because Policy X grants agents with capability Y access to agents with risk-level Z.”

This is the difference between a lock on the door and a sign-in sheet. The lock prevents unauthorized entry. The sign-in sheet proves who was authorized, when, and by what authority.

Without authorization provenance, your compliance team cannot demonstrate that access was intentional and governed, only that it wasn’t blocked. That distinction is the difference between passing an audit and failing one..

The test: For any agent-to-agent interaction in your network, can you identify the specific policy that permitted it and the attributes that triggered that policy?

Pillar 3: Identity and ownership

“Who owns this agent, and who is responsible when it acts?”

Every agent must have two things: a verified identity (it is who it claims to be) and a clear owner (a person accountable for its behavior).

Identity means the governance layer can verify that an agent is genuinely the agent it claims to be, and not a compromised workload masquerading as a legitimate one. This requires cryptographic identity verification, not just a name in a configuration file.

Ownership means that when an incident occurs, there is a specific person (not a team alias, not a Slack channel, not “the AI team”) who is accountable. Without clear ownership definitions, accountability diffuses across components, and diffused accountability is no accountability at all.

Agent registration should capture: who registered it, what team owns it, what it’s designed to do, and what permissions it’s been granted.

The test: Pick any agent in your network. Can you immediately identify it’s a verified identity, who registered it, which team owns it, and what permissions it has… all without asking around?

Pillar 4: Policy-based governance at scale

“Does your security model survive agent #101?”

With 10 agents, you can manage permissions by hand. You write explicit rules: “Agent A can call Agent B. Agent C can call Agent D.” You maintain a spreadsheet. It works.

With 100 agents, it doesn’t. With 1,000, it’s impossible. Every new agent requires updating every relevant policy. Permissions become a tangled web that nobody fully understands. New agents get deployed ungoverned because updating the allow-lists is too slow.

Scalable governance requires declarative, attribute-based policies. Instead of naming specific agents, policies reference agent attributes: capabilities, risk levels, teams, environments.

  • “Low-risk agents can communicate with low-risk agents.”
  • “Agents on the finance team can access finance MCP servers.”
  • “Agents in production can only call production-grade tools.”

When a new agent registers with matching attributes, it’s governed from day one — automatically. No policy updates required. No spreadsheet to maintain. The governance scales with the agent network, not against it.

The test: When your team deploys a new agent next week, will it be governed by existing policies automatically, or will someone need to manually update an allow-list?

Pillar 5: Human oversight and intervention

“Can a human review, approve, or override?”

The EU AI Act (Article 14) requires effective human oversight of high-risk AI systems. But human oversight doesn’t mean a human approves every agent action, that would eliminate the value of agents entirely.

Effective human oversight means:

  • Visibility: Humans can see what agents are doing, which agents are communicating, and what policies govern them.
  • Review: Humans can examine agent interactions after the fact, with enough context to understand what happened and why.
  • Intervention: Humans can modify policies, revoke agent access, or halt agent communication in real time when necessary.
  • Dashboard, not log file: The oversight interface should be a visual dashboard with communication graphs and policy visualization, not a grep command on a log file.

The test: Right now, can someone on your team open a dashboard, see which agents are communicating with which, and modify the policies governing that communication — all without touching a terminal?

How to assess your AI agent accountability maturity

Run this five-question assessment with your platform lead, security lead, and one compliance representative in a 30-minute meeting. For each question, you have three possible answers: Yes (you’ve got it), Partial (you can answer for some agents but not all), or No (gap).

  1. Pick the most recent agent-to-agent interaction in your environment. Can someone on the call pull up the full trace (every hop, timestamp, and outcome) in under five minutes? (Pillar 1)
  2. For that same interaction, can you name the specific policy that permitted it and the agent attributes that triggered the match? (Pillar 2)
  3. Pick one production agent at random. Can you produce (from a system, not a wiki) its verified identity, registered owner, team, and granted permissions? (Pillar 3)
  4. Imagine your team deploys a brand-new agent tomorrow. Will your existing policies govern it automatically, or will someone need to update an allow-list? (Pillar 4)
  5. Open whatever dashboard your team uses to view agent activity. Does it show communication graphs and policy state visually, or are you grep-ing a log file? (Pillar 5)

Count your answers. Five Yes = Level 4. Mostly Yes, occasional Partial = Level 3. Yes on identity but No on policy enforcement = Level 2. Inventory only, no identity verification = Level 1. Couldn’t run the assessment because you don’t know what agents exist = Level 0.

If you scored below Level 3, you’re in the McKinsey two-thirds. The good news: you now know exactly which pillar to fix first.

The Accountability Maturity Model

The five pillars map to a five-level progression. Use it to track where you are today and where you’re heading.

Level State What you can do
Level 0:
Blind
No visibility You don’t know what agents exist in your network, let alone what they’re doing
Level 1:
Inventory
Awareness You know what agents exist, but not what they do, who they talk to, or what policies govern them
Level 2:
Authenticated
Identity verification Your agents have cryptographic identities, but communication is not yet governed by policy
Level 3:
Controlled
Policy enforcement You have policies governing agent communication, and unauthorized interactions are blocked
Level 4:
Accountable
Full accountability You can trace, prove, and audit every agent action — with authorization provenance, identity verification, and human oversight

Most enterprises today are at Level 0 or Level 1. They lack verified identities, policy enforcement, and end-to-end auditability. The goal is Level 4, and the gap between where most organizations are and where they need to be is the AI agent accountability crisis this framework addresses.

Common questions

What is the most important pillar of AI agent accountability?
All five are required, but authorization provenance is the one most enterprises miss. Plenty of teams can block unauthorized actions; very few can show why an authorized action was permitted, traceable to a specific policy. Without provenance, you have security but not accountability.

How is AI agent accountability different from observability?
Observability tells you what happened. Accountability tells you what was permitted, by which policy, and on whose authority. Observability is a prerequisite, but it’s not enough on its own; your trace data needs to be tied to policy decisions and identity claims to count as accountability.

How does AI agent accountability relate to AI agent security?
They’re complementary, not interchangeable. AI agent security focuses on preventing compromise—stopping prompt injection, blocking unauthorized API access, eliminating shadow agents. Accountability focuses on proving what authorized agents did and why. You need both: security keeps the bad agents out, accountability keeps the good agents honest. The five pillars in this framework assume strong AI agent security is already in place.

Can I assess my AI agent governance maturity using these pillars?
Yes — that’s exactly what the assessment and maturity model above are for. Walk through each pillar’s “test” with your team. If you can’t answer cleanly on all five, you’re at Level 3 or below, regardless of what tooling you’ve deployed.

Do I need all five pillars on day one?
No, but you need a path to all five. A platform that delivers two pillars natively and forces you to bolt on the other three is an accountability gap waiting to surface. We cover what to look for in future articles of this series.

What is the difference between Level 3 and Level 4 in the maturity model?
Level 3 means unauthorized interactions are blocked, you have policy enforcement. Level 4 means you can also prove why every authorized interaction was permitted, with audit evidence tied to a specific policy and identity. Level 3 is security; Level 4 is accountability.

Key takeaways

  • AI agent accountability rests on five pillars: traceability, authorization provenance, identity and ownership, policy at scale, and human oversight.
  • Each pillar has a clear test you can run against your environment today.
  • The five pillars map to a five-level Accountability Maturity Model — most enterprises are at Level 0 or 1.
  • Run the 5-question assessment with your platform, security, and compliance leads to score where you stand.
  • Missing any single pillar creates a gap that will surface during your next incident, audit, or regulatory review.

Get the strategic guide for accountable AI agents

We wrote a strategic guide for engineering and security leaders that goes deeper into each pillar, including detailed assessment questions, the full maturity model, and a practical roadmap to Level 4.

Accountable AI Agents: A Strategic Guide for AI & Security Leaders Governing Autonomous AI at Scale — no code, no product demos. Just the framework your leadership team needs.

Get the strategic guide for accountable AI agents →