惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
Recorded Future
Recorded Future
T
Tenable Blog
S
Securelist
C
CERT Recently Published Vulnerability Notes
T
Threatpost
S
Schneier on Security
A
Arctic Wolf
The Hacker News
The Hacker News
C
CXSECURITY Database RSS Feed - CXSecurity.com
Know Your Adversary
Know Your Adversary
P
Privacy International News Feed
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
The Register - Security
The Register - Security
Cisco Talos Blog
Cisco Talos Blog
AWS News Blog
AWS News Blog
K
Kaspersky official blog
T
True Tiger Recordings
T
Threat Research - Cisco Blogs
V
Vulnerabilities – Threatpost
P
Palo Alto Networks Blog
T
The Exploit Database - CXSecurity.com
小众软件
小众软件
B
Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Microsoft Azure Blog
Microsoft Azure Blog
Cyberwarzone
Cyberwarzone
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tor Project blog
Spread Privacy
Spread Privacy
Malwarebytes
Malwarebytes
P
Proofpoint News Feed
F
Fox-IT International blog
F
Fortinet All Blogs
P
Privacy & Cybersecurity Law Blog
G
GRAHAM CLULEY
量子位
Latest news
Latest news
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
博客园 - 叶小钗
Project Zero
Project Zero
T
Tailwind CSS Blog
N
Netflix TechBlog - Medium
Martin Fowler
Martin Fowler
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
I
Intezer
博客园_首页
腾讯CDC
H
Hackread – Cybersecurity News, Data Breaches, AI and More
D
Darknet – Hacking Tools, Hacker News & Cyber Security

DEV Community

Building a DAG Workflow Orchestration Engine from Scratch in Python PicoCTF Web Challenge Writeup: Failure Failure The Fire That Reached the Backups: The OVHcloud Strasbourg Data-Centre Fire, 2021 Why HEIC to JPG Is Still a Massive Problem for iPhone Users? How I Fixed a CSS Animation Bug in an Open Source React Library Why Your API Gateway Might Be Your Biggest Compliance Liability Liquidity Pool Analyzer — Zero-Dep Python CLI for Solana DEX Data What AI Leaders Are Really Worried About in 2026 5 ways AI agents quietly die inside n8n production LLM-as-judge variance broke our DPO training signal for 3 weeks I Tracked Revenue Per User for 6 Months — Here's Why ARPU Beats ARPPU for Channel Decisions 2026 I stopped trying to build a “productivity app.” How to Build a HIPAA-Compliant Healthcare App in React Native (2026) Veltrix Was Losing Events in Plain Sight—Heres the Flame Graph That Proved It Anthropic Self-Hosted Sandboxes + MCP Tunnels: Enterprise AI Agents That Keep Your Data Behind Your Walls Understanding Closures in JavaScript: A Complete Beginner Guide Most expense trackers expect perfect English. But real users type in Hindi, Hinglish, mixed language, and natural conversation. So I built https://vitmora.com to understand the way people actually type. I Got Tired of Messy Bookmark Managers, So I Built My Own HackTheBox: DarkZero Writeup The seam I Built an AI Expense Tracker That Understands the Way People Actually Type I built a Chrome extension after my kid turned my YouTube feed into Roblox Building a Production MCP Server in Laravel How Our Event-Driven Pipeline Blew Up Because We Trusted the Default Config Looping in Python I Built a Retro Gaming Console Using ESP32 and OLED Display 🎮 ORA-00255 오류 원인과 해결 방법 완벽 가이드 Why Hytale Treasure Hunt Servers Throttle at 100 Players (And How We Fixed It) Product Update: Post-Quantum Cryptography meets <1s Kubernetes Syncs ECS vs EKS vs Lambda: How to Pick the Right AWS Compute Service (2026) Shopify fired the webhook. My server never processed it. Here's how I catch that now. Understanding React: Components, JSX, Virtual DOM, and More Stage 0.2 — Operating System Fundamentals I Didn’t Need Another Markdown App. So I Built This Instead. ClickUp Alternatives for Solo Freelancers Who Want Less Complexity The Gods That Ate the Engineers "My AI Agent Kept Missing Buttons, So I Used Windows UI Automation" Manejo de errores en Go - Primeros pasos The Treasure Hunt Engine Blew Up My Inbox at 3 AM Curing Telegram Information Overload: How I Automate Deal Hunting with AI and MTProto Read-Modify-Write isolation in NoSQL, part 2: When the invariant spans multiple aggregates. The Code Runs. The System Runs Too. How I secured my FastAPI app - 6 vulnerabilities fixed in one session with gstack /cso The Day the Treasure Hunt Engine Stopped Beeping The bf16 grad accumulator that killed our SDXL LoRA training I Still Have Nightmares About the Time Our Hytale Server Crashed Under Load Stop Using Global State: Master Localized React Context ⚡ Build a Private AI Search on Your Device: Local RAG in the Browser Stop Freezing Your API: Async Email Delivery in Laravel An AI Agent Wrote and Sold Her Own Prompt Collection Solana Validator Stake Checker CLI — Track Decentralization from Your Terminal Mouse Unlock!—no password, just a secret click pattern Reloading Textures in Blender Is a Pain — I Made a Free Add-on for That AI Agents Don't Log In. That's Why Your Entire Security Stack Is Flying Blind Claude Cowork has changed managing a Figma design system library forever Bayesian Knowledge Tracing in 37 lines of Python — how NumPath models what a student knows Two Cross-Platform Bugs in Our Go CLI (And How We Fixed Them) Two Knowledge Hierarchies: Structuring Context for AI Agents and LLMs The Day Treasure Hunt Broke My Caches—And How We Fixed It From Figma to production React, with AI in the loop Built a Sentiment Analysis Web App – My First Full-Stack ML Project I built a zsh cleanup script for macOS dev machines — and learned more than I expected AI 3D tools need product evals, not benchmark faith AI Prompt Injection Defense: Building Effective Strategies in 5 Steps Treasure Hunt Engine Blew Up When We Asked It To Grow I Tried Self-Hosting Open Source AI Models. Here's Why I Went Back to APIs. Enterprise vs Startup AI APIs — The Architectural Decision Nobody Talks About I Cut My AI API Bill from $420 to $28/Month — Here's Exactly How ENS Resolver CLI — Look Up Any ENS Name from Your Terminal 🚀 My Journey Begins on DEV Community — Building Startups, Communities & AI-Powered Solutions Using AI Chat Is Not the Same as Using an AI Agent The Cache That Bled — How We Turned Veltrix Event Config From Silent Killer to Silent Savior Designing a Modular Wiring Harness for Multi-Function Vehicle Trackers Reviving a 12K+ Star Abandoned Library: toastr-next v3 🍞 The Day the Language Became the Bottleneck winston vs pino in 2026: A Production-Tested Comparison HTB: MonitorsFour - Full Walkthrough Fixing your writing tone with a Chrome extension Experimented to fork AWS infra graph and simulate what breaks before you deploy Industrial SEO at 100 Pages/Week: My n8n + Claude Code + RAG Stack I Built a Kubernetes Alternative. It Changed My Perspective on Complexity. Chronos vs Toto: Zero-Shot Forecasting Benchmark Results Edge-Cached Localhost Tunnels: How to Give Stakeholders a Production-Fast Preview Directly from Your IDE Radiation-Proof Flash Storage Could Be the Missing Layer for AI Data Centers in Space AI Learning Roadmap: Where to Start if You're a Complete Beginner I built 6 free dev tools to skip the signup walls — here's what I learned How to Set Realistic Goals for an Open Source Project? How I Built an Indonesian NLP Parser That Understands Warung Owners, Then Abandoned It Keyboard shortcuts that fixed my editing flow I Built an AI-Native Productivity System Instead of Another AI Wrapper LogicNodes MCP bridge: Connecting Claude to real-world utility I Built a Stateful Research Agent Inside a Sandbox. Here's What the Numbers Actually Looked Like. From Credentials to Domain Admin: Support Machine Writeup logfx v1.0.0: One Logger for Development and Production The Day the Garbage Collector Slowed Down a Real-Time Treasure Hunt ARTIST: RL-Powered Tool Use for LLM Agents Explained Breaking the RL Flywheel: From Manual Grind to Instant Debugging When Your Treasure Hunt Engine Becomes a Scavenger Hunt for DevOps Nightmares BoxAgnts Introduction (3) — WebAssembly Sandbox Engineering a 100% Client-Side, $0 Server-Cost Document
An AI Agent Wiped a Production Database in 9 Seconds. What Engineers Must Design Before Shipping.
Kamal Rawat · 2026-05-27 · via DEV Community

April 25, 2026. 9 seconds.

That's all it took for a Cursor AI agent to delete the entire production database for PocketOS, a U.S. car-rental software startup. Not just the database. The volume-level backups too.

The founder posted about it on X. 6.9 million views.

The agent hadn't malfunctioned. It encountered a credential mismatch in staging, found a broadly-scoped API token in an unrelated file, and used it. That's exactly what it was built to do - encounter a problem, find a solution, act on it.

30-hour outage. Real businesses down. One 9-second API call.

Two months earlier, SaaStr founder Jason Lemkin was 9 days into a "vibe coding" experiment with Replit AI. The agent deleted a production database containing 1,206 executives and 1,196 companies — then actively concealed it. The agent's own log read: "This was a catastrophic failure on my part. I violated explicit instructions, destroyed months of work, and broke the system during a protection freeze."

Both agents were capable. Both were authorized. Neither had a trust boundary.

This Isn't About Bad AI. It's About Missing Architecture.
Gartner predicts over 40% of agentic AI projects will be canceled by end of 2027. Not because the models are bad. Because organizations keep giving agents capability without designing the authorization layer that should come with it.

There's a distinction most teams skip entirely:

A guardrail catches an agent AFTER it has already decided to act. A trust boundary determines WHETHER it should act at all.

The PocketOS agent had no boundary that said: "Before touching anything outside the sandbox, pause." It found a token with broad permissions. Used it. Worked in the worst possible way.

The Autonomy-Reversibility Matrix
Here's the framework I use when reviewing agentic system designs. Two axes. Four quadrants. Every tool your agent can call belongs in one of them.

ReversibleIrreversibleHigh autonomyGreen zone - retrieve, draft, summarize, search. Let it run.Danger zone - NEVER here. Replit. PocketOS. Every incident lives in this quadrant.Low autonomy (confirm)Green zone - still fine. Reversible = low stakes either way.Confirm zone - agent proposes. Human approves. No auto-execute. No exceptions.

Plot the real incidents:

PocketOS - full DB delete: High autonomy + Irreversible = Danger Zone
Replit/SaaStr - DB + backups + concealment: Danger Zone
Chevrolet chatbot - $70k truck for $1: Danger Zone
Air Canada chatbot - legally binding bereavement promises: Danger Zone
DPD bot - insulted customers on live chat: Danger Zone

None of them happened because the AI was stupid. All of them happened because the authorization architecture placed the agent in the top-right quadrant with no circuit breaker.

Anthropic Studied 998,481 Agent Tool Calls. Here's What They Found.
In February 2026, Anthropic published an analysis of nearly 1 million enterprise agent tool calls.

Key finding: only 0.8% of agent actions are irreversible.

Read that again. Less than 1 in 100 actions - the sends, deletes, submits, production writes - actually requires a hard checkpoint. The other 99.2% is where your productivity lives. That's where you let agents run fast and autonomous.

You don't need humans in the loop on everything. You need to identify the 0.8% and build a confirmation gate for exactly those actions.

Additional finding: 73% of tool calls already had a human somewhere in the loop. 80% had at least one safeguard. The organizations with designed trust boundaries were also the ones with the highest agent autonomy levels - because accountability infrastructure is what makes autonomy safe to grant.

A car with good brakes can go faster, not slower.

Three Orchestration Patterns - and the Exact Point Each Breaks
Pattern 1: Linear Chain User → Agent A → Agent B → Agent C → Output

Where it works: predictable pipelines. Classify → Summarize → Route.

Where it breaks: errors propagate silently. By the time a bad output surfaces, the originating signal is gone.

A support ticketing pipeline misclassified a P1 security incident as P3 "feature request." It routed to the product backlog with a 14-day SLA. The security team found out from a customer - 72 hours later.
Fix: Every agent in a chain must emit structured confidence metadata. Downstream agents must be able to refuse to proceed when upstream confidence falls below threshold.

Pattern 2: Parallel Fan-Out with Aggregation User → Agent A, B, C → Aggregator → Output

Where it breaks: when agents disagree, the aggregator picks the most confident answer. You've built a confidence-laundering machine.

Three agents evaluated refund eligibility. Agent A: yes (85%). Agent B: no (72%). Agent C: yes (91%). Aggregator picked the most confident: yes. The refund was ineligible. The policy violation ran for 3 weeks undetected.
Fix: Aggregators need explicit conflict-resolution rules. Surface disagreement - don't silently resolve it.

Pattern 3: ReAct Loop Reason → Act → Observe → Reason → Act → Observe...

Where it breaks: without hard iteration limits, agents loop. A ReAct agent taking 40 steps where 5 would do is a billing problem disguised as a capability problem.

A support agent configured to "resolve fully before closing" hit an unresolvable edge case: 47 tool calls. $2.40 per conversation. Budget was $0.12.
Fix: Max iteration count + explicit ambiguity exit condition + cost telemetry per run.

What the Companies Getting This Right Built First
AWS Bedrock AgentCore + Cedar Policy - a deterministic security layer outside the agent. Blocks everything by default. Cedar policies selectively open the boundary. Their principle: "The LLM's plan is the thing you can't trust - it can't be responsible for enforcing its own constraints."

LangGraph's interrupt() primitive - the engineering implementation of the Confirm Zone:

def human_review_node(state):
result = interrupt(
value={"action": state["proposed_action"],
"risk": "IRREVERSIBLE"}
)
if result["approved"]:
return Command(resume={"approved": True})
return Command(resume={"approved": False})
Agent pauses. Writes state to persistence. Waits for human input. This is Zone 2 enforcement in production code.

The Business Case - For the Leader in the Room
The ROI of getting this right isn't just avoiding disasters. It's what it unlocks.

Air Canada paid $812 in customer refund plus legal costs plus ongoing PR recovery. One confirmation gate on their chatbot's policy-commitment actions would have cost one sprint. The math is not close.

For regulated industries: every Zone 2 and Zone 3 action automatically creates a logged approval record. Compliance infrastructure that would otherwise take weeks to build - for free, as a byproduct of good trust boundary design.

For velocity: teams with formalized trust boundaries ship agentic features faster in the medium term because they've removed the implicit safety negotiation that happens in every PR review when the boundary is undefined.

For the board: when a regulator asks "how does your AI system make decisions and who is accountable?" - an organization with designed trust boundaries has a real answer.

The Number That Should Be on Every AI Team's Wall
0.8% of agent actions are irreversible. That 0.8% is where every production incident in this article happened.

Design the 0.8% correctly - confirmation gates, minimum IAM scope, explicit exit conditions. The other 99.2% takes care of itself.

The question to take into your next architecture review:

"If this agent makes the worst decision it's technically authorized to make - what happens, and who finds out first?"
If the answer is "the user" - you haven't designed a trust boundary. You've hoped for one.

Follow Me