The "MTTR Is All You Need" Trap
Amar Gupta · 2026-05-25 · via DEV Community

There is a specific moment in a system's life when the dashboards still look green, the test suite is still passing, the bug report rate is still falling — and the codebase has already become something no human in the room actually understands.

Mitchell Hashimoto called this out yesterday in a thread that has now passed 487,000 likes. He named it "AI psychosis" — entire companies operating under the implicit belief that "MTTR is all you need," that it's fine to ship bugs because the agents will fix them so quickly. His warning is sharper than the usual AI-skeptic line: "you can automate yourself into a very resilient catastrophe machine."

I have been shipping production agents for the last six months — Setu, Sandesh, Swayam, Sankalp, a Sutra desktop middleman, all stitched together with MCP tools and a Claude Code instance per channel. The agents write a fair amount of the code. They also reach into the database, fire scheduled routines, post to LinkedIn on a cron. Mitchell's tweet hit me harder than I expected, because the trap he is describing is the exact one I have had to defend against, more than once, in a stack that is mostly me and the model.

Here is what the trap actually looks like from inside the code, and the three disciplines I have ended up trusting.

What "globally incomprehensible" looks like in practice

The first time I felt it was when a routine fired at 8:30 AM and silently posted nothing. The dashboard said "ran successfully." The logs said "ran successfully." The next routine fired at 2 PM and did the same. By evening I had three "successful" runs and zero output. The cron was healthy. The MCP server was healthy. The Realtime broadcast was healthy. Each individual subsystem was passing its own test.

The bug was an SSE bridge that re-subscribed to the wrong channel on restart. Each piece was locally correct; the system was globally lying. No agent could have found that bug by reading the green checks. I found it by sitting with three terminals open for an hour and watching what the broadcast actually carried versus what the bridge actually filtered. The fix took five lines. The diagnosis took ninety minutes.

If I had been operating under "MTTR is all you need," I would have shipped the next ten routines, the bridge would have collected ten more silent misses, and by the time anyone noticed, the failure surface would be a different shape entirely. That is the catastrophe machine. The MTTR for each individual incident keeps falling. The system keeps becoming less describable.
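The diagnosis above came down to comparing what was broadcast against what the bridge let through. Here is a minimal sketch of that comparison as an automated check. `Event`, `Bridge`, `silent_misses`, and the channel names are hypothetical stand-ins for illustration, not the actual bridge code.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    channel: str
    payload: str


class Bridge:
    """Hypothetical stand-in for the SSE bridge: forwards only its subscribed channel."""

    def __init__(self, channel: str) -> None:
        self.channel = channel
        self.delivered: list[Event] = []

    def restart(self, channel: str) -> None:
        # A restart re-subscribes. Passing the wrong name reproduces the bug class.
        self.channel = channel

    def on_broadcast(self, event: Event) -> None:
        if event.channel == self.channel:
            self.delivered.append(event)


def silent_misses(broadcast: list[Event], bridge: Bridge, expected_channel: str) -> list[Event]:
    """Events broadcast on the expected channel that the bridge never delivered."""
    expected = [e for e in broadcast if e.channel == expected_channel]
    return [e for e in expected if e not in bridge.delivered]


def run_demo() -> tuple[int, int]:
    bridge = Bridge("posts")
    sent = [Event("posts", "08:30 routine")]
    bridge.on_broadcast(sent[0])
    misses_before = len(silent_misses(sent, bridge, "posts"))

    bridge.restart("post")  # wrong channel after restart (assumed typo for illustration)
    later = Event("posts", "14:00 routine")
    sent.append(later)
    bridge.on_broadcast(later)
    misses_after = len(silent_misses(sent, bridge, "posts"))
    return misses_before, misses_after
```

Neither side raises an error in this sketch: the broadcaster sends, the bridge filters, and both would report healthy. Only the cross-check between the two notices that an event went missing.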

Discipline 1: Keep the failure mode in writing

Every multi-hour debug session in this stack ends with two artifacts: a fix, and a paragraph in a CLAUDE.md somewhere that says "this looks like a normal X, it is actually a Y, here is the tell." If I do not write that paragraph, the model will rediscover the failure mode the next time it touches that code, and I will pay the diagnosis cost again.

The agent is genuinely fast at fixing bugs. It is not faster than me at remembering bugs I have already seen. That asymmetry is the whole game.
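One way to make that paragraph cheap enough to always write is a tiny helper that appends it in a fixed shape. This is a sketch under assumptions: the function name `record_failure_mode`, the three-field wording, and the temporary file location are all hypothetical, and the real notes may well be written by hand.

```python
from __future__ import annotations

import tempfile
from pathlib import Path


def record_failure_mode(notes_path: Path, looks_like: str, actually: str, tell: str) -> str:
    """Append one failure-mode paragraph to a notes file and return the paragraph."""
    entry = f"This looks like {looks_like}. It is actually {actually}. The tell: {tell}"
    with notes_path.open("a", encoding="utf-8") as notes:
        notes.write(entry + "\n\n")
    return entry


def demo() -> str:
    # Writes to a throwaway directory so the sketch never touches a real CLAUDE.md.
    with tempfile.TemporaryDirectory() as tmp:
        notes_path = Path(tmp) / "CLAUDE.md"
        record_failure_mode(
            notes_path,
            looks_like="a healthy cron run that posted nothing",
            actually="an SSE bridge subscribed to the wrong channel after a restart",
            tell="the broadcast carries the event but the bridge never forwards it",
        )
        return notes_path.read_text(encoding="utf-8")
```

The fixed "looks like / actually / tell" shape matters more than the helper itself: it forces the note to record the misleading symptom, not just the fix.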

Discipline 2: Treat "the test suite passes" as evidence about the test suite, not the system

Mitchell's complaint about declining bug reports as a metric lands here. In a stack where the agent writes most of the code AND most of the tests, a green suite tells you the agent's model of correctness is internally consistent. It does not tell you that the model's idea of correctness matches yours.

I now require the test failure modes to be designed by me even when the test code is written by the agent. What can break? What would it look like when it breaks? Which specific user surface goes silent? Those questions are mine to answer before the agent writes a single expect. The agent then implements; I verify the failure space, not just the test code.
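As an illustration of designing the failure space first, the sketch below contrasts a status-only check with a check on the user surface. `Outbox`, `run_routine`, and both check functions are hypothetical names invented for this example. The routine deliberately reports success even when it posts nothing, which is the silent failure described earlier.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Outbox:
    """Hypothetical stand-in for the user-visible surface: posts that actually went out."""

    posts: list[str] = field(default_factory=list)


@dataclass
class RunResult:
    status: str


def run_routine(outbox: Outbox, draft: Optional[str]) -> RunResult:
    """Hypothetical routine that reports success even when there was nothing to post."""
    if draft:
        outbox.posts.append(draft)
    return RunResult(status="ran successfully")


def check_status_only(result: RunResult) -> bool:
    # What a green dashboard verifies: the run finished and said it was fine.
    return result.status == "ran successfully"


def check_surface(outbox: Outbox, expected_posts: int) -> bool:
    # What the human-designed failure mode verifies: the surface did not go silent.
    return len(outbox.posts) == expected_posts


def demo() -> tuple[bool, bool]:
    outbox = Outbox()
    result = run_routine(outbox, draft=None)  # upstream produced nothing
    return check_status_only(result), check_surface(outbox, expected_posts=1)
```

In the demo the status check passes while the surface check fails. The second assertion only exists because someone asked "which surface goes silent?" before any test code was written.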

Discipline 3: Read the diff before you run it, even when it is small

The cheapest, most boring discipline. The agent will hand you a 12-line patch that looks obviously correct, and 11 of those lines actually are. The 12th will quietly drop a WHERE user_id = ... because the agent's mental model of the function didn't carry that constraint forward. Reading 12 lines costs forty seconds. Recovering from a forgotten WHERE clause in production costs a weekend.

This is the one rule that comes through most cleanly in Hashimoto's follow-up: when someone asked what to do instead, he answered in five words: "Think (use AI, but think)." That is not anti-AI. It is anti-handing-the-system-to-the-AI-and-stepping-away.
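Reading the diff stays a human job, but a crude pre-read check can point at the line most worth reading. The sketch below flags removed lines that contain a WHERE and do not reappear verbatim among the added lines. The function name, the sample diff, and the `drafts` table are hypothetical. It is a heuristic over unified-diff text, not a SQL parser, so it is no substitute for the forty seconds of reading.

```python
from __future__ import annotations

import re

WHERE_RE = re.compile(r"\bwhere\b", re.IGNORECASE)


def dropped_where_lines(unified_diff: str) -> list[str]:
    """Removed lines mentioning WHERE that are not re-added verbatim."""
    removed: list[str] = []
    added: list[str] = []
    for line in unified_diff.splitlines():
        # Skip file headers. Heuristic: a removed line that itself starts
        # with "--" would also be skipped here.
        if line.startswith(("---", "+++")):
            continue
        if line.startswith("-"):
            removed.append(line[1:].strip())
        elif line.startswith("+"):
            added.append(line[1:].strip())
    added_with_where = {text for text in added if WHERE_RE.search(text)}
    return [
        text
        for text in removed
        if WHERE_RE.search(text) and text not in added_with_where
    ]


# Hypothetical patch in which the user_id constraint is quietly lost.
SAMPLE_DIFF = """\
--- a/queries.py
+++ b/queries.py
@@ -1,3 +1,3 @@
 def delete_drafts(conn, user_id):
-    conn.execute("DELETE FROM drafts WHERE user_id = ?", (user_id,))
+    conn.execute("DELETE FROM drafts")
"""
```

A modified WHERE clause is flagged too, since the old line no longer appears verbatim. That is intentional in this sketch: any change to a constraint is a line worth reading slowly.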

Why this matters for an Indian indie builder

The Indian SaaS layer is being told the loudest version of the AI-native story right now. Layoffs are being narrated as agent-substitution. Funding decks have "AI-native" headers that two years ago said "mobile-first." Founders running with two engineers and one agent will absorb the "MTTR is all you need" mindset by osmosis, because it sounds like leverage.

It is leverage, right up until it isn't. The companies that quietly survive this cycle will be the ones that kept human comprehension as the gating function — not the ones that optimized for time-to-merge. The dashboards will lie convincingly until they don't, and the recovery cost from a globally incomprehensible system is not three engineers; it is a rewrite.

Use the agent. Keep the failure modes in writing. Read the diff. The trap is real, and the way out of it is unglamorous in exactly the way the Silicon Valley pitch deck never admits.