AI Agent Failure Loops: When Persistence Becomes a Quality Bug
Gregory Shev · 2026-05-25 · via DEV Community

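In 2026, I want my AI coding agents to follow one more rule: know when to stop.

Agents do not only fail by stopping.

They also fail by continuing.

I ran into this while building a Cyrillic extension for a real brand font system. The task looked well defined: make Cyrillic letters, Latin letters, digits, and special symbols feel like one typographic family.

Claude Code and Codex kept working. They generated files, produced verification output, reported progress, and fixed whichever complaint had surfaced most recently.

But the same class of defect kept coming back.

This is the dangerous failure mode for AI agents: the process looks productive while the actual quality defect remains.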

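What is a failure loop?

A failure loop is a repeating pattern in which the agent keeps producing new fixes while the underlying problem stays unsolved.

It usually has five stages:

  1. The user rejects the same defect again.
  2. The agent patches the newest symptom.
  3. The verification gate is too weak to detect the problem.
  4. The agent asks for another manual review.
  5. Everyone goes through another round, still stuck on the old problem.

One mistake is normal.

The real bug appears when the agent keeps going after its verification system has already failed.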

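Why ordinary proof loops fail

Proof loops are very useful. Tests, screenshots, builds, lint checks, scores, and reports all matter.

But a proof loop can also become theater if it measures the wrong thing.

In my font project, the agent could prove that the font compiled, the PDF rendered, the screenshots were saved, the bounding boxes changed, and the numeric scores improved.

None of that proved that the letterforms looked right.

The human kept rejecting something else: the result looked wrong.

Some Cyrillic glyphs were short and heavy, others loose and scattered, and some were structurally off when set next to the Latin.

If the gate cannot see the defect the human keeps seeing, the gate does not get to declare the work done.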

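The rule I use now

When the same visible class of defect appears a second time, stop normal implementation.

Do not make another speculative patch.

Do not loosen the threshold.

Do not ask the human to look at yet another artifact.

Switch to failure-loop-breaker mode.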
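
The rule above can be sketched as a small counter keyed by defect class. This is only an illustration, not the published skill's implementation: `RejectionTracker`, `Mode`, the defect-class labels, and the default threshold of two are all assumptions made for the example.

```python
from collections import Counter
from enum import Enum


class Mode(Enum):
    NORMAL = "normal"
    LOOP_BREAKER = "failure-loop-breaker"


class RejectionTracker:
    """Counts user rejections per defect class (labels are hypothetical)."""

    def __init__(self, threshold: int = 2) -> None:
        self.threshold = threshold
        self.counts: Counter = Counter()

    def record(self, defect_class: str) -> Mode:
        """Record one rejection and return the mode the agent should be in."""
        self.counts[defect_class] += 1
        if self.counts[defect_class] >= self.threshold:
            # Same class rejected again: stop patching, start diagnosing.
            return Mode.LOOP_BREAKER
        return Mode.NORMAL


tracker = RejectionTracker()
first = tracker.record("cyrillic-glyphs-look-wrong")
other = tracker.record("pdf-margin")
second = tracker.record("cyrillic-glyphs-look-wrong")
print(first.value, other.value, second.value)
# normal normal failure-loop-breaker
```

The hard part in practice is deciding that two rejections belong to the same class; the sketch assumes a human or a reviewer supplies that label.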

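What is a failure-loop breaker?

A failure-loop breaker is a circuit breaker for agent work.

The better next output is a diagnostic packet, not another candidate fix.

It should contain:

  1. the repeated failure class;
  2. a rejection artifact that includes the known-bad example;
  3. a red gate that fails on that example first;
  4. the fix that turns the gate from red to green;
  5. blind or independent verification by a checker that has not seen the expected result;
  6. an explicit decision to continue, stop, or hand the choice to a human.

This is not just a retry limit.

A retry limit caps the waste. A failure-loop breaker changes the task.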
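
As a rough sketch, the six items could be carried in a single record that reports which of them are still missing. The class, field names, file path, and check name below are hypothetical and are not taken from the published skill.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Decision(Enum):
    CONTINUE = "continue"
    STOP = "stop"
    ASK_HUMAN = "ask-human"


@dataclass
class DiagnosticPacket:
    failure_class: str                    # 1. the repeated failure class
    rejection_artifact: str               # 2. the known-bad example (path or id)
    red_gate: str                         # 3. the check expected to fail on it
    gate_failed_on_artifact: bool         # 3. did the gate actually go red?
    fix: Optional[str] = None             # 4. change meant to turn red to green
    independent_verifier: Optional[str] = None  # 5. blind or independent check
    decision: Decision = Decision.ASK_HUMAN     # 6. continue, stop, or ask

    def gaps(self) -> List[str]:
        """Return the names of the items that are still missing."""
        gaps = []
        if not self.failure_class:
            gaps.append("failure_class")
        if not self.rejection_artifact:
            gaps.append("rejection_artifact")
        if not self.red_gate or not self.gate_failed_on_artifact:
            gaps.append("red_gate")
        if self.decision is Decision.CONTINUE:
            # Continuing is only justified with a fix and an outside check.
            if not self.fix:
                gaps.append("fix")
            if not self.independent_verifier:
                gaps.append("independent_verifier")
        return gaps


packet = DiagnosticPacket(
    failure_class="cyrillic-glyphs-look-wrong",
    rejection_artifact="rejected/specimen-03.png",  # hypothetical path
    red_gate="glyph-proportion-check",              # hypothetical check name
    gate_failed_on_artifact=False,                  # gate has not gone red yet
    decision=Decision.CONTINUE,
)
print(packet.gaps())
# ['red_gate', 'fix', 'independent_verifier']
```

A retry counter would only say "stop after N attempts"; a packet like this says what has to exist before another attempt is worth making.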

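The red-gate order matters

A useful gate must fail before the fix; otherwise it has not shown that it can detect the old defect.

If the agent cannot make the new check fail on the old bad artifact, it has not built a real verifier.

Many agent workflows skip this step.

They add a metric, see the new candidate score higher, and call it progress. The metric was never forced to reject the old failure.

This matters most for subjective or visual work, because the rejection artifact is the bridge between human taste and an enforceable check.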
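
The ordering can be made mechanical: run the gate on the rejected artifact before it is allowed to judge the new candidate. The sketch below is a minimal illustration; the artifact fields, the 0.03 tolerance, and the height-ratio check are invented stand-ins for whatever check really encodes the human's rejection.

```python
from typing import Callable, Dict

Artifact = Dict[str, float]
Gate = Callable[[Artifact], bool]  # True means the gate passes (green)


def evaluate(gate: Gate, known_bad: Artifact, candidate: Artifact) -> str:
    """Run the gate on the rejected artifact first, then on the candidate."""
    if gate(known_bad):
        # Green on something the human rejected: the gate proves nothing.
        return "gate-too-weak"
    if not gate(candidate):
        return "candidate-rejected"
    return "accepted"


def compiled_gate(a: Artifact) -> bool:
    # Weak gate: only checks that the font built.
    return a["compiled"] == 1.0


def proportion_gate(a: Artifact) -> bool:
    # Hypothetical stronger gate: Cyrillic height must match Latin closely.
    return abs(a["cyrillic_to_latin_height"] - 1.0) <= 0.03


known_bad = {"compiled": 1.0, "cyrillic_to_latin_height": 0.82}
candidate = {"compiled": 1.0, "cyrillic_to_latin_height": 0.99}

print(evaluate(compiled_gate, known_bad, candidate))    # gate-too-weak
print(evaluate(proportion_gate, known_bad, candidate))  # accepted
```

With the weak gate the candidate is never even scored, which is the point: a gate that stays green on the known-bad example has no standing to approve anything.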

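When the agent is contaminated

There is another trap: contaminated verification. The same agent writes the fix, knows the target, and grades the result.

That can be useful during iteration, but it is not independent verification.

If the agent has already seen the expected answer, final verification needs one of the following: a deterministic gate, held-out examples, a blind reviewer, a separate model that does not receive the author's reasoning, or a human decision when the criterion is taste rather than computation.

Same-author verification is often self-consistent. It is not proof.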
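
One way to make contamination visible is to record who verified and what they had already seen. The sketch below covers only the blind-reviewer and separate-model cases; deterministic gates and human taste decisions fall outside it. The `Verification` record and the agent names are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Verification:
    author: str                 # who wrote the fix
    verifier: str               # who or what judged the result
    saw_expected_answer: bool   # did the verifier know the target?
    saw_author_reasoning: bool  # did it receive the author's explanation?


def is_independent(v: Verification) -> bool:
    """True only if the verifier is a different party and was kept blind."""
    return (
        v.verifier != v.author
        and not v.saw_expected_answer
        and not v.saw_author_reasoning
    )


self_check = Verification("agent-a", "agent-a", True, True)
blind_review = Verification("agent-a", "model-b", False, False)
print(is_independent(self_check), is_independent(blind_review))
# False True
```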

I have published this as a small skill:

https://github.com/g-shevchenko/agent-failure-loop-breaker

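It installs a compact skill and sets up repo-level rules for Claude Code, Codex, Cursor, and Windsurf.

The installed rule is deliberately simple:

If the same class of defect recurs, the agent must stop normal patching, build the rejection artifact first, set up a red gate, and only then continue.

The package is not meant to make the model smarter.

It is meant to make the workflow less likely to confuse motion with progress.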

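The enterprise mistake

Teams often treat agent persistence as an asset. That is a common mistake.

Persistence works for well-scoped, well-tested implementation tasks, but it is risky for work whose acceptance criteria involve appearance, editorial judgment, architecture, or operations.

If Claude Code, Codex, Cursor, or Windsurf repeatedly fails the same kind of review, the next investment should go into the verification contract.

Even the best prompt in the world will loop if the gate rewards the wrong thing.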

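Where this helps

This pattern is useful wherever the same defect keeps recurring: UI polish loops, visual regression work, PDF and slide-deck generation, font systems, content QA, and agentic coding tasks.

The signal is this:

If the user says "this is still the same problem" a second time, the process should change.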

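Practical takeaway

Do not ask the AI to "keep trying" forever.

Ask it to prove that its checker can catch its last failed attempt.

If it cannot, the next task is not implementation.

The next task is to build a better gate.

Full article: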

https://gregshevchenko.com/notes/ai-agent-failure-loop-breakers/