惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

aimingoo的专栏
aimingoo的专栏
量子位
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
S
Schneier on Security
Cisco Talos Blog
Cisco Talos Blog
T
ThreatConnect
J
Java Code Geeks
博客园 - 司徒正美
A
Arctic Wolf
T
True Tiger Recordings
C
Cybersecurity and Infrastructure Security Agency CISA
Cyberwarzone
Cyberwarzone
Know Your Adversary
Know Your Adversary
T
Threat Research - Cisco Blogs
V
Vulnerabilities – Threatpost
Recorded Future
Recorded Future
P
Palo Alto Networks Blog
The Hacker News
The Hacker News
The Register - Security
The Register - Security
S
Securelist
www.infosecurity-magazine.com
www.infosecurity-magazine.com
C
CXSECURITY Database RSS Feed - CXSecurity.com
Application and Cybersecurity Blog
Application and Cybersecurity Blog
I
Intezer
P
Privacy & Cybersecurity Law Blog
Scott Helme
Scott Helme
K
Kaspersky official blog
博客园 - 聂微东
Last Week in AI
Last Week in AI
V
V2EX
小众软件
小众软件
F
Fox-IT International blog
Martin Fowler
Martin Fowler
Apple Machine Learning Research
Apple Machine Learning Research
T
Tenable Blog
F
Future of Privacy Forum
Microsoft Security Blog
Microsoft Security Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
腾讯CDC
Stack Overflow Blog
Stack Overflow Blog
C
Check Point Blog
阮一峰的网络日志
阮一峰的网络日志
GbyAI
GbyAI
T
Threatpost
I
InfoQ
P
Proofpoint News Feed
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
T
Tor Project blog
G
GRAHAM CLULEY
D
DataBreaches.Net

DEV Community

The N+1 Query That Killed Our Database, And How I Fixed It Docstrings vs Markdown Docs: What Should Developers Actually Write? Training Data Provenance: The Manifest Diff That Explains the Hash Add SVGIcons MCP to Claude Code and Find SVG Icons from Your Terminal 3 CLI Tools You Can Buy with Crypto — No KYC, No Subscriptions COSS Weekly: OpenClaw competitor NanoClaw Raises $12M, Dust Raises $40M, Sonar Acquires Gitar, and more How to know if you actually need mobile proxies (without buying any) Building Cursor for Community: A Buildathon Built on Time Pressure How we built a PII masking layer for LLM APIs — local detection, reversible tokens, one line to integrate Why MLFQ Was Way Ahead of Its Time I Built a Prompt Injection Detector with 98% Recall on Unseen Attacks. Here's Why Data Beat Architecture. 8 Vite Config Options Every Developer Should Know (Vite 8) Feature Flags That Forgot to Leave Why Trust Infrastructure Is Becoming the Hidden Layer of Donation Platforms XyPriss: Rethinking Core Performance and Zero-Trust Architecture in Modern Backends Designing Configuration for Scalable Treasure Hunts SSH Login Delays: The 10-Second Wait That Drives Us Crazy Building Production Multi-Agent Workflows in n8n: What 50 Deployments Taught Us A 3-layer memory system that gives Claude Code persistent context across sessions. Trishul SNMP Suite 2.0.1: Better MIBs, Traps, and SNMP Labs How I built a production AI SaaS as a solo developer Auto-labelling 1.2M robotics frames with VLMs: a failover story India’s Laws Were Not Built for AI — And Courts Are Filling the Gap skill-insp: A Skill That Scores Other Skills Clprolf Minimalist Messaging in the Age of AI What's actually in a good .cursorrules file? I built 10 of them — here's what I learned Building Strong Python Basics – Loops, Functions and Logic How to Choose the Right Tech Stack for Your Project I built a free multi-tab JSON editor — here's what I learned HTTP Headers Every Developer Should Know (2026) Building Cross-Platform Digital Products: Challenges and Best Practices Data Privacy in the Age of AI: How Product Teams Can Build Trust with Users What Would WordPress Look Like If It Were Designed Today? Why Backup Success Does Not Mean Database Recoverability Local AI Office Assistant That Never Sends Your Documents to the Cloud Building TaskForge: Translating Enterprise Chaos into an Open-Source Scheduler Tesla P40 in a Homelab: 24GB of Inference on a Budget Llama 4: Meta's Latest — Scout, Maverick, and the MoE Revolution George Hotz called AI code 'slop.' He's half right. Como Construir um Fluxo de Trabalho Baseado em Engenharia de Prompt e Automação We Audited Our Agent Tool-Call Traces. Half Our Eval Data Was Garbage. The Hidden Cost of Downtime: How SRE Error Budgets Protect National Economic Infrastructure Getting started with openHUMANS can be an exciting venture for developers looking to create innovative applications in the realm of human-ce Stack Overflow: A Powerful Community for Developers and Learners From Language Models to Humanoid Minds ✨ Road to Senior #2: How Computers Think in Numbers Why LLM debugging fails on fragmented repository context How to Deploy a LangGraph Agent on AWS Bedrock AgentCore An outreach kit for solo founders whose drafts can't hallucinate Open Satchel is live Amy Kwalwasser and the Growing Importance of Quantum Risk Modeling I Built ShellReq - A Native API Client for VS Code & Terminal If Microsoft and Uber can't afford AI coding, what chance do the rest of us have? MADCAP: Building a Multi-Agent Debate CLI That Argues With Itself So You Don't Have To Why most AI fails at IDOR (and how AMAS fixes it with causal reasoning) How to Audit a Laravel Codebase You've Inherited LangGraph 워크플로우 템플릿 (v34) BugBench: a developer origin story and practical guide for VS Code / Kiro users A solution to messy token systems for Next.js A NestJS reference app that proves the nest-native stack under realistic backend pressure Observability for AI Systems: Monitoring Drift, Hallucinations, and Reliability in Production I Thought “Data Analyst” Was the Whole Game… Then I Entered the Data Avengers Office 👀 Create and configure network security groups How to analyze the cost of Kafka? How I Shipped 2,500+ Commits With AI Agents Using a 12-Phase Workflow [Boost] We built MDCMS, a Markdown-first CMS for teams using AI agents Zero Heap Allocations at 1.18 GB/s: Deep Dive into ForgeZero 4.0.x The Minimum Viable Test Suite for Working with Agents Why Perplexity Started Citing My Blog: 5 Changes That Actually Worked Sync Supabase via OAuth: No Connection String Needed I asked three AI models the same API question. Only one had it right. Implementing Saga Pattern With Lambda Durable Function Why does AI forget what you said (and how to fix it) I built a daily Wordle-style game for AI tools - Here's how Mapping Polish company structures: querying KRS direct via API Built tmpdrop — a tiny self-hosted ephemeral file drop Running Local LLM - 0$ Personal Agentic AI Assistant - Part 3 LLD Object-Oriented Design: Interfaces & Abstract Classes (Designing Contracts) The Smaller Ship: Vitalik, the Ethereum Foundation's Restructuring, and What It Leaves for Investors Looking for 4 people to build something weird with me Building a Local-Only RAG System with Ollama and TypeScript The False Positive Tax: a 1:1 TP:FP analysis of eslint-plugin-security What's new in Data Preprocessor 1.5.x — R codegen, Robust Scaler, and a deadlock post-mortem How I self-hosted my Flask app on an old laptop for almost free I built a free DSA interview prep site because I was tired of the existing options I built an AI agent that migrates Next.js Pages Router to App Router Prisma Query Logging and PostgreSQL: Where the ORM Ends and the Database Begins Prisma query logging y PostgreSQL: dónde termina el ORM y empieza la base From Browser to Server : The Journey of an HTTP Request (Demystifying the Web’s Infrastructure) Santa Augmentcode Intent Ep.6 I Benchmarked 17 ESLint Security Plugins. Only One Found Every Vulnerability. How to Build a High-Performance Image Optimization Pipeline in 5 Minutes 50 Linux Commands Every DevOps Engineer Must Know Less Toil, More Flow - Automating the Path from Request to Implementation The Code Review Checklist I Actually Use How I run a small blog on Astro 5 + Content Collections Git: Best Practices for Professionals How IBM Bob Became My Everyday Coding Companion
Add Runtime Limits to Claude Agent Workflows
Joakim Willi · 2026-05-26 · via DEV Community

Joakim William Hauge

Add Runtime Limits to Claude Agent Workflows

One of the fastest ways autonomous workflows become unstable in production is not model quality.

It’s unconstrained execution.

A Claude-powered workflow starts normally:

  • retrieve context
  • call tools
  • reason
  • retry

Then suddenly:

  • retries compound
  • context expands
  • tool usage escalates
  • latency spikes
  • execution drifts indefinitely

The workflow technically remains “alive.”

Operationally:
it stopped making meaningful progress a long time ago.

This article shows a simple way to add runtime limits to Claude agent workflows using TypeScript.

No complex orchestration required.


Why Runtime Limits Matter

Most AI workflows behave normally most of the time.

The problem comes from edge cases:

  • recursive retries
  • runaway tool chains
  • unstable recovery behavior
  • non-converging reasoning loops
  • escalating context windows

A small percentage of unstable runs can consume disproportionate amounts of:

  • inference cost
  • latency
  • compute
  • operational attention

Especially in:

  • autonomous workflows
  • long-running agents
  • multi-step orchestration systems

This is where runtime limits become important.


The Goal

We want lightweight operational boundaries like:

```ts id="jlwm4"
{
maxRuntimeMs: 30000,
maxSteps: 15,
maxToolCalls: 10
}




Once execution exceeds those boundaries:

* workflows interrupt safely
* retries stop compounding
* latency remains bounded
* economic exposure stays predictable

Think of it as:



```txt id="0jlwm4"
bounded execution for autonomous systems

Enter fullscreen mode Exit fullscreen mode


Step 1 — Track Runtime State

We’ll maintain a lightweight execution context:

```ts id="1jlwm4"
type ExecutionState = {
startedAt: number;
steps: number;
toolCalls: number;
};




Initialize it:



```ts id="2jlwm4"
const state: ExecutionState = {
  startedAt: Date.now(),
  steps: 0,
  toolCalls: 0
};

Enter fullscreen mode Exit fullscreen mode


Step 2 — Define Runtime Limits

Now define simple operational constraints:

```ts id="3jlwm4"
const LIMITS = {
maxRuntimeMs: 30_000,
maxSteps: 15,
maxToolCalls: 10
};




These values do not need to be perfect initially.

The important thing is:



```txt id="4jlwm4"
execution becomes bounded

Enter fullscreen mode Exit fullscreen mode


Step 3 — Create a Runtime Guard

Now create a simple runtime enforcement layer:

```ts id="5jlwm4"
function enforceRuntimeLimits(
state: ExecutionState
) {
const runtimeMs =
Date.now() - state.startedAt;

if (runtimeMs > LIMITS.maxRuntimeMs) {
throw new Error(
"Runtime limit exceeded"
);
}

if (state.steps > LIMITS.maxSteps) {
throw new Error(
"Execution step limit exceeded"
);
}

if (state.toolCalls > LIMITS.maxToolCalls) {
throw new Error(
"Tool invocation limit exceeded"
);
}
}




This becomes your:

## runtime governance layer.

---

# Step 4 — Wrap Workflow Execution

Now enforce limits during execution:



```ts id="6jlwm4"
while (true) {
  enforceRuntimeLimits(state);

  const response =
    await claudeAgent.run();

  state.steps += 1;

  if (response.usedTool) {
    state.toolCalls += 1;
  }

  if (response.done) {
    break;
  }
}

Enter fullscreen mode Exit fullscreen mode

That’s it.

Now your workflow has:

  • bounded runtime
  • bounded execution depth
  • bounded tool usage

Why Simple Limits Work Surprisingly Well

A lot of teams initially assume they need:

  • advanced anomaly detection
  • reinforcement learning
  • sophisticated telemetry pipelines

But simple operational constraints already eliminate many expensive failure modes.

Especially:

  • retry storms
  • recursive loops
  • unstable tool churn
  • non-converging execution

You do not need perfect intelligence initially.

You need:

operational boundaries.


Production Improvements

The minimal example above works surprisingly well, but production systems usually add:

  • token velocity monitoring
  • recursion detection
  • semantic retry analysis
  • adaptive thresholds
  • tenant-specific budgets
  • escalation policies
  • execution tracing

For example:

```txt id="7jlwm4"
search
→ retry
→ search
→ retry
→ retry




is often more dangerous operationally than:



```txt id="8jlwm4"
search
→ summarize
→ respond

Enter fullscreen mode Exit fullscreen mode

even if both technically “work.”


Why This Looks Familiar

Distributed systems evolved similar operational primitives over decades:

  • retry limits
  • timeout controls
  • circuit breakers
  • bounded failure domains

Why?

Because eventually:
unconstrained execution became dangerous at scale.

Autonomous AI systems are beginning to encounter the same operational reality.


The Shift Toward Runtime Governance

Most AI infrastructure today focuses heavily on:

  • observability
  • tracing
  • replay systems
  • prompt analytics

These tools answer:

```txt id="9jlwm4"
“What happened?”




Runtime governance answers:



```txt id="10jlwm4"
“What should be allowed to continue happening?”

Enter fullscreen mode Exit fullscreen mode

That distinction matters enormously.

Because by the time runaway execution appears inside dashboards:

  • compute may already be burned
  • latency may already have degraded UX
  • retries may already have cascaded

Visibility without intervention eventually becomes incomplete.


Final Thoughts

The current AI ecosystem focuses heavily on:

  • smarter models
  • larger context windows
  • better reasoning
  • more autonomous agents

But long-term production systems will likely depend just as much on:

  • bounded execution
  • runtime governance
  • operational predictability
  • constrained failure behavior

Because eventually:
the challenge is not simply building autonomous workflows.

It is building governable autonomous workflows.