惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

aimingoo的专栏
aimingoo的专栏
量子位
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
S
Schneier on Security
Cisco Talos Blog
Cisco Talos Blog
T
ThreatConnect
J
Java Code Geeks
博客园 - 司徒正美
A
Arctic Wolf
T
True Tiger Recordings
C
Cybersecurity and Infrastructure Security Agency CISA
Cyberwarzone
Cyberwarzone
Know Your Adversary
Know Your Adversary
T
Threat Research - Cisco Blogs
V
Vulnerabilities – Threatpost
Recorded Future
Recorded Future
P
Palo Alto Networks Blog
The Hacker News
The Hacker News
The Register - Security
The Register - Security
S
Securelist
www.infosecurity-magazine.com
www.infosecurity-magazine.com
C
CXSECURITY Database RSS Feed - CXSecurity.com
Application and Cybersecurity Blog
Application and Cybersecurity Blog
I
Intezer
P
Privacy & Cybersecurity Law Blog
Scott Helme
Scott Helme
K
Kaspersky official blog
博客园 - 聂微东
Last Week in AI
Last Week in AI
V
V2EX
小众软件
小众软件
F
Fox-IT International blog
Martin Fowler
Martin Fowler
Apple Machine Learning Research
Apple Machine Learning Research
T
Tenable Blog
F
Future of Privacy Forum
Microsoft Security Blog
Microsoft Security Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
腾讯CDC
Stack Overflow Blog
Stack Overflow Blog
C
Check Point Blog
阮一峰的网络日志
阮一峰的网络日志
GbyAI
GbyAI
T
Threatpost
I
InfoQ
P
Proofpoint News Feed
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
T
Tor Project blog
G
GRAHAM CLULEY
D
DataBreaches.Net

DEV Community

Designing Configuration for Scalable Treasure Hunts SSH Login Delays: The 10-Second Wait That Drives Us Crazy A 3-layer memory system that gives Claude Code persistent context across sessions. Trishul SNMP Suite 2.0.1: Better MIBs, Traps, and SNMP Labs How I built a production AI SaaS as a solo developer Auto-labelling 1.2M robotics frames with VLMs: a failover story India’s Laws Were Not Built for AI — And Courts Are Filling the Gap skill-insp: A Skill That Scores Other Skills Clprolf Minimalist Messaging in the Age of AI What's actually in a good .cursorrules file? I built 10 of them — here's what I learned Building Strong Python Basics – Loops, Functions and Logic How to Choose the Right Tech Stack for Your Project I built a free multi-tab JSON editor — here's what I learned HTTP Headers Every Developer Should Know (2026) Building Cross-Platform Digital Products: Challenges and Best Practices Data Privacy in the Age of AI: How Product Teams Can Build Trust with Users What Would WordPress Look Like If It Were Designed Today? Why Backup Success Does Not Mean Database Recoverability Local AI Office Assistant That Never Sends Your Documents to the Cloud Building TaskForge: Translating Enterprise Chaos into an Open-Source Scheduler Tesla P40 in a Homelab: 24GB of Inference on a Budget Llama 4: Meta's Latest — Scout, Maverick, and the MoE Revolution George Hotz called AI code 'slop.' He's half right. Como Construir um Fluxo de Trabalho Baseado em Engenharia de Prompt e Automação We Audited Our Agent Tool-Call Traces. Half Our Eval Data Was Garbage. The Hidden Cost of Downtime: How SRE Error Budgets Protect National Economic Infrastructure Getting started with openHUMANS can be an exciting venture for developers looking to create innovative applications in the realm of human-ce Stack Overflow: A Powerful Community for Developers and Learners From Language Models to Humanoid Minds ✨ Road to Senior #2: How Computers Think in Numbers Why LLM debugging fails on fragmented repository context How to Deploy a LangGraph Agent on AWS Bedrock AgentCore An outreach kit for solo founders whose drafts can't hallucinate Open Satchel is live Amy Kwalwasser and the Growing Importance of Quantum Risk Modeling I Built ShellReq - A Native API Client for VS Code & Terminal If Microsoft and Uber can't afford AI coding, what chance do the rest of us have? MADCAP: Building a Multi-Agent Debate CLI That Argues With Itself So You Don't Have To Why most AI fails at IDOR (and how AMAS fixes it with causal reasoning) How to Audit a Laravel Codebase You've Inherited LangGraph 워크플로우 템플릿 (v34) BugBench: a developer origin story and practical guide for VS Code / Kiro users A solution to messy token systems for Next.js A NestJS reference app that proves the nest-native stack under realistic backend pressure Observability for AI Systems: Monitoring Drift, Hallucinations, and Reliability in Production I Thought “Data Analyst” Was the Whole Game… Then I Entered the Data Avengers Office 👀 Create and configure network security groups How to analyze the cost of Kafka? How I Shipped 2,500+ Commits With AI Agents Using a 12-Phase Workflow [Boost] We built MDCMS, a Markdown-first CMS for teams using AI agents Zero Heap Allocations at 1.18 GB/s: Deep Dive into ForgeZero 4.0.x The Minimum Viable Test Suite for Working with Agents Why Perplexity Started Citing My Blog: 5 Changes That Actually Worked Sync Supabase via OAuth: No Connection String Needed I asked three AI models the same API question. Only one had it right. Implementing Saga Pattern With Lambda Durable Function Why does AI forget what you said (and how to fix it) I built a daily Wordle-style game for AI tools - Here's how Mapping Polish company structures: querying KRS direct via API Built tmpdrop — a tiny self-hosted ephemeral file drop Running Local LLM - 0$ Personal Agentic AI Assistant - Part 3 LLD Object-Oriented Design: Interfaces & Abstract Classes (Designing Contracts) The Smaller Ship: Vitalik, the Ethereum Foundation's Restructuring, and What It Leaves for Investors Looking for 4 people to build something weird with me Building a Local-Only RAG System with Ollama and TypeScript The False Positive Tax: a 1:1 TP:FP analysis of eslint-plugin-security What's new in Data Preprocessor 1.5.x — R codegen, Robust Scaler, and a deadlock post-mortem How I self-hosted my Flask app on an old laptop for almost free I built a free DSA interview prep site because I was tired of the existing options I built an AI agent that migrates Next.js Pages Router to App Router Prisma Query Logging and PostgreSQL: Where the ORM Ends and the Database Begins Prisma query logging y PostgreSQL: dónde termina el ORM y empieza la base From Browser to Server : The Journey of an HTTP Request (Demystifying the Web’s Infrastructure) Santa Augmentcode Intent Ep.6 I Benchmarked 17 ESLint Security Plugins. Only One Found Every Vulnerability. How to Build a High-Performance Image Optimization Pipeline in 5 Minutes 50 Linux Commands Every DevOps Engineer Must Know Less Toil, More Flow - Automating the Path from Request to Implementation The Code Review Checklist I Actually Use How I run a small blog on Astro 5 + Content Collections Git: Best Practices for Professionals How IBM Bob Became My Everyday Coding Companion Solana Passkey Wallet: Replacing Seed Phrases with SIMD-0075 I built a small browser puzzle game about arrows I wrapped Claude Code in a zsh function. Here's every decision I almost got wrong. Mobile Game Optimization: A Unity Developer's Checklist Git: Best Practices for Beginners Three days I lost chasing a ghost that was already dead on disk Why Too Many Parts Hurt ClickHouse Performance Guardrails for Agent Output: Pluggable Validation Before and After LLM Calls Gemma Forge: Local AI Without the Setup Wall From Half‑dead Prototype to Local‑Only AI Medical Assistant: Rewiring MedClinic with GitHub Copilot Runninig a forkbomb in Jenkins What’s Actually Happening When You Use Git Preventing Recursive Tool Loops in LangChain Agents Building a Rock-Paper-Scissors CLI with TypeScript — Union Types, Conditionals, and Jest Your AI Coding Agent Wastes 80% of Its Context. Fixed That with Graph Theory. Why Flutter Has Become the Go-To Framework for Fintech App Development
Building Production Multi-Agent Workflows in n8n: What 50 Deployments Taught Us
Ankit Dhiman · 2026-05-26 · via DEV Community

Most n8n AI workflow tutorials end at "it worked in testing." The gap between a demo and a production system handling 10,000 items/day with real money on the line is where the interesting problems live.

At Chronexa, we've built 50+ multi-agent workflows for fintech compliance teams, legal document processing, AI SDR engines, and RAG-powered research assistants. Here's what we've learned about making them reliable.


1. Design Failure as a First-Class Concern

Most n8n tutorials wire main[0]\. Production workflows wire main[0]\ and main[1]\.

Every HTTP Request node and AI node has two outputs in n8n: success (main[0]\) and error (main[1]\). Leaving the error branch unwired means failures disappear silently — you only find out when a client notices something is wrong three days later.

The pattern we use on every deployment:

\
HTTP Request → main[0] → continue workflow
→ main[1] → DLQ Sheet + Slack Alert
\
\

Set onError: 'continueErrorOutput'\ on every AI and HTTP node. Wire main[1]\ to:

  • A Dead Letter Queue (DLQ) Google Sheet or Baserow table with the failed item, timestamp, and error message
  • A Slack alert to the ops channel with the item ID and a link to the DLQ row

Never rely on a global workflow-level error trigger as a substitute for node-level error routing. The global trigger fires when the whole workflow crashes — but you want to capture partial failures item-by-item, not lose an entire batch.

Why this matters: On one fintech client's AML monitoring workflow, we caught 847 failed enrichment calls in the first week that would have silently dropped cases. The DLQ made every failure visible and recoverable.


2. HITL — The Pattern That Makes AI Output Trustworthy

Fully automated AI workflows fail silently in high-stakes contexts. Claude occasionally generates wrong company names, incorrect figures, or fabricated URLs. Without a human checkpoint, those errors reach customers.

The HITL (Human-in-the-Loop) pattern:

\
AI Node → Append to Review Sheet (status: "Pending")
→ Wait for Webhook
→ [Human reviews, sets status to "Approved" or "Rejected"]
→ Approved: continue workflow
→ Rejected: route to revision sub-workflow
\
\

Implementation in n8n:

  1. After AI generation, write output to a Google Sheet / Baserow row with a "Status" column set to "Pending Review"
  2. Use a Wait node configured to resume on webhook
  3. Set up a sheet trigger or webhook that fires when Status changes to "Approved"
  4. Add a 24-hour timeout check — if a row sits Pending too long, Slack-alert the reviewer

When to use HITL: Any workflow where AI output is customer-facing, regulatory, or financial. Skip it for internal data transformation pipelines where errors are low-stakes.

Our AI SDR engine uses HITL for outbound email review. SDRs spend 45 minutes/day approving emails instead of 6 hours writing them — the workflow does the research and drafting, a human does the final check. Reply rates went from 2.1% to 6.8%.


3. Memory Management for Long-Running Agents

Window Buffer Memory

Best for conversational agents where recency matters. Set window size to 10–20 messages — beyond 20, you're paying for context that rarely helps.

RAG over Static Documents

When your agent needs to reference a knowledge base (contracts, policies, product docs), vector retrieval beats pumping the full document into context every time.

Setup: Pinecone or pgvector + n8n's Embeddings node + Information Retrieval chain. Cost difference at scale: a 50-page policy document passed to every query costs ~$0.08/query at Claude Sonnet pricing. RAG retrieval of 3 relevant chunks costs ~$0.004/query — 20x cheaper at volume.

Session Keys for Multi-User Deployments

This is the one that bites people most often. If the same workflow handles multiple concurrent users with the default session ID, memory from User A bleeds into User B's conversation.

Fix — scope session ID to a user identifier from the webhook payload:

\javascript
sessionId: {{ $('Webhook').item.json.userId }}
\
\

We've seen this misconfiguration cause a support bot to answer one user's question with another user's account details.


4. Rate Limiting, Backoff, and Concurrency

Three failure modes that will bite you in production:

1. API Rate Limits (OpenAI/Anthropic)

For bulk workflows processing hundreds of items, rate limits hit fast. Use n8n's built-in Retry on Fail — set max retries to 3 with exponential backoff. For sustained bulk processing, add a Wait node between AI calls.

2. Webhook Concurrency

n8n's default webhook concurrency is 5 simultaneous executions. For AI workflows where each execution makes multiple LLM calls, 5 concurrent workflows can spike to 50 simultaneous API calls.

Fix: set maxConcurrency: 2\ on webhook triggers for AI-heavy workflows. It creates a queue rather than dropping requests.

3. Downstream API Timeouts

HTTP Request nodes have a 30-second default timeout. If your workflow calls slow external APIs, you'll see phantom failures. Set explicit "timeout": 60000\ on slow-API nodes, and wire the error output so timeouts go to the DLQ.


5. The Production Checklist We Use Before Every Deployment

  • [ ] Error output (main[1]\) wired on every HTTP Request and AI node
  • [ ] DLQ sheet created and connected to error outputs
  • [ ] Slack alert configured on failure with item ID and error details
  • [ ] saveSuccessfulExecution: false\ set for high-volume workflows (prevents DB bloat)
  • [ ] HITL step added for any customer-facing or regulatory output
  • [ ] Session ID scoped to user/item (not default) for multi-user agents
  • [ ] Rate limit buffer added — Wait node or Retry on Fail with backoff
  • [ ] maxConcurrency\ set to 2 on webhook triggers for AI workflows
  • [ ] Tested with 10× expected volume before go-live
  • [ ] errorWorkflow\ field set to centralized error handler

Conclusion

The difference between an n8n demo and a production system is entirely in how you handle the 10% of cases that don't go right. Designing failure handling as a first-class architectural concern, adding HITL for trust, and managing memory and concurrency carefully is what separates a reliable automation from a liability.

If you're building multi-agent workflows for real business use cases, start with the error output. Everything else follows from there.


Ankit Dhiman is the founder of Chronexa, an AI automation agency that builds custom n8n workflows for mid-market B2B companies. We've open-sourced our workflow templates at github.com/Chronexa/chronexa-n8n-workflows.