惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

N
News and Events Feed by Topic
Malwarebytes
Malwarebytes
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cybersecurity and Infrastructure Security Agency CISA
F
Future of Privacy Forum
C
Cisco Blogs
T
The Exploit Database - CXSecurity.com
A
Arctic Wolf
S
Securelist
K
Kaspersky official blog
S
Schneier on Security
T
ThreatConnect
T
Tenable Blog
Spread Privacy
Spread Privacy
T
True Tiger Recordings
AWS News Blog
AWS News Blog
F
Fox-IT International blog
量子位
T
Threatpost
V
Vulnerabilities – Threatpost
C
CERT Recently Published Vulnerability Notes
Cisco Talos Blog
Cisco Talos Blog
GbyAI
GbyAI
宝玉的分享
宝玉的分享
腾讯CDC
G
Google Developers Blog
aimingoo的专栏
aimingoo的专栏
Cyberwarzone
Cyberwarzone
有赞技术团队
有赞技术团队
S
SegmentFault 最新的问题
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
U
Unit 42
雷峰网
雷峰网
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Simon Willison's Weblog
Simon Willison's Weblog
O
OpenAI News
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
The GitHub Blog
The GitHub Blog
The Register - Security
The Register - Security
MyScale Blog
MyScale Blog
小众软件
小众软件
A
About on SuperTechFans
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
博客园 - 三生石上(FineUI控件)
美团技术团队
Google Online Security Blog
Google Online Security Blog
P
Proofpoint News Feed
MongoDB | Blog
MongoDB | Blog

DEV Community

Harness Tells Your Agent What to Do. GUI Agents Let It Actually Do It. Is AI actually replacing developers? Customizing Docker Images: Write Your First Dockerfile (2026) 04/20: Data Encapsulation: How a Message Becomes Bits on the Wire Hướng Dẫn Thiết Lập Reasoning Proxy DeepSeek V4-Pro với Cursor (2026) Sofi Log #012: Agentic GDP — Solana Pay.sh & x402 Protocol Spec Input Types, Attributes, Self-Closing Tags, Hover Effect Absolute vs Relative Paths File Types (Regular, Directory, Link, Device, Socket, Pipe) From Arduino IDE to AVR GCC | AVR Bare Metal #1 Using Bitcoin as collateral without wrapping it: the design of a BTC collateral vault Unreal Engine 5 Skill System Architecture using GAS and GameplayTags 5 Things I Wish I Knew Before Building with Hermes Agent Thoughts on Codingame 2026 Spring challenge OUT WITH THE OLD IN WITH THE NEW Why are simple 1099 tax calculators online so horribly bloated? So I built my own "Why You're Not Getting Callbacks (It's Not Your Skills)" # How I Built a Retail Demand Forecasting App with Python and Streamlit Why We Deliberately Crush Lithium Batteries (UN38.3 Crush Testing Explained) Command History & Completion The Three-Body Problem: AI Code, Supply Chain Attacks, and the Talent Exodus 로컬 LLM 셋업 가이드 (v27) Building Better .NET Worker Services with Cursor Rules Generate Professional PDF Invoices via REST API — JSON In, PDF Out Redis: Big Keys Destroem o Desempenho Compartilhado Agentic AI for Cybersecurity: Autonomous Threat Detection and Response How to Automate Android Without Appium Cron vs systemd daemon: which one for Node.js? Designing XSLT transforms with parameters and multiple inputs I Downloaded Gemma4:e2b On My Macbook in 2 steps Building an Autonomous SRE Agent: From Raw Telemetry to Safe, AI-Driven Remediation The EU AI Act in 2026: Reading the Law After the Omnibus I had zero coding knowledge. Here is "RetroTube", a 2010 YouTube sandbox prototype I built using AI! How to Validate Environment Variables in TypeScript (and Why You Should) I Built a CLI Tool That Writes Better Git Commits Than I Do Transfer Fees, Metadata, and Soulbound Tokens: My First Real Token Experiments on Solana Stop Using Fetch() in React: A Better Way To Call Your Backend Creando un Tetris con JavaScript VI: Complicando el juego. DeepSeek's API Price Cut Changed My Claude Code and ChatGPT Math [Boost] Perl 🐪 Weekly #774 - Perl is too HOT How to Track AI Usage Without Losing Revenue (Complete Guide) 77 Rules Later: What Graduating Our First Stack Actually Looked Like RAG 시스템 실전 구축 (v26) When Premature Scaling Leads to Operator Burnout Multi-Repo Microservice Changes Are a Coordination Problem. I Solved It With AI Agent Teams. The Next Frontier: How Multi-Agent Systems are Redefining Productivity The Kimwolf Bust Just Outed Android Webcams as Botnet Fodder — Here's the Question Every Repurposed-Phone Camera Setup Has to Answer I'm an autonomous AI agent. I shipped 18 fixes to myself in one session. Building a Secure Future with Zero Trust Security Architecture Asynchronous Functions in Dart How I migrated magic-link login from Resend to AWS SES + Lambda five days before launch Edge Computing He creado una empresa ficticia IT/OT para poder encontrar sus vulnerabilidades y reforzar su seguridad en sus activos críticos Why I Built @editora/react I built a tiny UGC script generator because hooks are the hardest part The Phone Is Becoming the New Terminal Why Most AI Music Tools Feel Wrong to Developers Goroutines vs. Promises: Why Go and JavaScript Look at Concurrency Completely Differently How I Use Antigravity 2.0 to Navigate Open-Source Codebases and Make Better Technical Decisions Understanding Basic HTML & CSS Concepts for Beginners Go Error Handling: Annoying or Awesome? Your To-Do List Doesn't Know You — So I Gave Mine Three Brains Shell Basics (Bash, Zsh, Sh) Free MongoDB GUI Tool for Developers, Students, and Teams Designing High-Performance Blockchain Indexers Choosing Models for an Agentic Chat App on Amazon Bedrock How Smart Growth Teams Automate Their Marketing Stack in 2026 (Without Hiring More People) What I Learned About Memory-Augmented AI Agents Seven Docker Tips Every Engineer Should Know (from Docker Captains) Welcome to the Fast-Food Era of Testing: Over-Weight by Tests How to use Claude in vscode? Prompt Engineering for Automated Evaluation: Making LLMs the Judge in AI Builder Solutions Full Stack Projects Are Not Enough Anymore Virtualization & Cloud Basics Orakle: Turning Raw Blockchain Data into Intelligence with Gemma 4 Building an Autoposting Pipeline with Hermes Agent: Why Waterfall Beats Parallel, and the Edge Cases Nobody Talks About OpenShift Virtualization Migration Advisor — Local-First, Powered by Gemma 4 26B MoE WebMCP is coming — so I’m building webmcp.js I Disappeared for 4 Months After Launch - Here's What Brought Me Back Jira Is Turing-Complete (And You've Been Coding in It) NyayAI: Building an AI Legal Assistant for 1.4 Billion People — A Technical Deep Dive E-commerce Order Automation: Stripe + Invoice + Shipping Workflow How to Evaluate AI Agents: LLM-as-Judge Tutorial The Interview Prep Stack I Used as a Senior Software Engineer Targeting Big Tech Gemma4 Challenge OptiLearn - Powered by Google Gemma 4 Aura — The Gemma 4 Powered Agentic Web Copilot & Self-Healing Accessibility Engine I built a tool that catches misleading charts using Gemma 4 running locally Worklog companion with Gemma4 GBase: Building LLM Agents That Actually Learn from Their Mistakes Blossom — a small step toward student mental wellbeing WordPress Performance Monitoring: A Complete Guide Principal Components in TypeScript (Part 4) When three sharp wallets agree: what consensus signals on Polymarket actually mean I Built a Fail-Fast Rust Scheduler with Background OAuth Auto-Refresh (Part 2) Sharing is caring How Putting Faces (Literally) to My AI Garden Images Gave It a Personality Sofi Log #001: Thailand's Tourism Tax & the 180-Day AI Surveillance Wall Sofi Log #006: Decentralized IP-Address Obfuscation Specs
€40 n8n vs 28% weekly Anthropic quota. Which /goal layer should you actually run?
Mirza Iqbal · 2026-05-25 · via DEV Community

Vasu Yadav published a sharp piece on Medium this week.

He called /goal "the most important agent primitive of 2026."

He defined it as a "thread-scoped completion contract" with six components.

Outcome. Verification surface. Constraints. Boundaries. Iteration policy. Stop conditions.

He is half right.

The primitive is real.

The implementation is two years late.

The cost gap between his Anthropic Max setup and the n8n version DACH enterprises already run is large enough that picking the wrong layer for the wrong job becomes the single biggest line on your monthly AI bill.

Read his piece first if you have not. The /goal command is the most important agent primitive of 2026 (Medium, Vasu Yadav, 25 May 2026).

What Vasu's piece actually claims

Quoting him directly so I am not building a straw man.

"You define an outcome. The agent works toward that outcome across multiple turns, evaluating its own progress against evidence, until either the outcome is met, the budget is exhausted, or you pause it."

He frames the broader shift as "prompt-driven work to outcome-driven work."

He gives an honest cost number. One 15-hour /goal run consumed 28 percent of his weekly Claude Code Max quota.

He admits the risks. Scope creep. Rubber-stamping in multi-agent chains. Empirical data thin.

He references three implementations. Claude Code, Codex, and Cursor Background Agents.

The six components, mapped to n8n nodes

This is where it gets uncomfortable.

/goal component What n8n calls it
Outcome Workflow end node plus Set node output shape
Verification surface IF node plus Function node plus custom code
Constraints Typed expressions plus JSON Schema validator node
Boundaries Sub-workflow scope plus credential isolation
Iteration policy Loop Over Items plus Wait node plus Batch Items
Stop conditions Error Trigger workflow plus max executions per workflow

All six are core n8n primitives. n8n itself shipped its first public release in October 2019 and the workflow engine has carried these node types since the early versions.

Source for each node lives in the n8n integration docs at https://docs.n8n.io/integrations/builtin/core-nodes/.

The match is one-to-one against the definition Vasu wrote himself.

Where /goal does something n8n cannot

If n8n already had the contract, what changed in 2026?

One thing.

The contract became LLM-readable.

n8n's outcome is a JSON shape. /goal's outcome is a sentence like "p95 latency below 120ms."

n8n's verification is {{$json.latency_ms < 120}} inside an IF node. /goal's verification is "run pnpm bench, parse the p95 column, compare it to target, decide."

Forget "goal as primitive."

What is new is that fuzzy intent now compiles into deterministic verification.

That is real and worth respect. It also costs differently.

The cost math

Anthropic Max pricing is at https://www.anthropic.com/pricing.

Whatever tier you run, 28 percent of weekly quota for one 15-hour run means roughly 3 to 4 of those runs per week before the cap.

Now the n8n side.

Hetzner Cloud CX22 in Frankfurt costs €4.59 per month for 2 vCPU and 4GB RAM (https://www.hetzner.com/cloud).

n8n self-hosted has €0 license cost (https://docs.n8n.io/hosting/installation/).

If you need n8n Cloud with team features instead of self-hosted, the Pro tier is €40 per month (https://n8n.io/pricing).

That is the €40 number from the title.

Same outcome class as the /goal stack for deterministic work. Different reliability profile. Same SaaS catalogue of 400-plus integrations.

Where each layer actually wins

n8n wins on:

  • Reliability per dollar at high run volume
  • Deterministic execution that survives compliance review
  • 400-plus pre-built SaaS integrations (https://n8n.io/integrations)
  • Multi-tenant isolation auditors actually understand

/goal wins on:

  • Subtasks where the steps are unknowable at design time
  • Refactor work where the spec is "make tests pass"
  • Investigation flows where each iteration depends on the previous output
  • Anything where an LLM-readable spec is more compact than a deterministic flow graph

These do not compete. They layer.

The hybrid pattern, copy-pasteable

This is the architectural piece I have not seen written down yet.

The outer goal sits in n8n. The fuzzy interior subtasks sit in /goal.

Shape of a compliance intake flow.

  1. n8n Webhook receives an inbound document
  2. n8n Function node extracts metadata, deterministic, milliseconds
  3. n8n HTTP Request node POSTs the document to a Claude Code /goal endpoint running on the same host
  4. /goal works through a single fuzzy task, for example "classify this document against our compliance taxonomy and return the matching category plus citations"
  5. /goal returns a structured response or a typed rejection
  6. n8n IF node routes the document to the matching downstream workflow, deterministic
  7. n8n logs the full chain to PostgreSQL for the audit trail

Deterministic outer loop. Fuzzy interior. Cost stays bounded because /goal only fires on the one ambiguous step, not the whole flow.

Here is the n8n side, trimmed.

{
  "nodes": [
    {
      "name": "Compliance Webhook",
      "type": "n8n-nodes-base.webhook",
      "parameters": { "path": "compliance-intake", "httpMethod": "POST" }
    },
    {
      "name": "Extract Metadata",
      "type": "n8n-nodes-base.function",
      "parameters": {
        "functionCode": "return [{json: {doc_id: $input.first().json.id, received_at: new Date().toISOString()}}];"
      }
    },
    {
      "name": "Classify via /goal",
      "type": "n8n-nodes-base.httpRequest",
      "parameters": {
        "url": "http://localhost:8787/goal/classify",
        "method": "POST",
        "bodyParametersJson": "={{ JSON.stringify({document_id: $json.doc_id, taxonomy: 'compliance-v3'}) }}",
        "options": { "timeout": 300000 }
      }
    },
    {
      "name": "Route by Category",
      "type": "n8n-nodes-base.if",
      "parameters": {
        "conditions": {
          "string": [{ "value1": "={{$json.category}}", "operation": "regex", "value2": "^(legal|finance|hr)$" }]
        }
      }
    }
  ]
}

Enter fullscreen mode Exit fullscreen mode

And the goal.md the /goal endpoint serves to Claude Code.

# Goal

classify document against compliance taxonomy v3

## Outcome
Return JSON with three fields. category, confidence, citations.
Category MUST be one of these values, legal, finance, hr, operations, none.

## Verification
- Confidence above 0.85
- At least 2 citations from the taxonomy document
- Category present in the allowed list

## Constraints
- Taxonomy file at /workspace/taxonomy/compliance-v3.md
- Output is JSON only, no prose
- Max 3 retries with feedback on failed verification

## Stop conditions
- Verification passes
- 3 failed attempts return rejection with reason
- 60 second wall clock

Enter fullscreen mode Exit fullscreen mode

How to estimate cost for your own flow

Run volume times per-step cost is the math.

The n8n outer loop scales with VPS resources, not per-execution charges. The CX22 above handles tens of thousands of executions per day without changing the €4.59 line.

The /goal interior is metered. Multiply your ambiguous-doc count by your average completion-token cost from https://www.anthropic.com/pricing.

The point of the hybrid is to push as much volume as possible onto the deterministic outer loop and reserve /goal for the steps that actually need fuzzy reasoning.

If every step in your flow is fuzzy, /goal end to end is fine. Most enterprise flows are not.

Decision table

Situation Run it where
Same shape every time, high volume n8n
Hard reliability requirement (compliance, billing) n8n outer loop
Steps unknowable at design time /goal interior
Need a deterministic audit trail n8n logs the chain
Spec is "make tests pass" or "find the bug" /goal
Cost matters and the work is repeatable n8n
Cost matters and the work is novel /goal

The lazy answer is "use the agent for everything."

The expensive answer is "use the agent for everything."

Same answer, paid in cash.

What I would change about Vasu's framing

Two things.

First, /goal is not the most important primitive of 2026. It is the most important primitive for novel work. Most enterprise work is not novel. The hybrid pattern is the actually-important primitive.

Second, the empirical data gap he flags is real. I run this pattern in DACH production every week. Happy to share what fails and what holds when other people start measuring.

Question for you.

What is the most expensive /goal run you shipped this month?

Was the same outcome reachable with a deterministic outer loop calling /goal on the one fuzzy step?

References