Claude Code 2.1 Agent View & /goal: Autonomous Dev Guide 2026
Anup Karanjk · 2026-05-23 · via DEV Community

I let Claude Code run unsupervised for three hours last Wednesday. It wrote 847 lines of tests, debugged six failing tests, and stopped exactly when the conditions I set were met, without me touching the keyboard once. That is what the /goal command does. Combine it with Agent View, the new session dashboard that shipped in v2.1.139 on May 19, 2026, and you have the closest thing to a true autonomous coding partner the CLI has offered so far.

The short version: claude agents gives you a live dashboard of every session — running, blocked, or done. /goal <condition> sets a completion target and Claude loops across turns until it is met. Background sessions with --bg let those loops run without blocking your terminal. This combination turns Claude Code from an interactive assistant into something closer to a scheduled background worker that asks for help only when genuinely stuck.

What Shipped in v2.1.139 (And What Came After)

The May 19 release was dense. Agent View and /goal were the headliners, but the point releases that followed added features that change how you configure and automate agents. Here is the full picture:

| Version | Feature | What It Does |
| --- | --- | --- |
| 2.1.139 | Agent View, /goal | Session dashboard; completion-driven loops |
| 2.1.140 | Agent hook types | Stop/SubagentStop hooks for agent frontmatter |
| 2.1.142 | New session flags | --add-dir, --settings, --mcp-config, --permission-mode |
| 2.1.143 | Worktree isolation toggle | bgIsolation: "none" for repos where worktrees break |
| 2.1.144 | /resume in agent view | Background sessions appear with bg tag; resume by ID |
| 2.1.145 | claude agents --json | JSON output for tmux, status bars, scripting |
| 2.1.147 | /code-review | Renamed from /simplify; reports correctness bugs at chosen effort level |
| 2.1.149 | /usage breakdown | Per-category costs: skills, subagents, plugins, MCP |

The version cadence suggests Anthropic is shipping against real developer feedback. The worktree isolation toggle in 2.1.143, which adds bgIsolation: "none" for repos where git worktrees are impractical, looks less like a planned feature than a direct response to production blockers. Extending /resume to Agent View in 2.1.144 filled an obvious gap left by the initial launch. The pattern is consistent: ship the headline feature, then fill in the missing pieces quickly.

Agent View: The Session Dashboard You Have Been Waiting For

Run claude agents from any terminal and you get a single live view of every Claude Code session: which ones are running, which are blocked waiting for your input, and which have finished. The count of sessions awaiting input appears in your terminal tab title — meaning you can see at a glance how many agents need attention without switching context.

```bash
# Open the agent view dashboard
claude agents

# Scope it to a specific project directory
claude agents --cwd ./my-project

# Output as JSON for scripting (tmux, status bars, monitoring)
claude agents --json
```
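For the scripting case, here is a minimal Python sketch of how a status-bar summary could be derived from that JSON. The schema is an assumption on my part: I am guessing at a JSON array of objects with `name` and `status` keys, so inspect the real output of `claude agents --json` before adapting this. `summarize_sessions` and `status_line` are hypothetical helper names.

```python
import json
from collections import Counter

# ASSUMPTION: this schema (an array of objects with "name" and "status")
# is hypothetical; check the actual output of `claude agents --json`.
SAMPLE = """[
  {"name": "security-audit", "status": "running"},
  {"name": "weekly-audit", "status": "blocked"},
  {"name": "docs-pass", "status": "done"}
]"""


def summarize_sessions(raw_json: str) -> dict[str, int]:
    """Count sessions per status, tolerating entries without a status."""
    sessions = json.loads(raw_json)
    counts = Counter(s.get("status", "unknown") for s in sessions)
    return dict(counts)


def status_line(counts: dict[str, int]) -> str:
    """Render a compact one-line summary, sorted by status name."""
    return " ".join(f"{state}:{n}" for state, n in sorted(counts.items()))


print(status_line(summarize_sessions(SAMPLE)))  # blocked:1 done:1 running:1
```

In a real setup you would replace `SAMPLE` with the text piped in from the CLI and print the result into your tmux status bar or prompt.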

Inside the dashboard, four keyboard shortcuts do most of the work. Ctrl+T pins a background session — pinned sessions stay alive when idle and auto-restart after Claude Code updates instead of requiring manual relaunch. Ctrl+R renames a session to something human-readable. The left arrow key detaches or closes a session while preserving its state. And v opens the session directly in your editor.

The session state preservation is the part that matters most in practice. When you detach with the left arrow or background a session with /bg, it preserves the model, effort level, permission mode, MCP servers, and settings you configured. In earlier Claude Code versions, detaching and resuming a session sometimes reverted to defaults. That regression is gone as of 2.1.142.

One subtle improvement: sessions launched from Agent View respect the permissions.defaultMode from your settings.json. If your project settings say auto, agent view sessions inherit that without you passing a flag every time. Small change, large day-to-day impact.

/goal: Tell Claude When It Is Done

The /goal command is deceptively simple. You set a completion condition. Claude works across turns until that condition is met, then stops.

```text
# In an interactive Claude Code session
/goal Write unit tests covering 95% of the authentication module

# Compound conditions work too
/goal Implement OAuth2 login with Google and GitHub. All existing tests must pass. No TypeScript errors.
```

While a goal is active, a live overlay panel shows elapsed time, number of turns completed, and cumulative token count. The overlay runs in real time without blocking your ability to type — you can interact with the session while Claude is working if you need to redirect it or provide context it is missing.

What makes /goal different from just giving Claude a task in a single message is that it creates a controlled loop. Claude writes code, runs tests, inspects failures, revises the code, and runs tests again — automatically, across as many turns as the goal requires. The completion condition is checked after each cycle. This is not a new capability in the underlying model; it is a harness change that lets the model apply its existing tool-use and reasoning capabilities over a longer trajectory without requiring you to manually re-prompt it.
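To make the idea of a controlled loop concrete, here is a minimal Python sketch of a completion-driven loop. It is not how Claude Code implements /goal. `run_until`, `LoopResult`, and the `max_turns` budget are hypothetical names I am introducing for illustration, and the "work" is a toy stand-in that fixes one failing test per turn.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class LoopResult:
    met: bool   # True if the completion condition was satisfied
    turns: int  # number of work cycles actually run


def run_until(
    is_done: Callable[[], bool],
    do_turn: Callable[[], None],
    max_turns: int = 50,  # ASSUMPTION: a safety budget, not a documented /goal setting
) -> LoopResult:
    """Run work cycles until is_done() holds or the turn budget runs out."""
    turns = 0
    while turns < max_turns:
        if is_done():
            return LoopResult(met=True, turns=turns)
        do_turn()
        turns += 1
    # Budget exhausted: report whether the last turn happened to finish the job.
    return LoopResult(met=is_done(), turns=turns)


# Toy stand-in for "revise code, rerun tests": each turn fixes one failure.
failing = ["test_login", "test_logout", "test_refresh"]


def fix_one() -> None:
    failing.pop()


result = run_until(is_done=lambda: not failing, do_turn=fix_one)
print(result)  # LoopResult(met=True, turns=3)
```

The point of the sketch is the shape: the stop decision belongs to an explicit condition, not to whoever is doing the work deciding it has run out of obvious things to do.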

I used /goal to write a Zod schema validation test suite covering edge cases I had been putting off for weeks. The goal ran for 41 minutes and 23 turns. It found three input patterns my initial schemas did not handle and fixed them before stopping. Total cost: $0.43 in API tokens according to /usage. That 41 minutes would have been three to four hours of my time, spread across three separate mornings of interrupted work.

The comparison to point-and-shoot prompting is stark. "Refactor the auth module" stops when Claude runs out of obvious things to do. "/goal Auth module refactored with no TypeScript errors and 90%+ test coverage" stops when the work is actually done. That distinction is what makes /goal a fundamentally different interface, not just a convenience wrapper.

Background Sessions: The Full Control Surface

Background sessions are not new to Claude Code, but the 2.1.x releases added enough configuration surface that they now behave like first-class agent deployments rather than a niche CLI trick.

```bash
# Start a simple background session
claude --bg "Audit the codebase for security issues in the auth module"

# Fully configured background agent
claude --bg --name "security-audit" \
  --model claude-opus-4-7 \
  --effort high \
  --permission-mode auto \
  --settings ./.claude/settings.json \
  --mcp-config ./.claude/mcp.json \
  "Complete the security review and output findings to SECURITY_REPORT.md"

# Resume a background session interactively (run inside a Claude Code session)
/resume

# Resume by specific session ID (pass the ID after --resume)
claude --resume
```

The --fallback-model flag is worth knowing for production workflows. If the primary model hits a rate limit or quota boundary, Claude Code falls back to the specified alternative instead of failing hard:

```bash
claude --bg --model claude-opus-4-7 \
  --fallback-model claude-sonnet-4-6 \
  "Refactor the payment processing module"
```
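If the fallback behaviour is unfamiliar, the general pattern looks roughly like the Python sketch below. This illustrates the idea only and says nothing about Claude Code's internals: `RateLimited`, `call_with_fallback`, and `fake_run` are hypothetical, and the "primary model" is hard-wired to fail so the fallback path is visible.

```python
class RateLimited(Exception):
    """Hypothetical error raised when a model hits a rate limit or quota."""


def call_with_fallback(run, primary, fallback):
    """Try the primary model; on RateLimited, retry once with the fallback."""
    try:
        return primary, run(primary)
    except RateLimited:
        return fallback, run(fallback)


def fake_run(model):
    # Stand-in for a real request: the primary model is always throttled here.
    if model == "claude-opus-4-7":
        raise RateLimited(model)
    return f"refactor plan from {model}"


used, output = call_with_fallback(fake_run, "claude-opus-4-7", "claude-sonnet-4-6")
print(used, "->", output)
```

Only the rate-limit error triggers the fallback in this sketch. Any other failure still surfaces, which is usually what you want in an unattended run.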

For repositories where git worktrees cause problems — monorepos with symlinks, certain CI-generated layouts, some Windows configurations — set "worktree.bgIsolation": "none" in your settings.json. Background sessions will then edit the working copy directly instead of creating an isolated worktree. This unblocks a significant class of production use cases that were painful before 2.1.143.

Empty idle background sessions auto-retire after five minutes. Sessions that finish their work but leave a background shell running move to "Completed" rather than "Running". These distinctions matter when you are managing six sessions and need to know which ones are still doing something versus which ones have finished and are just consuming memory.

New Agent Flags: Configure Everything

The v2.1.142 release added a set of flags that turn what used to be informal defaults into explicit, preservable configuration. All of these work in both claude and claude agents:

```bash
claude --add-dir ../shared-lib           # Give Claude read access to a directory outside cwd
claude --settings ./.claude/prod.json    # Use project-specific settings file
claude --mcp-config ./.claude/mcp.json   # Attach specific MCP servers to this session
claude --plugin-dir ./.claude/plugins    # Load project-specific plugins
claude --permission-mode auto            # No prompts for safe file operations
claude --effort xhigh                    # Maximum reasoning depth for trust-boundary tasks
claude --dangerously-skip-permissions    # Full bypass (use only in sandboxed environments)
```

The --add-dir flag solves a recurring pain point. Claude Code previously could not read files outside the current working directory without you changing directories first. With --add-dir, you can give an agent access to a shared library, a design system package, or a documentation directory outside the project root — without restructuring your project layout to accommodate it.

For multi-agent orchestration, these flags let you configure each dispatched subagent independently. A lead orchestrator running at --effort high can dispatch subagents with lower effort for mechanical tasks like formatting or renaming, controlling cost across the entire system. For patterns on structuring these systems, the multi-agent coordination guide covers orchestrator-worker architectures in detail.
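One way to picture per-worker configuration from a script is to build each command line separately. This assumes you launch every worker as its own background session, which the setup above does not require. The Python sketch below uses only flags quoted in this article. The task list, the `bg_command` helper, and the `low` effort value are my own assumptions, and the commands are printed, not executed.

```python
import shlex


def bg_command(name, prompt, model, effort, permission_mode="auto"):
    """Build the argv for one background session using flags from this article."""
    return [
        "claude", "--bg",
        "--name", name,
        "--model", model,
        "--effort", effort,
        "--permission-mode", permission_mode,
        prompt,
    ]


# Hypothetical tasks: more reasoning for the review, less for the mechanical rename.
# ASSUMPTION: "low" is a valid --effort value; the article only shows high and xhigh.
tasks = [
    ("security-review", "Review the auth module for trust-boundary issues",
     "claude-opus-4-7", "high"),
    ("rename-pass", "Rename getUser to fetchUser across src/",
     "claude-sonnet-4-6", "low"),
]

for name, prompt, model, effort in tasks:
    # Print the shell-quoted command instead of running it.
    print(shlex.join(bg_command(name, prompt, model, effort)))
```

Keeping the argv as a list until the last moment avoids quoting bugs when a prompt contains spaces or quotes.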

Three Workflows That Changed How I Work

1. The Background Audit

Every Friday, I run a background audit against the week's merged PRs. The agent reads the diff, checks for patterns from our CLAUDE.md security rules, and writes findings to a WEEKLY_AUDIT.md file. I come back Monday to a report rather than a vague sense that I should probably review something.

```bash
claude --bg --name "weekly-audit" \
  --model claude-opus-4-7 \
  --effort high \
  --permission-mode auto \
  "Read all changes merged this week. Cross-reference against .claude/rules/trust-boundary.md. Write WEEKLY_AUDIT.md with findings ranked by severity. Flag any payment or auth path changes explicitly."
```

2. The Completion Gate

Before any PR goes out, I set a /goal with the exact criteria from our definition of done. Not a checklist I run through manually. Not a reminder to check TypeScript. A condition Claude satisfies or keeps working:

```text
/goal All TypeScript type errors resolved. All unit tests pass. No console.log statements remaining. ESLint reports zero errors on changed files.
```

Claude works through the list systematically. It does not stop until every condition is true. This replaced my pre-PR checklist habit, which I was reliably skipping under deadline pressure.

3. The Pinned Session

I keep one pinned session open in Agent View for the project I am actively shipping. It holds project context, MCP server connections, and permission settings — configured once, preserved across updates. When I need Claude for something quick, I jump into the pinned session instead of starting a cold context window. Auto-restart on updates means I get new Claude Code features without losing the session configuration.

The /usage Command: Real Cost Visibility

Starting in v2.1.149, /usage breaks down your token costs by category: skills, subagents, plugins, and MCP server calls. Before this update, the aggregate cost was visible but opaque — you could not tell whether a session's cost was driven by MCP tool calls, recursive subagent spawning, or the main conversation. That distinction matters when you are optimizing a multi-agent workflow for cost efficiency.

For MCP-heavy workflows, the MCP servers guide covers patterns for keeping call counts reasonable. An agent that calls an MCP server 50 times in a session is often doing something that could be batched into 3–5 calls with better tool design. /usage makes that pattern visible instead of hiding it inside an aggregate number.
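The batching point is easy to show with plain Python. In this sketch `lookup_many` is a hypothetical tool that resolves a list of ids in one call, standing in for whatever batch endpoint your MCP server could expose. Nothing here talks to a real server.

```python
from typing import Iterable, Iterator


def batched(items: Iterable[str], size: int) -> Iterator[list[str]]:
    """Yield successive chunks of at most `size` items."""
    chunk: list[str] = []
    for item in items:
        chunk.append(item)
        if len(chunk) == size:
            yield chunk
            chunk = []
    if chunk:
        yield chunk


def lookup_many(ids: list[str]) -> dict[str, str]:
    # Hypothetical batch tool: one call resolves a whole list of ids.
    return {i: f"record-{i}" for i in ids}


ids = [str(n) for n in range(50)]
calls = 0
records: dict[str, str] = {}
for chunk in batched(ids, 10):
    records.update(lookup_many(chunk))
    calls += 1

print(calls, len(records))  # 5 50
```

Fifty single-item lookups collapse into five calls with the same result set, which is the kind of difference the per-category /usage numbers should make visible.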

What to Do Right Now

Update to v2.1.149 or later: npm update -g @anthropic-ai/claude-code. Then run claude agents and look at what is actually running. Most developers are surprised to find sessions they backgrounded and forgot. Pin the ones that matter with Ctrl+T so they survive the next update cycle.

For any task you expect to take more than 20 minutes, set a /goal with the completion criteria instead of writing a task description. The difference in outcome is consistent enough that I now default to /goal for anything beyond a single-file edit.

If you are building production workflows with background agents, read the new flag documentation before committing to an architecture. The --permission-mode, --fallback-model, and --add-dir flags together create a configuration surface that was not possible before May 2026. The agents you can build now are meaningfully more reliable than what shipped in April.

The CLAUDE.md templates, agent skill starters, and MCP server boilerplates for building these workflows are available at wowhow.cloud — pay once, configure once, ship reliably.

Originally published at wowhow.cloud