惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

N
News and Events Feed by Topic
Malwarebytes
Malwarebytes
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cybersecurity and Infrastructure Security Agency CISA
F
Future of Privacy Forum
C
Cisco Blogs
T
The Exploit Database - CXSecurity.com
A
Arctic Wolf
S
Securelist
K
Kaspersky official blog
S
Schneier on Security
T
ThreatConnect
T
Tenable Blog
Spread Privacy
Spread Privacy
T
True Tiger Recordings
AWS News Blog
AWS News Blog
F
Fox-IT International blog
量子位
T
Threatpost
V
Vulnerabilities – Threatpost
C
CERT Recently Published Vulnerability Notes
Cisco Talos Blog
Cisco Talos Blog
GbyAI
GbyAI
宝玉的分享
宝玉的分享
腾讯CDC
G
Google Developers Blog
aimingoo的专栏
aimingoo的专栏
Cyberwarzone
Cyberwarzone
有赞技术团队
有赞技术团队
S
SegmentFault 最新的问题
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
U
Unit 42
雷峰网
雷峰网
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Simon Willison's Weblog
Simon Willison's Weblog
O
OpenAI News
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
The GitHub Blog
The GitHub Blog
The Register - Security
The Register - Security
MyScale Blog
MyScale Blog
小众软件
小众软件
A
About on SuperTechFans
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
博客园 - 三生石上(FineUI控件)
美团技术团队
Google Online Security Blog
Google Online Security Blog
P
Proofpoint News Feed
MongoDB | Blog
MongoDB | Blog

DEV Community

Build a Secure API with Rails 8 - Part-3: Auth Controllers I A/B tested 4 LLMs on the same 500 queries. The results surprised me. OSS Monthly Recap: What My Daily Commit Challenge Taught Me About Open Source “Culture” GemmaNotes Cognitive Debt: AI Is Building Your Systems. Do You Actually Understand Them? GeekNews Frontend Weekly Deep Dive - 2026-05-25 I Built a Universal Silicon Loader That Runs on Any SOC (No Bootrom Exploit) Docker容器化部署Node.js应用最佳实践 I Put a Neural Network in a Thermometer — Then It Got Out of Hand Building MGZon: Developer Portfolio + AI Bot + Social Network (9 min demo) Bearing Life (L10): What the Catalog Number Really Tells You Longhorn Volume Health: The Gap Between 'Healthy' and Actually Working Stop Prompting. Start Specifying: How Spec-Driven Development Fixes AI Coding TIL a PowerPoint file is just a zip — so I converted .pptx to Word entirely in the browser 로컬 LLM 셋업 가이드 (v18) Cx Dev Log — 2026-04-24 github's agent audit api is the boring feature that matters # From Teaching Code to Building Real-World Applications Vivado 2026.1 and Linux: why this decision matters beyond the headline Vivado 2026.1 y Linux: por qué la decisión importa más allá del titular ORA-00206 오류 원인과 해결 방법 완벽 가이드 Entidades finas e composição: o design que escolhi para a nova plataforma 10 Open Source Tools Every Developer Should Know 🔥 SSH Config File Mastery: Turning `~/.ssh/config` Into a Productivity Tool I tried to create a programming language... in python I Replaced 70MB Node.js Log Viewer with a 172KB Zig Binary I Turned npm outdated into a CI Gate — Here's How Don't fall for the Claude Mythos hype Vestige: A Gemma 4 Brain Tracker That Won't Blow Smoke Up Your Ass Gemminate: Transforming Static Textbooks into Interactive Learning Journeys with Gemma 4 Where Did All the Code Playgrounds Go? I built PROOFER - Privacy first Chrome extension that proofreads your texts using Gemma 4 I Automated My Entire Digital Product Business on a $13/Month GCP VM. Here's the Architecture. Beginner's Mind in Engineering and AI How I use AI agents to turn ideas into public demos I Built a Quotation Generator for Kenyan Street Welders Using Gemma 4's Vision The Math Behind Neural Networks — Explained Like Nobody Did for Me 🧨 Understanding TPC with IEEE802.11h What I’m Starting to Look for in Engineers An npm Downloads Comparison Chart in 300 Lines of Vanilla JS — Nice-Tick Math and API-Direct Fetch Vitreus: Local-First Spreadsheet Intelligence with Gemma 4 Transfer Fees, Metadata, and Soulbound Tokens: A Tour of Solana Token Extensions I got tired of re-explaining my codebase to ChatGPT — so I built a VS Code extension Revisiting My Phone AI After Gemma 4: The Upgrade I Didn't Know I Needed I built a privacy-first PDF merger in 7 hours — here's the stack and the lessons Google I/O 2026 made me ask an uncomfortable question: are we still coding, or are we managing builders? SSR with JavaScript: Escaping Node.js Clunkiness with AxonASP My CKA Exam-Day Experience: What Went Right, What Went Wrong, and Lessons Learned Gemma 4 Soft Tokens: The Rise and Fall of 16x16 Words ⚡👀 Two weeks ago, I built a private AI brain on my phone using Gemma 4. Yesterday, Google dropped a new variant that made everything I built feel like a beta test. 256M parameters. MoE architecture. Apache 2.0 license. I broke down what changed and why it mat I got tired of clicking through the Stripe dashboard, so I built a CLI Getting Data from Multiple Sources in Power BI: A Practical Guide to Modern Data Integration Google Is No Longer Just a Search Engine I built GemmaPod - A truly composable and portable AI agent solution powered by your local LLM Gemma 4 E4B caught three planted fabrications in 50 seconds — on a laptop, no cloud How to build an AI-powered content moderation pipeline for user comments Running Gemma 4 on a Modest Machine: Unsloth vs LM Studio vs llama.cpp vs Ollama AI Makes Building Cheap. Our Product Architectures Still Assume It’s Expensive. I built an in-browser Roku TV remote with ~80 lines of TypeScript. Here's how Roku's ECP API actually works The Direction of Blame babbled notes: a sound-to-music agent for people who could not make music before How I Built a Live SQL Workshop Where Students Can't Break Anything Rescuing a Stranded Protocol: Re-Skinning Legacy Code for the Trestle DeFi Flywheel SOLID Heuristics Reveal Incomplete Domain Knowledge — Nothing More AllasCode Intitute / FullAgenticStack: The Intent-Based Router Introducing LogicGrid — Multi-Agent AI Orchestration for .NET AI Prompt Injection, Drupal SQLi Exploitation, and Nmap for Hardening AI Agents & Python Workflows: Anthropic Skills, Jupyter Challenges, and Edge Deployment SQLite Optimization, PostgreSQL Async Queries, & DuckLake Dataframe Spec RTX 5080 Undervolt Benchmarks, CGO-Free CUDA API Binding, & AMD GPU Compatibility Fix Microsoft Burned Its 2026 AI Budget on Claude Code in Six Months. That's the Real Story. Why I Started Learning FastAPI in 2026 I Abandoned Ghost for Months — Then Came Back and Finally Finished It Building an Open MIT-Licensed Ephemeris Engine in C — JPL Moshier Ephemeris 4 Smart Ways to Manage Retries in Side Projects Securing Web APIs: A Practical Guide to Authentication & Authorization Methods Google I/O 2026: AI Built an OS in 12 Hours. I Spent Mine Sorting Screenshots. 🤦 Half a Day, Not a Week: One Nix Flake for Three Machines 🌱 Keep Feeding Your CI/CD — Or Watch It Die Gemma 4 vs GPT-4o vs Llama 3: What Actually Works Locally? Vessel Ops SSH in 2026: Why Every Developer Should Know It Cold Audit AI-Generated PRs Before You Merge Them (Swarm Orchestrator 10.3.0) App Store Optimization (ASO) I built a tool to visualize Django REST Framework architecture (URLs, Serializers, Models, and more) How I made my React site agent-ready in 100 lines AI Can Generate Interfaces on the Fly. But Users Still Need Orientation. AI-Assisted Content Workflow How We Learned That Most Resume Rejections Happen Before Humans See Your CV How I Prepared for CKA: Resources, Labs, and Strategy That Worked for Me Remix Mini PC: Moving the Whole Operating System Onto the eMMC Stop Flying Blind: We Built an LLM Evaluation Framework That Works Across 17+ Agent Frameworks The Misleading "User is not authorized to access connection" Error in AWS CodeBuild — and Why Your IAM Policy Looks Fine I Resurrected a Dead F1 Project and Accidentally Built a Race Intelligence OS Remix Mini PC: After a Year of Dead Ends, the eMMC Finally Talks Not All Games Are Equal: The Real Difference Between a Trap and a Tool How to add Peppol e-invoicing to your SaaS without making it your team's problem I Built a Hermes Agent to Tell Me Which Hackathons to Enter. It Told Me to Enter This One. The Five Hooks That Change How You Ship With Claude Code Powering Your Progress: Building Robust Solutions with Laravel
Google I/O 2026’s Smartest Developer Release Wasn’t a Model, It Was the Runtime - Managed Agents in Gemini API
Balaka Biswa · 2026-05-25 · via DEV Community

This is a submission for the Google I/O Writing Challenge

Google I/O 2026’s Smartest Developer Release Wasn’t a Model. It Was the Runtime.

Every Google I/O has its headline magnet. A faster model, a shinier demo, a new capability that makes developers excited, nervous, or both. Google I/O 2026 had plenty of those moments. Gemini 3.5 Flash came with serious benchmark energy. WebMCP gave the open web crowd something ambitious to debate. AI Studio, Chrome, Search, and Gemini all moved deeper into agentic territory.

But the most important developer announcement was not the loudest one.

It was the Managed Agents in the Gemini API.

Gemini Agents

That may sound less glamorous than a new model, but that's also exactly why it matters. Models are the engines, while Managed Agents are the chassis, gearbox, dashboard, pit crew, and the emergency brake. It is the layer that turns “the model can reason and use tools” into “my application can ask an agent to do useful work, observe what it did, preserve state, collect artifacts, and continue from there.”

That is a very different product. And for developers, it may be the more important one.

The Real Bottleneck was Never Intelligence

For the last couple of years, agent demos have followed a mundane script:

  1. A model receives a task.
  2. It calls the required tools.
  3. It plans and writes code.
  4. Runs the code and inspects the result.
  5. Fixes its own mistake.

Everyone nods. But then, a developer tries to build the same thing in production and immediately runs into the actual problem.

The hard part is not only getting the model to think. The hard part is giving it a place to work. A serious agent needs a runtime. It needs a sandbox, files, tool boundaries, memory or state. It needs observable intermediate steps and controls for network access, credentials, cost, and cleanup, not to add developer ergonomics that do not require every team to rebuild the same orchestration layer from scratch.

This is the gap Managed Agents tries to close.

Google’s announcement is not plainly: “Gemini can use tools.” The more interesting claim is this: Google is packaging the agent loop itself as a managed developer primitive.

With Managed Agents, the Antigravity managed agent can run inside a Google-hosted Linux environment, execute code, manage files, use web access, preserve environment state, and return observable execution traces through the Interactions API. That shifts the developer’s job.

Instead of building the whole runtime yourself, you can start from a hosted agent environment and focus on the product boundary around it.

That boundary is where the real engineering begins.

Managed Agents flow

What Google Shipped

At I/O 2026, Google introduced Managed Agents in the Gemini API, with the Antigravity agent available as a public preview. The agent is powered by Gemini 3.5 Flash and exposed through the Interactions API and Google AI Studio.

The product has a few key parts:

Component What it does
Gemini API The developer API surface for Google’s models and agents
Interactions API The API layer built for stateful, agentic, multi-turn workflows
Antigravity managed agent Google’s hosted general-purpose agent harness
Remote Linux environment A sandbox where the agent can execute code and manage files
Environment ID A handle that lets later calls continue in the same workspace
Interaction ID A handle for continuing conversational state
AI Studio Agents Playground A visual way to prototype agent behavior
Custom agents Reusable agent configurations with instructions, sources, and environment settings

The important part is that this is not a single stateless prompt-response API. A stateless call is good for generation, classification, extraction, summarization, and one-shot reasoning. A managed agent is better suited for work that requires state, tools, files, and iteration.

Think data analysis, repository auditing, research synthesis, report generation, benchmark runs, documentation updates, or internal workflow automation. Which is why, Managed Agents is more than another AI feature. It is closer to an execution substrate.

The Architecture: Prompts Go In, Work Comes Out

A simplified Managed Agents flow looks like this:

Flowchart for Managed Agents

The developer sends a task through the Interactions API. The managed agent receives it, reasons through the task, uses available tools, reads or writes files inside the remote environment, and returns both the final output and structured information about execution.

The key is state.

Google gives developers two major handles:

Handle Purpose
previous_interaction_id Continue the conversation
environment_id Continue working in the same sandbox

That second handle is especially important. Without environment persistence, every agent task becomes a one-shot performance. With environment persistence, the agent can build on previous files and results.

Turn one can create an analysis. Turn two can improve the chart. Turn three can package the output. Turn four can audit the final files.

That feels less like prompting a chatbot and more like supervising a remote worker with a shell.

A Minimal API Pattern

Antigravity

The cleanest mental model is:

Concept Mental model
Interaction The conversation and reasoning state
Environment The working directory and execution state
Agent The policy and tool-using worker
Artifact The files created by the work
Step trace The observable record of what happened

A basic Python workflow could look like this:

from google import genai

client = genai.Client()

first_run = client.interactions.create(
    agent="antigravity-preview-05-2026",
    input=(
        "Read revenue.csv, identify the top three trends, "
        "and save a short report as report.md."
    ),
    environment="remote",
)

print(first_run.output_text)
print(first_run.environment_id)

second_run = client.interactions.create(
    agent="antigravity-preview-05-2026",
    previous_interaction_id=first_run.id,
    environment=first_run.environment_id,
    input=(
        "Now create a chart for the strongest trend "
        "and save it as chart.png."
    ),
)

print(second_run.output_text)

Enter fullscreen mode Exit fullscreen mode

The developer did not manually create a container, pass files between steps, write a tool router, manage the execution loop, or build a step logger. The managed runtime absorbs much of that scaffolding.

That is the product insight. Google is not only offering model intelligence. It is offering the operating context around that intelligence.

Why the Interactions API Matters

The Interactions API is one of the most important parts of this launch because it signals how Google expects developers to build with Gemini going forward.

Older model APIs are shaped around a single call: send content, get content. That works for many use cases. But agentic workflows need more structure. They need server-side state, tool traces, intermediate events, resumability, and file continuity.

Consider a data workflow: A user uploads three CSV files and asks for a short analysis --> The agent writes a script, runs it, creates a plot, and writes a markdown summary --> Then the user says, “Actually, split this by region and add a table.”

In a stateless setup, you either replay everything into context or manually store and reload outputs.

With Managed Agents, you continue from the prior interaction and reuse the same environment. The files are already there. The agent can inspect them again. The workflow becomes less like prompt engineering and more like a remote analytical session.

Custom Agents: From Prompt to Reusable Worker

Managed Agents are useful as one-off calls, but the more production-relevant pattern is creating reusable agents with stable instructions, sources, and environment controls. A repo auditing agent, for example, should not need a giant prompt every time. It should have a defined role, defined workspace, and clear output expectations.

A simplified setup might look like this:

from google import genai

client = genai.Client()

agent = client.agents.create(
    id="repo-auditor",
    base_agent="antigravity-preview-05-2026",
    system_instruction=(
        "Audit the repository for test failures, dependency issues, "
        "and risky code patterns. Write findings to "
        "/workspace/output/report.md."
    ),
    base_environment={
        "type": "remote",
        "sources": [
            {
                "type": "repository",
                "source": "https://github.com/your-org/your-repo",
                "target": "/workspace/repo",
            }
        ],
        "network": {
            "allowlist": [
                {"domain": "api.github.com"},
                {"domain": "pypi.org"},
            ]
        },
    },
)

Enter fullscreen mode Exit fullscreen mode

This is where Managed Agents start to look less like “chat with tools” and more like infrastructure.

You can imagine teams defining internal agents such as:

Agent Purpose
data-report-agent Turn CSVs into charts and summaries
repo-auditor Review a codebase and write findings
release-note-agent Compare commits and draft release notes
benchmark-agent Run evaluation scripts and summarize metric changes
doc-update-agent Propose documentation changes from source updates

The important engineering move is repeatability.

A useful agent should not depend on a perfect prompt typed by a tired developer at 1:13 AM. It should have persistent instructions, restricted access, stable output paths, and behavior that can be reviewed.

That is the difference between a demo and a product.

Filesystem-Native Configuration is a Bigger Deal Than it Sounds

One detail that I like is the support for instruction files such as AGENTS.md and skill files such as SKILL.md.

Now why is that a huge thing?

Developers already understand files. Repositories already have conventions. Teams already review documentation, configuration, and scripts in pull requests. Putting agent behavior into files makes that behavior easier to inspect, diff, review, and version.

A repository might look like this:

repo/
  AGENTS.md
  .agents/
    skills/
      audit-tests/
        SKILL.md
      summarize-changes/
        SKILL.md
  src/
  tests/
  package.json

Enter fullscreen mode Exit fullscreen mode

That is a smart direction because it makes agent behavior part of the software project, not an invisible prompt hidden inside a product dashboard.

A team can review:

Question Why it matters
What is the agent allowed to do? Defines operational boundaries
What files can it inspect? Controls scope
What outputs should it produce? Improves repeatability
What external domains can it access? Reduces leakage risk
What skills does it use? Makes behavior easier to audit

This is the kind of technical detail that decides whether agents become production tools or remain conference magic tricks.

Observability: Because “Trust Me Bro” is Not a Log Format

Observability meme

An agent that runs code, reads files, searches the web, and creates artifacts cannot be a black box. Developers need to know what happened. Not in a vague “the agent analyzed your data” way. They need step traces. They need to inspect tool calls. They need to see what files were touched, what commands ran, what sources were consulted, and where the process failed.

Agent observability has three jobs:

Job Why it matters
Debugging You need to know where the process went wrong
Trust Users are more likely to accept output when they can inspect the path
Governance Teams need records for security, compliance, and review

This is another reason the Interactions API matters. Agentic applications are not only about final text, they are about the work behind the text. A good platform needs to expose that work.

Pricing and Control

Managed Agents is useful, but agentic workflows can spend tokens quickly. A normal chat call is usually bounded by input and output. An agent run may include planning, tool calls, file inspection, code execution, error recovery, generated artifacts, and multiple rounds of iteration.

That means cost control is a product requirement, not an accounting afterthought.

A real integration should include:

Control Why it helps
Narrow task scopes Prevents sprawling behavior
Budget limits Stops runaway usage
Streaming visibility Lets users cancel bad runs early
Clear stop conditions Reduces unnecessary iteration
Human approval gates Protects sensitive actions
Environment cleanup Avoids stale or risky artifacts

Security: A Useful Power Still Needs a Fence

Managed Agents is exciting because it gives the model a place to act. That is also why it deserves caution. An agent that can read private files, process untrusted content, browse the web, and call tools has a real attack surface. The risky combination is:

  1. Access to private data
  2. Exposure to untrusted instructions or content
  3. Ability to communicate externally or take actions

That combination can create prompt injection, data exfiltration, and tool misuse risks. A safer architecture should wrap the managed agent in policy checks, scoped files, network allowlists, human review, and audit logs.

Flowchart

A practical rule: treat the agent like a junior engineer with shell access.

Useful? Absolutely. Unsupervised in production? Please do not make your incident report write itself.

What Needs To Improve

Managed Agents is still a preview product, and the limitations matter.

Current constraints include preview API stability, limited tool support in some areas, no structured outputs for the Antigravity agent, no MCP support for this agent yet, no background execution for Antigravity, and limited multimodal input coverage.

The biggest gap for many developers is structured output. If an agent produces artifacts for humans, markdown is fine. If it feeds another system, developers often need strict schemas.

A more mature production version should improve:

Feature Why it matters
Structured outputs Safer system-to-system integration
Job controls Better cancellation, retries, and background runs
Policy controls Stronger file, tool, and network permissions
MCP support Better tool ecosystem interoperability
Evaluation hooks Easier testing before deployment

That would move Managed Agents from promising preview to serious default runtime.

Final Verdict

Gemini 3.5 Flash gives Google a stronger engine, and WebMCP hints at a more agent-readable web. But Managed Agents gives developers the layer they actually need to turn model intelligence into product behavior: a runtime.

That runtime can execute code, handle files, preserve state, expose steps, and produce artifacts. It also forces serious questions about security, cost, observability, and control. That is exactly why it is interesting. The future of agentic software will not be won only by the smartest model; it will be won by the platform that makes smart models useful, inspectable, constrained, and economically sane.

So yes, enjoy the flashy demos. Watch the model benchmarks. Argue about whether the web needs WebMCP. But if you are a developer deciding what to build after I/O, pay close attention to the less sparkly runtime announcement.

That is usually where the future hides.

References

AI Meme