惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

宝玉的分享
宝玉的分享
WordPress大学
WordPress大学
博客园 - 司徒正美
美团技术团队
酷 壳 – CoolShell
酷 壳 – CoolShell
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
小众软件
小众软件
量子位
阮一峰的网络日志
阮一峰的网络日志
Apple Machine Learning Research
Apple Machine Learning Research
有赞技术团队
有赞技术团队
博客园 - 【当耐特】
博客园 - Franky
Jina AI
Jina AI
人人都是产品经理
人人都是产品经理
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
T
Threat Research - Cisco Blogs
D
Darknet – Hacking Tools, Hacker News & Cyber Security
F
Fox-IT International blog
T
ThreatConnect
A
Arctic Wolf
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Last Week in AI
Last Week in AI
C
CERT Recently Published Vulnerability Notes
P
Palo Alto Networks Blog
李成银的技术随笔
Project Zero
Project Zero
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
The Register - Security
The Register - Security
F
Full Disclosure
H
Hacker News: Front Page
雷峰网
雷峰网
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
S
SegmentFault 最新的问题
S
Schneier on Security
T
Tor Project blog
博客园_首页
月光博客
月光博客
大猫的无限游戏
大猫的无限游戏
博客园 - 聂微东
S
Securelist
C
Comments on: Blog
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Attack and Defense Labs
Attack and Defense Labs
IT之家
IT之家
博客园 - 叶小钗
J
Java Code Geeks
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events

DEV Community

Building an E2EE Chat App in Flask - Part 3: Keeping File Uploads Safe Microsoft Just Shipped MCP Governance for .NET. Here's What It Actually Enforces. How I Built a Pakistan Internet Speed Test Platform at 16 How to Build a Supervisor Agent Architecture Without Frameworks I Built My Own Corner of the Internet — Here's What It Looks Like How does VuReact compile Vue 3's defineExpose() to React? Neo-VECTR's Rift Ascent Idempotency Keys: The API Safety Net You Probably Aren't Using Building E-Commerce Sites for Niche Products: Technical Lessons from Specialty Outdoor Retailers Audit Logs: The Silent Guardian of Every Serious System Open-source SDS tooling for Japanese MHLW compliance: the gap nobody filled BetAGracevI I Built a Post-Quantum Cryptographic Identity SDK for AI Agents — Here's Why It Needs to Exist Running Claude Code across multiple repos without losing context There Are Cameras in Every Room of My House. I Put Them There. Why your AI agent loops forever (and how to break the cycle) How does VuReact compile Vue 3's defineSlots() to React? Building a Privacy-First Resume Editor with Typst WASM and React One Soul, Any Model: Portable Memory for Open-Source Agents with .klickd From Pixels to Prescriptions: Building an Autonomous Healthcare Booking Agent with LangGraph MonoGame - A Game Engine for Those Who Love Reinventing the Wheel # Day 24: In Solana, Everything is an Account Mastering Node.js HTTP Module: Build Servers, REST APIs, and Handle Requests Mastering Node.js HTTP Module: Build Servers, REST APIs, and Handle Requests RP2040 Wristwatch Tells Time With a Vintage VU Meter Needle observations about models / 2026, may From Video Transcripts to Source-Grounded AI Notes: A Practical Look at Notesnip AI Agent Dev Environment Guide — Real Experience from an AI Living Inside a Server How I Run 7 AI Models 24/7: Multi-Agent Architecture in Practice What exactly changes with the Claude Max plan? I Revived a Broken MLOps Platform — Now It's Self-Service, Policy-Guarded, and Operationally Credible OpenAI's $2M-tokens-for-equity YC deal, decoded Why DMX Infrastructure is Still Stuck in the 90s Agent Series (2): ReAct — The Most Important Agent Reasoning Paradigm Open Source Project (No.73): Sub2API - All-in-One Claude/OpenAI/Gemini Subscription-to-API Relay I Made the Wrong Bet on Event Streaming in Our Treasure Hunt Engine #ai #productivity #chatgpt #python Symbolic Constant Conundrum From Manual RAG to Real Retrieval — Embedding-Based RAG with NVIDIA NIM Building an outbound-only WebSocket bridge for local AI agents Our System's Sins in Ghana: Why We Had to Rethink Digital Product Sales Execution Governance, AI Drift, and the Security Paradox of Runtime Enforcement Differential Pair Impedance: Why USB and HDMI Routing Is a Geometry Problem Small AI database questions can become big scans Claude Code 2.1 Agent View & /goal: Autonomous Dev Guide 2026 Your AI database agent should not see every column Rust's Low-Latency Conquest: Why We Ditched C++ for a Treasure Hunt Engine Floating-point will quietly corrupt your emissions math, and 0.1 + 0.2 already warned you Autonomous Agents: what breaks first (and why that's the real product) [2026-05-23] Agent payments are the new cloud bill footgun ORA-00069 오류 원인과 해결 방법 완벽 가이드 How I Built a Local, Multimodal Gemma 4 Visual Regression & Patch Agent: Closed-Loop Validation, Canvas Pixel Diffing, and Reproducible Benchmarks Pressure-testing Ota on Supabase: from setup prose to executable repo readiness VPC CNI en EKS: cómo dejar de pagar nodos que no usás The Future of Text Analysis: Introducing TechnoHelps Semantic Engine I built a Chrome Extension that saves product images + context directly to Google Drive & Sheets 95+ browser-based dev tools that never touch a server Running Qwen 2.5 Coder 14B Locally in Cursor with Ollama From a 10,000-line OpenSearch export script to a log analysis tool Ghost Bugs Cost $40K: A Neural Debugging Postmortem SECPAC: A Lightweight CLI Tool to Password-Protect Your Environment Variables 🚀 PasteCheck v1.7 + v1.8 — Hints that tell you what to fix, and a nudge panel that tells you where to start 8 Real Ways Developers Make Money in 2026 (Ranked by Effort) I built a free AI-powered Git CLI that writes your commit messages for you sds-converter: Converting Safety Data Sheets to MHLW Standard JSON with Rust and LLMs OpenLiDARViewer: A Browser-Based LiDAR and Point-Cloud Viewer Local-First Browser Tools: What You Should Not Upload Online Why most freelancers undercharge (and the maths behind fixing it) We built a mahjong dangerous-tile predictor calibrated on 4.97M real hands Building a Chord Progression Generator in the Browser — Music Theory in JS, Sound via Web Audio API tutorial #10: 148 Opens, 0 Replies — How My Forge Cold Email v1 Completely Failed 9 in 10 Docker Compose files skip the basic security flags How to Forward Android SMS to Telegram Automatically I built the first security scanner for MCP servers — here's what I found Building an Interplanetary Quantum Logic Engine in Rust/Ovie From AI Code Generation to AI System Investigation I gave Gemini 3.5 Flash a CVE-fix PR to review. It found another bug in the same file. When I Realized We Were Throwing Away Half Our Engine's Potential TokenJuice and the 20-Minute Cron: Inside OpenHuman’s Aggressive Context-Harvesting Engine CodeDNA: AI Codebase Archaeologist Built with Gemma 4 Thinking Mode Building a semantic search API in Go with Meilisearch April 2026 DigitalOcean Tutorials: Inference Optimization and AI Infrastructure Looking for DTMF transceiver module Moving Beyond "Tribal Software": Why the Singularity Demands the Interplanetary Hybrid Human Use SVGIcons as a Claude Custom Connector to Find Icons Faster DMARC Is Now a Proper Internet Standard: What Changed in RFC 9989/9990/9991 OpenTelemetry Is Now a CNCF Graduate — and It's Coming for Your AI Stack OpenHuman Follows OpenClaw’s Rise, But With an Obsidian Brain O erro mais caro em programas Solana: PDA sem bump check Build a Live Flight Radar in a Single HTML File DuckDB 1.5.3 Adds Quack Client-Server, SQLite Gets Cypher Graph Extension Custom Copilot Agents: Building Domain-Expert AI Teammates with Skills, MCP Tools, and Custom Knowledge RTX 5090 Cooling, BeeLlama VRAM Opts, Resizable BAR Performance Gains This week in Cursor + .NET — 3 rules + 4 essays (week ending May 22, 2026) RAG Architecture with n8n + PostgreSQL (pgvector) + Ollama Gemma4 on AWS EC2 Keep Your Taste I Built chanprobe Because My Go Queues Were Invisible Building a Live Solana TPS Meter with OrbitFlare's TypeScript SDK Using Gemma 4 to Analyze Bitcoin’s Next 5, 15, and 60 Minutes Security news weekly round-up - 22nd May 2026
Google's Gemini Spark. Here's What It Actually Does for Developers.
Om Shree · 2026-05-23 · via DEV Community

Every major AI lab now has a personal agent story. Google's answer: announced at Google I/O 2026, isn't just another chatbot with calendar access. Gemini Spark runs 24/7 on Google's cloud infrastructure, with or without your device turned on, completing multi-step tasks across your entire Workspace ecosystem while you're doing something else entirely.

The Problem It's Solving

The standard Gemini experience, like most AI assistants, is reactive. You type, it responds. The moment you close the tab, nothing happens. That works fine for one-off questions. It breaks down completely for any task that involves monitoring, waiting, or executing across multiple apps over time - which is basically most of the actual overhead in a knowledge worker's day.

Google wants Gemini Spark to behave differently. Instead of simply answering questions, the assistant actively helps users manage digital activities. The premise is straightforward: stop making the user the orchestrator of every multi-step process. Hand that orchestration to the agent and get out of the way.

Gemini Spark follows a wave of popular agentic products from major AI labs, most notably Anthropic's Claude Cowork and OpenAI's ChatGPT agent, but Google is positioning Spark around one concrete structural advantage: it already has your email, your calendar, your Drive, and your Docs. No integrations to set up. No OAuth dance. It's all already there.

How Gemini Spark Actually Works

Spark was built from Gemini base models and an agentic harness from Google Antigravity - the same agent infrastructure behind the company's own internal tools. It runs on dedicated virtual machines on Google Cloud, so you don't need to keep your laptop open to make sure it's running. That's not a minor implementation detail - it's what makes Spark structurally different from agents that require a persistent local process.

Spark runs on the Gemini 3.5 Flash model and Google's Antigravity agent system. The three core primitives developers and power users will spend most of their time with are Tasks, Skills, and Schedules:

Tasks are one-off or multi-step instructions you hand off to the agent - things like "find and track interior design internships in New Orleans" or "scan my Google Drive and organize files into a tagged spreadsheet." Spark breaks these down and executes them across your connected apps.

Skills are reusable behavioral templates you define once and invoke repeatedly. As you build Skills, you define exactly how you want Spark to take actions on things you do often, tailoring your experience and saving you from repetitive prompting. The canonical example from Google's own demo: read through your last 50 sent emails, derive a writing style guide, and save that as a skill called ghostwriter that fires every time you ask Spark to draft an email.

Schedules are time-based or conditional triggers. Schedules allow you to automate your workload on your own terms by setting up time-based or conditional triggers to execute tasks exactly when you need them. A practical example: every Monday at 9 AM, scan the inbox from the past week, produce a priority summary, and block deep-work time on the calendar.

Spark integrates with Gmail, Docs, and Slides out of the box, and adds MCP connections to Canva, OpenTable, and Instacart starting today. Those MCP connections are worth paying attention to - they signal that Google is building Spark on the same open protocol that the broader agentic ecosystem is converging around, not locking the agent capability behind proprietary APIs.

What Developers Are Actually Using It For

The Google I/O demo made the email drafting use case look simple. Google's VP of Google Labs, Josh Woodward, tasked the agent with collecting information from Google Docs, emails, and chat conversations before running a skill designed to draft the email in Woodward's voice. In a second demo, Woodward showed Spark compiling a list of people who RSVP'd to a block party and building a document that itemizes what each person is planning to bring - a document that auto-updates when a new email related to the event arrives in the inbox.

That second demo is more interesting than it sounds. Auto-updating documents driven by email events is a conditional trigger executing across multiple data sources in real time. That's not chatbot behavior - that's a lightweight workflow automation, defined entirely in natural language, with no code.

For developers specifically, the Skills primitive is the most interesting surface. Teaching the agent a repeatable behavioral pattern - and giving it a name you can call - is essentially defining a function in natural language. The ghostwriter example isn't just a productivity trick; it demonstrates that Spark can build compound capabilities from prior interactions, which is closer to how production agent systems are actually architected.

The MCP integrations matter here too. If Google continues expanding Spark's MCP surface, developers will be able to connect it to custom tools and internal services using the same protocol they're already building on - without waiting for a native Google integration that may never come.

Why This Is a Bigger Deal Than It Looks

In the race to build compelling personal AI agents, Google may have an underrated advantage: it already has all your emails. That's a real moat. The hardest part of building an effective personal agent isn't the model - it's the context. An agent that can't read your history, understand your commitments, or act across your actual tools is just an expensive chatbot. Google doesn't have that cold-start problem.

Millions of users already rely on Google Workspace for work and communication. Spark doesn't ask those users to migrate to a new platform, connect a new service, or change their workflow. It inserts itself into the tools they're already in, which dramatically lowers adoption friction compared to standalone agent products.

The bigger architectural signal is the Antigravity harness. Google isn't shipping a one-off consumer feature - it's making its internal agent infrastructure available as a consumer product. That's the same pattern we've seen with cloud compute, ML infrastructure, and developer tooling: Google builds it for internal use, battle-tests it at scale, then ships it outward. If Antigravity is as capable as the Spark demos suggest, this is just the beginning of what runs on it.

Availability and Access

Gemini Spark is currently rolling out to trusted testers and will be available for Google AI Ultra subscribers over 18 in the United States, as well as select business users. Google is expanding access to more users and businesses over the coming weeks.

Gemini Spark runs on Gemini 3.5 Flash and Antigravity. Native integrations cover Gmail, Calendar, Drive, Docs, Sheets, Slides, YouTube, and Google Maps - all turned off by default, toggled on in settings. MCP connections to Canva, OpenTable, and Instacart are live at launch, with more presumably following.

The agent-first paradigm is no longer a research demo or a beta experiment. Google just shipped it to 900 million Gemini users with zero new apps required. Whether Spark becomes the default way developers automate their own workflows - or just another underused feature in the Gemini app - depends entirely on how good the Skills and Schedules primitives turn out to be in practice.

Follow for more coverage on MCP, agentic AI, and AI infrastructure.