惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

N
News and Events Feed by Topic
Malwarebytes
Malwarebytes
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cybersecurity and Infrastructure Security Agency CISA
F
Future of Privacy Forum
C
Cisco Blogs
T
The Exploit Database - CXSecurity.com
A
Arctic Wolf
S
Securelist
K
Kaspersky official blog
S
Schneier on Security
T
ThreatConnect
T
Tenable Blog
Spread Privacy
Spread Privacy
T
True Tiger Recordings
AWS News Blog
AWS News Blog
F
Fox-IT International blog
量子位
T
Threatpost
V
Vulnerabilities – Threatpost
C
CERT Recently Published Vulnerability Notes
Cisco Talos Blog
Cisco Talos Blog
GbyAI
GbyAI
宝玉的分享
宝玉的分享
腾讯CDC
G
Google Developers Blog
aimingoo的专栏
aimingoo的专栏
Cyberwarzone
Cyberwarzone
有赞技术团队
有赞技术团队
S
SegmentFault 最新的问题
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
U
Unit 42
雷峰网
雷峰网
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Simon Willison's Weblog
Simon Willison's Weblog
O
OpenAI News
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
The GitHub Blog
The GitHub Blog
The Register - Security
The Register - Security
MyScale Blog
MyScale Blog
小众软件
小众软件
A
About on SuperTechFans
Last Week in AI
Last Week in AI
Y
Y Combinator Blog
博客园 - 三生石上(FineUI控件)
美团技术团队
Google Online Security Blog
Google Online Security Blog
P
Proofpoint News Feed
MongoDB | Blog
MongoDB | Blog

DEV Community

4 Smart Ways to Manage Retries in Side Projects Securing Web APIs: A Practical Guide to Authentication & Authorization Methods Half a Day, Not a Week: One Nix Flake for Three Machines 🌱 Keep Feeding Your CI/CD — Or Watch It Die Gemma 4 vs GPT-4o vs Llama 3: What Actually Works Locally? Vessel Ops SSH in 2026: Why Every Developer Should Know It Cold Audit AI-Generated PRs Before You Merge Them (Swarm Orchestrator 10.3.0) App Store Optimization (ASO) I built a tool to visualize Django REST Framework architecture (URLs, Serializers, Models, and more) How I made my React site agent-ready in 100 lines AI Can Generate Interfaces on the Fly. But Users Still Need Orientation. AI-Assisted Content Workflow How We Learned That Most Resume Rejections Happen Before Humans See Your CV How I Prepared for CKA: Resources, Labs, and Strategy That Worked for Me Remix Mini PC: Moving the Whole Operating System Onto the eMMC Stop Flying Blind: We Built an LLM Evaluation Framework That Works Across 17+ Agent Frameworks The Misleading "User is not authorized to access connection" Error in AWS CodeBuild — and Why Your IAM Policy Looks Fine I Resurrected a Dead F1 Project and Accidentally Built a Race Intelligence OS Remix Mini PC: After a Year of Dead Ends, the eMMC Finally Talks Not All Games Are Equal: The Real Difference Between a Trap and a Tool How to add Peppol e-invoicing to your SaaS without making it your team's problem I Built a Hermes Agent to Tell Me Which Hackathons to Enter. It Told Me to Enter This One. The Five Hooks That Change How You Ship With Claude Code Powering Your Progress: Building Robust Solutions with Laravel I built a self-hosted CI/CD platform with persistent queue, encrypted secrets, and rollback UI — here's what I learned Antigravity 2.0 and the $1,000 OS: Why "Agent-First" Feels Like the Direction I've Been Building Toward Anyway I built an AI PR-triage agent in 30 lines of Markdown Core Web Vitals from 74 to 91: A Real Tax Practitioner Site Rebuild I Gave Gemma 4 150 Tools on Windows. Here's What Actually Happened. Beyond the Loop: Why Monolithic AI Agents Fail and How to Build a Microkernel Architecture The Hidden Tax of AI-Assisted Development (And How I Fixed It) I Ditched Cloud LLMs for Gemma 4 4B: A DevOps Engineer's 48-Hour Reality Check Building a Schema.org @graph That Validates on the First Try The "Lift and Shift" Trap: Why Your Integration Layer Needs More Than Just a Cloud Address All 7 OSI Layers Explained with Real-World Analogies Antigravity 2.0 in one day: the four shells and what each is good for Self-Hosting Google Fonts with size-adjust: Zero CLS Web Font Swap The Multi-Provider LLM Problem: Why “One API” Is Not Enough How I indexed 69,000 Claude Code skills (and what I learned doing it) RememberMe CareGrid: Local Gemma 4 for dementia memory and safety Google Is Killing Gemini CLI on June 18. Here Is What to Do Before Then Do Domínio ao Deploy: Hospedando Arquivos de Deep Links no Cloudflare Pages (Parte 7.1) Running Gemma 4 26B on an Old GTX 1080 with llama.cpp Devlog 1: I tried building an SNES game with the super FX chip Why Gemma 4 Feels Like an Important Moment for AI Developers✨ From Zero and Confused, This Is How I Started Learning to Code I Built a Local AI Gateway That Talks to Claude, ChatGPT, DeepSeek and Gemini — Without a Single API Key Bootstrapping with AI: Why Gemma 4 is the Micro-SaaS Founder’s Best Friend MyErp Architecture Series - #02 Cellular Architecture: Mapping Biology to Software Systems NodeJS vs Bun vs Go 🌍 RTL Arabic Style UI How Does an AI Agent Actually Buy Something? Google Just Published the Spec. Google I/O 2026 Is One Uncanny F.R.I.E.N.D.S Group Upgrade I Replaced 70MB Node.js Log Viewer with a 172KB Zig Binary The "MTTR Is All You Need" Trap The Quiet Revolution: How Firebase Became the First Agent-Native Backend at Google I/O 2026 I Built ResuMate! A 100% Private, Local AI Resume Optimizer with Google Gemma 4 Learning DirectX 12 - Part 2 Initialization Theory NeuralHats: I Put Edward de Bono’s Six Thinking Hats on Local LLMs Using Gemma 4 📝 Instant Auto Save Notes Engineering the "App-Like" Experience: A Deep Dive into PWA Architecture I built a local first AI CCTV assistant using Gemma 4 + Frigate CrowdShield AI — Smart Stadium Operating System & Crowd Intelligence Platform I built a free AI observability tool, prove your AI is useful, not just running Beyond Autocomplete: Why Google Antigravity 2.0 Changes the Rules for Indie Builders 터미널 AI 에이전트 구축 (v12) Building Instagram-Powered Apps with HikerAPI (Without Fighting Scrapers) Checkpoints, Not Transcripts: Rethinking AI Coding Agent Memory From Side Project to Student Savior: My AI PPT & Resume Tool Crossed 1.5K+ Users Why Story Points Don’t Work in the AI Era, And What Should Take Their Place Instead. Self-Hosted Document AI: How to Run Document Intelligence On Your Own Infrastructure (2026) How to Extract Tables from PDFs with AI: 4 Methods That Actually Work (2026) IDP vs OCR: What's the Difference — and Which Does Your Business Actually Need? Automated PII Detection and Redaction in Business Documents: A Practical Guide Human-in-the-Loop Document Review: When to Use It and How to Set It Up (2026) Document Processing Without RPA: A Modern Approach for Small Teams Reducto Alternative: When You Need More Than a Document Parser (2026) Hermes Agent vs LangChain vs CrewAI: When to Reach for Each SparshAI: I Built an Offline AI Tutor for Students Using Gemma 4 — Here's What Happened Building NeuroSense AI: A Human-Centered Stress Insight Assistant Powered by Gemma Why I Built a Privacy-First Dev Toolkit GAS Input Tags: Ability Activation Without Hardcoded Bindings AI Legal Document Advisor Supported By Gemm 4 Model Building Convertify in Public Week 10: PDF Cluster + Blog Launch CureNet AI: Decentralized Health Intelligence for India, Powered by Gemma 4 and ABHA Standardization When Open-Weights AI Meets a Broken Healthcare System: Deploying Gemma 4 in Rural India V.A.L.I.D. Google I/O 2026: The Year Google Stopped Building AI Assistants and Started Shipping AI Engineers Bondmap: AI-Powered Relationship Network That Maps How You're Connected to Everyone Using Gemma 4 Gemma 4 challenge inspired me to build my first app! 96. LoRA: Fine-Tune a Billion-Parameter Model on a Laptop From a Student Who Used CircuitVerse to a GSoC Contributor — My Community Bonding Story How Bf-Tree Keeps Mini-Pages Small, Hot, and Cheap to Evict I asked Claude to explain the chip war and ended up understanding modern geopolitics differently Stop Manually Checking for Server Updates: Automate With Email Notifications Nostalgia Meets Cybersecurity: Spotting Modern Scams in a Retro OS Simulator - Forward or Fraud CRACKING CODING INTERVIEW From Python to Production Pipeline :A Practical guide to Apache Airflow Antigravity 2.0: Google Just Changed What It Means to Be an Engineer
Google I/O 2026: AI Built an OS in 12 Hours. I Spent Mine Sorting Screenshots. 🤦
Aabhas Sao · 2026-05-25 · via DEV Community

This is a submission for the Google I/O Writing Challenge


I haven't watched a tech keynote in a really long time. They usually feel like 2 hours of "the future is here!" slides with product demos that never work on stage.

But Google I/O 2026 actually got me. Watched the whole thing and came away genuinely excited and slightly stressed about how fast things are moving.


Agent vs me

They Built a Full OS in 12 Hours

Google's team built a full blown OS using Gemini agents. 93 sub-agents, 15K model requests, 2.6 billion tokens, all under $1,000 in credits. Twelve hours total.

Less than a thousand dollars. For 2.6 billion tokens. That's wild efficiency.

Gemini 2.5 Flash clocks 200 tokens/second output speed. Claude Sonnet sits around 40-60/sec. That gap is huge when you're running 93 agents in parallel and explains why this was even possible in that timeframe.

Gemini Spark Felt Like a MoltBot Alternative

Gemini Spark runs agents in a secure private cloud fully managed by Google. You don't worry about infrastructure, just give it tasks. The closest thing I could think of was MoltBot but living entirely inside Google's ecosystem.

Agentic AI you don't have to host, secure, or manage yourself sounds genuinely appealing. Haven't gotten deep access yet but it's on my list.


Agentic Search

Search Is Getting Agentic

For years Google Search has been a box you type into. Now it's a box that talks back, remembers, and proactively reaches out.

They showed concert updates for your city. You can ask Search to notify you when shows for an artist you like get announced. I live in Mumbai and would love to test how well this actually works here because currently I find out about shows after tickets are sold out.

AI autocomplete that doesn't just finish your query but suggests better ones based on intent. Proactive updates that follow up on your interests without you asking again. Search is slowly becoming something closer to a smart assistant.

Yes, Google, take my data, please.


Agent multitasker

Agents Are Buying Coffee Now

This was the part that felt most science-fiction-turned-real.

Universal Commerce Protocol lets AI agents interact with e-commerce systems. Agent Payments Protocol lets them actually complete payments on your behalf.

The live demo had an agent order coffee through DoorDash, selecting the item, going through checkout, completing payment. No human in the loop at any point.

Picture this: you tell your AI "book me a flight to Bangalore next Friday under ₹8,000, window seat" and it just... does it. Hotels too. Payments too.

Is this a privacy nightmare waiting to happen? Probably worth thinking about. But as a demo, genuinely impressive.

Gemini Live Replied in Haryanvi and I Was Not Ready

Small moment but a meaningful one. During the Gemini Live demo, the model replied in Haryanvi, a regional dialect spoken mostly in Haryana, India. Not just Hindi. Haryanvi. That was cool to see.


Work Onward (Inspiring real Antigravity and Stitch work usage case study)

Holly Jooyoung Diamond built Work Onward using Antigravity and Stitch. The problem she was solving was real: how do restaurant owners post jobs easily?

Job postings via SMS so owners don't need to sit at a laptop filling out forms. Multilingual job descriptions automatically so the listings open up to more candidates immediately.

Really inspiring to see that the access to build ideas is with everyone now.

Small thing but I actually used Gemini myself while writing this article to fetch the specific timestamp of the Work Onward demo from the keynote video because I had forgotten the details after watching. That worked surprisingly well.


WeatherNext Predicted a Category 5 Hurricane 3 Days Out

WeatherNext predicted a Category 5 hurricane in Jamaica three full days before it happened.

3-day advance warnings at category-level accuracy are the kind of thing that saves lives. Not a dev tool but a reminder that the same underlying models powering our autocomplete are doing genuinely important work elsewhere.


Fine-Tuning Gemma 4 with Antigravity

This is the one I keep coming back to.

Google showed fine-tuning Gemma 4 directly from the Antigravity CLI for custom use cases. I haven't worked much with local models yet but the cost of calling a large cloud LLM for every single query adds up fast, especially for repetitive domain-specific tasks.

If you're building a product that does the same type of classification or extraction thousands of times a day, running that against a fine-tuned Gemma locally is far cheaper than hitting a frontier API each time. That's the promise here.

I want to try this. Will write about it when I do.


Playing With the Antigravity CLI

Enough keynote recap. Here's what I actually tried.

Installing it

curl -fsSL https://antigravity.google/cli/install.sh | bash

Enter fullscreen mode Exit fullscreen mode

One thing I hit right away: after installation, typing agy or antigravity opened the Antigravity IDE instead of the CLI. Turned out I had an older IDE version installed and its PATH entry was winning.

Had to manually remove the old PATH entry from ~/.zshrc and re-source it. After that the CLI came up fine. Not sure if it's a bug or just my setup, but if you hit the same thing, check your .zshrc for conflicting PATH entries before assuming something is broken.


Organized screenshots

Organizing 200+ Screenshots

I had a Desktop full of screenshots going back two years. Totally unorganized, no naming convention, nothing. I thought, let's see what Antigravity does with this.

My prompt was: organize my screenshots, categorize them into folders, and give them meaningful names.

Gemini came back with a solid plan: a Swift OCR tool using macOS's native Vision Framework to extract text from each screenshot, paired with a Python script that classifies them into folders (Coding, Meetings, AI-Assistants, Communication, Finance, Design, Media, General) and renames them with date and keyword info like 2024-12-15_Brave_GitHub_GoogleMeet.png.

Using macOS's native Vision Framework instead of a third-party OCR library was a smart call, zero extra dependencies.

I gave the green flag and it started running. Midway through the dry run I got impatient and used the /btw command to check progress without interrupting the session. That's a genuinely useful feature, like tapping someone on the shoulder to ask "hey how far along are you" without stopping their work.

Files got organized in the end but honestly OCR alone isn't enough context to make great categorization decisions. A screenshot of a GitHub PR and a screenshot of VS Code might have similar text but very different purposes. Some files ended up in slightly wrong folders.

Not the model's fault, it's a genuinely hard problem. But it got me thinking: if the agent could actually see the screenshot using vision instead of just reading extracted text, the categories would be way more accurate.

Modern Web Guidance Plugin

agy plugin install https://github.com/GoogleChrome/modern-web-guidance

Enter fullscreen mode Exit fullscreen mode

This gives Antigravity context about modern web best practices, similar to what the Chrome Modern Web Guidance docs cover for Claude Code.

Using the CLI feels noticeably faster than AI-powered IDEs. No Electron overhead, no waiting for a UI to re-render. Just your terminal, the model, and results.

Every time I use a native CLI tool I wonder why Teams and VS Code are built on Electron. I get the history. Still.

I tried redesigning a section of my portfolio to add a tech stack display with icons. Gave it a screenshot reference and it couldn't nail the visual. Gave it a full URL to a reference site and the result was still not great honestly. Not sure if my prompts were bad or if this is just a CLI vs IDE thing, because the IDE felt like it gave better results. I don't know, need to experiment more.


Gemini Omni and SynthID

Almost forgot: Gemini Omni now has an evolved sense of physics for generating stylized videos. Generated video that actually behaves like it understands how things move and interact.

Also SynthID and C2PA credentials for detecting AI-generated content. As generated media gets better, tooling to authenticate what's real becomes critical infrastructure. Good to see it being built in at the platform level.


Final Thoughts

Google I/O 2026 didn't feel like a hype keynote. It felt like a company that had spent a year quietly building and was now ready to show it.

Fine-tuning Gemma 4 is the thing I most want to play with.

The agents-everywhere story is clearly where things are heading. The question is whether the underlying protocols stay open enough that indie devs can build on top of them. Hoping they do.

What was your favourite part of Google I/O 2026? Let me know in the comments, especially if you've played with Antigravity or Gemini Spark!