惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

L
LangChain Blog
Security Latest
Security Latest
P
Proofpoint News Feed
GbyAI
GbyAI
PCI Perspectives
PCI Perspectives
博客园 - Franky
N
Netflix TechBlog - Medium
博客园_首页
WordPress大学
WordPress大学
K
Kaspersky official blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Vercel News
Vercel News
T
Threatpost
The Hacker News
The Hacker News
H
Help Net Security
S
Securelist
Recent Announcements
Recent Announcements
腾讯CDC
T
Tailwind CSS Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
Engineering at Meta
Engineering at Meta
C
Cisco Blogs
V
V2EX
C
Check Point Blog
S
Schneier on Security
Cyberwarzone
Cyberwarzone
C
Cybersecurity and Infrastructure Security Agency CISA
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
B
Blog RSS Feed
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Jina AI
Jina AI
M
MIT News - Artificial intelligence
T
Threat Research - Cisco Blogs
博客园 - 叶小钗
A
Arctic Wolf
AWS News Blog
AWS News Blog
Latest news
Latest news
Martin Fowler
Martin Fowler
Recorded Future
Recorded Future
Last Week in AI
Last Week in AI
The GitHub Blog
The GitHub Blog
小众软件
小众软件
B
Blog
aimingoo的专栏
aimingoo的专栏
C
Cyber Attacks, Cyber Crime and Cyber Security
V
Visual Studio Blog
P
Palo Alto Networks Blog
Spread Privacy
Spread Privacy

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python) The Hidden Cost of AI Systems Nobody Talks About. undefined vs undeclared, and how typeof behaves Switching from file-based jobs to NATS/Kafka in Rust without changing code io_uring Adventures: Rust Servers That Love Syscalls Why Agentic AI is Killing the Traditional Database The POUR principles of web accessibility for developers and designers Quantum Neural Network 3D — A Deep Dive into Interactive WebGL Visualization How To Install Caveman In Codex On macOS And Windows Automation Pipeline Reliability: Why Your Workflow Breaks When Nobody Is Watching I Built an 'Open World' AI Coding Agent — It Works From ANY Folder From Freelancing to Product: A Tech Service Company's SaaS Transformation China's AI Giants: Adding Tencent Hunyuan & ByteDance Doubao to AI University (74 Providers) On the Vibe Coders and Their Lies clerk: Auto-Summarize Your Claude Code Sessions AI Weekly — 2026/04/10–04/17 | The Model Lockdown Is Here, but the Toolchain Is the Real Battleground AI 週報 — 2026/04/10–2026/04/17 模型封鎖潮來了,但工具鏈才是真戰場 Maybe this is how Open-Source apps are born... 🚀 Fine-Tune LLMs with LoRA and QLoRA: 2026 Guide tRPC v11 + Next.js App Router: End-to-End Type Safety Without the Boilerplate ShadCN UI in 2026: Why I Stopped Installing Component Libraries and Started Owning My Components SaaS Billing in React Server Components: Stripe + Supabase Without a Single `useEffect` Join our DEV Weekend Challenge — $1,000 in Prizes Across TEN winners! Submissions Due April 20 at 6:59 AM UTC. Implementing FSRS Spaced Repetition in Flutter + Supabase — Adding Memory Science to an AI Learning App "I Texted My Localhost From the Train — Claude Code Fixed the Bug Before I Got Home" I Built a Sales Prep AI and It Went Deeper Than Expected Design to Code #2: One JSON, Eleven Outputs Solving the 100M-Row Problem: A Summary Table Pattern for High-Volume Push Notification Logs Flutter Web With Wasm: What Actually Changes For Developers I Built 50 Royalty-Free Soundtracks for My Side Project in a Weekend Using AI Music Generation The Vibe Coding Security Checklist: 7 Things to Check Before You Ship Stop Letting Googlebot Guess Fix Your React App's SEO Right Desconstruindo o Streaming do LinkedIn: Como Criar um Engine de Extração de Vídeo de Alta Performance com HLS e FFmpeg (EDA Part-1) EDA (Exploratory Data Analysis) Explained With Real Life — Why Looking at Your Data Is the Most Important Step in Machine Learning Brand Relationship Management at Scale: Our 4-Touch Outreach System for 200+ Brands Why String.fromEnvironment() Might Return an Empty String in Dart JGuardrails 1.0.0 — Hardening Java LLM Apps Against Jailbreaks, Toxicity, and Prompt Injection Plan and Schedule a Full Week of Threads Content From One Claude Conversation Coding Cat Oran Ep3, Five Tables Changed Everything BFF模式详解:构建前后端协同的中间层 I'm done watching freelancers get buried by 200 proposals. So I'm building the alternative. This is my first post BFS Algorithm in Java Step by Step Tutorial with Examples Tracking LLM Pricing Monthly: An Open Dataset for 22 AI Models How We Measure Content ROI on a Comparison Site: Revenue Attribution Without Perfect Data Introducing Nova AI Ops: The AI-Native Operating System for SRE Teams I built a free desktop video downloader for Windows — Grabbit How Talkie OCR Helps Vision-Impaired & Dyslexic Users Read the World Around Them VRCFaceTracking安装和iPhone面捕配置教程,有bug Even CrowdStrike Can't See Your Agents The Automation Gold Rush: What n8n Workflows and Claude Are Opening Up for Developers Right Now
The Runtime Was Dead Long Before the Dashboard Noticed
Shivam Gawal · 2026-05-24 · via DEV Community

This is a submission for the Google I/O Writing Challenge

At 12:00 PM during the Cerebral Valley Google I/O Hackathon, RepoProbe attached itself to a generated FastAPI repository that looked production ready from almost every conventional angle.

The container booted inside Google's Antigravity sandbox without instability. Docker compilation layers completed cleanly. The ASGI runtime mounted correctly. Health probes stabilized almost immediately. Gemini 3.5 Flash summarized the repository as a distributed inference backend coordinating asynchronous workers through Redis queues and MCP orchestration layers.

Nothing failed during shallow inspection.

The repository structure looked convincing enough that most engineers would stop investigating after the first few minutes. Route boundaries were separated correctly from worker execution paths. OpenTelemetry instrumentation wrapped request lifecycles properly. Retry handlers existed. Queue semantics looked believable. The logs looked believable too.

Then RepoProbe started replaying corrupted authentication traffic against the live runtime.

JWT timestamps shifted outside valid windows. Signature payloads were reconstructed with malformed byte ordering. Claims objects were intentionally truncated before replay. Several requests combined impossible cryptographic states that should have terminated execution immediately if verification logic actually existed underneath the middleware layer.

The responses barely changed.

At first the behavior looked like cache contamination somewhere inside the request path.

Syscall tracing exposed something worse.

During replay, the middleware never touched the descriptor associated with the verification key material at all.

No read boundary appeared against the mounted secret volume.

No epoll_wait occurred on the expected cryptographic dependency path.

request replay
      ↓
jwt.decode(... verify=False)
      ↓
broad exception handler
      ↓
HTTP 200 OK

Enter fullscreen mode Exit fullscreen mode

expected syscall:
read("/run/secrets/jwt.pem")

observed:
nothing

Enter fullscreen mode Exit fullscreen mode

The application surface resembled authentication closely enough that conventional inspection procedures accepted it as authentication. Kernel level activity showed no evidence that signature verification had ever occurred.


The Payment Pipeline Was Simulating Settlement State

Several hours later, another repository exposed what initially looked like a production grade financial reconciliation pipeline.

Settlement events propagated through asynchronous queues correctly. Internal transaction state transitioned through believable lifecycle stages. Retry handlers activated during simulated webhook failures. The API emitted realistic transaction identifiers following Stripe formatting conventions closely enough that aggregation systems indexed them naturally during replay.

Packet inspection showed the runtime never established a successful outbound connection to any payment provider.

The orchestration layer generated synthetic settlement continuity locally while replaying reconciliation progress internally through its own queue substrate. Socket state transitions revealed repeated connection failures against a nonexistent upstream target while the scheduler continued mutating local financial state as though confirmation packets had already returned successfully.

Distributed tracing reinforced the illusion because spans still reflected believable ordering semantics even though no external payment lifecycle existed underneath the orchestration boundary.

otel.trace.status = OK
worker.retry.count = 3
transaction.state = settled
queue.depth = 0

Enter fullscreen mode Exit fullscreen mode

tcpdump:
SYN
SYN
SYN
timeout

Enter fullscreen mode Exit fullscreen mode

Traditional observability tooling interpreted the system as healthy because the generated runtime continued producing structurally valid telemetry despite the absence of any successful network level settlement flow.


The MCP Runtime Failure Was Worse

The MCP orchestration graph failed differently.

Statically, the repository looked sophisticated enough to resemble a legitimate long horizon agent runtime. Tool schemas validated correctly. Context hydration initialized during startup. Capability negotiation exposed bidirectional streaming interfaces. Dependency graphs resolved without structural collisions during shallow inspection.

The failure surfaced only after concurrent execution pressure forced the scheduler into conflicting assumptions about ownership boundaries inside the orchestration graph itself.

One execution node permitted nullable asynchronous hydration during tool initialization while downstream branches assumed dependency resolution had already completed synchronously before delegation began.

Under concurrent replay, unresolved futures accumulated faster than the scheduler could unwind blocked execution paths.

Event loop starvation followed gradually.

Internal task queues stopped draining.

Several coroutine branches remained suspended indefinitely waiting for ownership resolution that no active execution path still controlled.

The process itself never crashed.

Health checks remained green because probe execution required almost no scheduler activity. OpenTelemetry spans continued streaming because instrumentation hooks emitted timing boundaries independently from runtime progress. The orchestration dashboard still showed active execution because state transitions were reconstructed from buffered queue metadata rather than from live coroutine advancement.

Thread inspection showed forward progress had stopped nearly forty seconds earlier.

scheduler loop:
active

health endpoint:
200 OK

otel exporter:
streaming

coroutine ownership:
deadlocked

queue drain rate:
0/s

Enter fullscreen mode Exit fullscreen mode

Runtime observation
The orchestration layer still appeared operational because telemetry exporters were attached to queue metadata transitions rather than active coroutine advancement inside the scheduler loop itself.

RepoProbe Stopped Trusting Telemetry

After that point, telemetry stopped being treated as evidence.

Descriptor activity was traced directly against scheduler state. Network socket allocation was mapped against live coroutine ownership. eBPF hooks attached to syscall boundaries exposed execution stalls underneath the orchestration layer while the observability stack above continued reporting healthy progress semantics.

The infrastructure remained conversationally alive after the runtime underneath had already stopped advancing.

Early versions of RepoProbe relied heavily on the managed execution infrastructure introduced during Google I/O 2026. Long horizon investigation loops coordinated through persistent runtimes. Gemini synthesized intermediate reasoning continuously while hosted orchestration layers handled escalation dynamically across sandbox environments.

The infrastructure itself started degrading under hackathon traffic several hours later.

Quota exhaustion spread across hosted runtime environments. Sandbox allocation stalled unpredictably. Detached interaction sessions continued replaying cached execution summaries after containers became unreachable. Several orchestration retries severed synchronization boundaries internally between execution runtimes and the agent layers coordinating them.

One investigation session exposed the underlying failure clearly.

A managed runtime lost causal contact with its Antigravity sandbox after repeated orchestration retries detached the execution boundary internally. The sandbox itself had already stopped progressing, but the orchestration layer still retained buffered outputs generated during earlier execution cycles.

Gemini continued synthesizing coherent runtime summaries from stale tool traces while the underlying container no longer possessed active execution state.

[dead container]
        ↑
stale tool outputs
        ↑
orchestration runtime
        ↑
Gemini synthesis layer

Enter fullscreen mode Exit fullscreen mode


Deterministic Verification Became Mandatory

Filesystem traversal, AST extraction, dependency reconstruction, syscall tracing, packet inspection, scheduler ownership analysis, queue replay, and runtime mutation were separated completely from the reasoning layer afterward.

Gemini only received finalized execution artifacts once deterministic runtime collection completed independently underneath it.

Without that separation, long horizon orchestration systems drift into recursive verification collapse surprisingly quickly. Generated summaries begin validating earlier generated summaries while buffered telemetry reinforces stale orchestration state long after the underlying runtime has detached from reality.

The most damaging failures rarely appeared as startup crashes.

One generated repository constructed a distributed worker topology combining Redis Streams semantics with local in memory scheduler fallbacks embedded directly inside the API process. Under concurrent replay, some execution branches treated queue ownership as distributed infrastructure while adjacent branches mutated shared state synchronously inside the web process itself.

Queue acknowledgements appeared successful from the orchestration layer perspective while scheduler inspection showed several tasks had never initialized correctly at all.

Another repository referenced an asynchronous execution framework that did not exist anywhere across PyPI, Conda, or GitHub. RepoProbe reconstructed a substitute execution chain heuristically by mapping unresolved import signatures against known queue initialization patterns observed across Celery, Dramatiq, and RQ based systems.

The repository booted partially afterward, but syscall replay later exposed incompatible assumptions between worker hydration state and API lifecycle ownership. Several execution branches attempted to mutate queue state before the underlying event subscribers had attached descriptors to the active scheduler loop.

The runtime continued exposing healthy telemetry while orphaned execution paths accumulated underneath the queue substrate silently.


What Google I/O 2026 Actually Changed

Antigravity sandboxes reduced the operational cost of isolated execution dramatically.

Persistent orchestration runtimes normalized long horizon agent workflows.

MCP execution graphs standardized tool coordination layers.

Gemini 3.5 Flash reduced reasoning latency enough that continuous orchestration became economically practical at enormous scale.

The bottleneck shifted somewhere else entirely.

Generation stopped being expensive.

Verification became expensive instead.

Not syntax verification.

Not whether the container boots successfully.

Not whether telemetry appears healthy during shallow inspection.

The expensive problem now is proving that execution semantics still preserve causal truth once scheduler ownership, cryptographic boundaries, descriptor activity, queue hydration, packet synchronization, runtime mutation, and concurrent replay begin interacting simultaneously under live execution pressure.

Because many of the repositories RepoProbe investigated did not fail loudly.

They remained operationally persuasive long after the runtime underneath had already stopped being real.