惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

DEV Community

Bajándole todos los minutos posibles al CI del backend con mas de 1000 tests Harness Engineering: Stop Re-Prompting Your Coding Agent Every Session HTML meta referrer: canonical reference Announcing the Trust Identity Protocol (TIP): HTTPS for the AI Era We built the feature in two days. Making it reliable took two weeks. LuisCore /for-agents.json — agent bootstrap — daily syndication · 2026-05-26 A Curious Journey Into Reverse Engineering an AI-Generated Python .exe Part 2: Enterprise Decision Intelligence Architecture: AI Governance, Threshold Policy Engines, and Operational AI Systems I will continue using Devise with Rails 8! The Developer's Guide to Picking the Right AI Code Model in 2026 (I Spent $500 So You Don’t Have To) 30 Kubernetes Tasks Every CKA Candidate Should Practice Before Exam Day Why Some Websites Feel Instantly Better to Use Advanced React Patterns I Wish I Knew 5 Years Ago ¿Cómo optimizar algoritmos en arreglos y listas con la técnica de dos punteros? I scanned 8 popular open source repos with one command. Here's what I found. mcp-probe v1.6.0: Stricter GitHub Actions checks for MCP CI gates How we connect two strangers' webcams fast (and keep the TURN bill small) LLM Agents Are Now Finding Zero-Days: How AI is Autonomously Rewriting the Rules of Vulnerability Research Minimal Code Doesn’t Mean Stable Code How I manage 40+ skills across Claude Code, Codex, and .agents folders Hardening Stealth Browser Fingerprint Integrity and State Persistence Quick Tip: Benchmarking Multimodal APIs in Under 10 Minutes How I Slashed My AI API Bill by 92% in 2026 — A Cost Optimizer's Speed Benchmark Guide How I Slashed My AI API Bill by 95% — A Practical Guide for 2026 A Go outbox library that runs inside your own DB transaction How I Built a Credit Optimizer That Saves 30-75% on AI Agent Costs (Open Architecture) The Missing POP: How I Ported a Yul Contract to Huff by Reading Every Opcode The Moment the Config Parser Became the Bottleneck Churn Tool Stack by Revenue Stage ($5K to $50K+) What I Learned Exploring AI-Generated 3D: A Hands-On Tour of Meshy, Tripo, and Three.js Day 15 - Software Composition Analysis(SCA) Contributing Upstream Instead of Forking: My grape-swagger-rails Story Behind The Badge: How We Built 2,000 Hackable Badges For Temporal Replay Access Control Doesn't Scale Linearly -- Part 3 33x faster than Rust: Why I stopped waiting for my compiler and built my own. I Built My First Production AWS Project as a Career Changer Why Detecting PII Matters More Than Ever JSON Schema in 10 Minutes — Validation, Types & Real Examples Python Tasks How I Started My Cybersecurity Journey as an SQA Engineer 🔐 Why "fancy fonts" in Discord and Instagram bios turn into boxes ☁️ GKE private cluster setup — common mistakes and how to avoid them I Thought a Username Didn’t Matter… Until I Saw How Much People Care About It Claude for Small Business: 382K Day-One Buyer's Guide I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail How I Built an AI-Powered Incident RCA Platform with LangGraph and RAG The Paywall Was a Painted Door Sonnet hallucinated. My agent stored it as fact. How React-Style Time-Slicing Keeps UIs Responsive 这个 Princeton 开源项目让 AI 自己修 Bug,19K Stars 但 90% 的人只用了 1% 功能 🔥 SWE-agent's 5 Hidden Uses Nobody Told You About 🔥 Decompiling Serial Number U-36: Python TERCOM Reconstruction, Cryptographic Logistical Forensics, and Swarm Consensus Fault Tolerance Microservices Patterns You Cannot Outrun a Wave I Fired My Entire Node.js Stack — Rust Rebuilt It in 3 Weeks (The Ugly Truth) BoxAgnts Introduction (2) — AI Agent Toolbox Cursor 3 ships parallel AI agents. Here is the multi-agent workflow that actually works. Prisma-7 A Complete Beginners Guide (With Free Cloud Database!) Akses HDD Rumah dari Laptop Kantor Pakai Tailscale + SMB (Tanpa VPN Ribet) Content Pipeline in MonoGame: Why I Don't Use It Debug Log #1 — The Pipeline That Looked Broken Data Structures in JavaScript: When to Use What (2026) BGP Route Flap Damping: A Solution or a New Problem? First look at AWS DevOps Agent The Next Big “Cult App” Probably Isn’t Another Social Media Platform From Template to Production-Shaped: An AI-Native Dev Flow for Go Side Projects Idempotency Keys: The API Pattern That Saves You From Duplicate Payments and Phantom Records Everyone's Building Jarvis. Nobody's Even Close. The Moment the Jaeger Tracer Exhausted Itself and What We Switched To How to Fix Tool-Use Loops in Autonomous Coding Agents Months of self-testing: Citations shine, other features remain unproven. Claude Code for Canary Deployments: How I Ship to 1% of Users Before Breaking Everything Your recurring scraper is re-downloading data that didn't change. Here's the 15-line fix (conditional GET) 20 Years of GPUs in Numbers: How FLOPS & TDP Grew, and Who Led the NVIDIA vs AMD Race (open dataset, 13.5k GPUs) Espressif Reveals CoreBoard and Korvo Dev Kits for ESP32-S31 Composable Abstraction Layer: o pattern que faltava entre Pinia e seus componentes Vue Your GitHub Actions Logs Are Leaking LLM Keys and Your SIEM Isn't Catching It Solving Complex Logic with Claude and Research Papers Building TheEpicBook: A Deep Dive into a Node.js Monolithic Web Application Haber yazilimi, haber scripti, haber sistemi: ayni urun, uc ayri arama niyeti Predicting Blood Glucose Fluctuations: Building a Transformer-based CGM Forecaster with PyTorch & InfluxDB Pre-task hooks: the one-line wire-up that gives your Hono agent shared memory Concurrent writes to a shared agent memory: what we shipped, what we punted on Building a Production Serverless URL Shortener on AWS — 21 Articles, Every Test Run for Real My CKA Cheat Sheet: Commands, Aliases, and Documentation Tricks I Used During the Exam Frontend Engineering Beyond Pixels: The Architecture of Digital Accessibility VLA or IL? A Controlled Dataset for Testing Whether Finetuning Turns Your VLA into a Fancy Imitation Learner Fabric AI Functions Turn GenAI Into a Data Pipeline Step Proximate vs Ultimate: The Bug Is Never Just the Bug The Treasure Hunt Engine That Broke Before the Traffic Did Reset Windows Update: The Definitive MSP Guide to RWU Your Resume Was Never Built for This AI Writes 46% of Code Now: What Snap's Layoffs Mean for Developers in 2026 From Chatbot to Agent — Tool Calling with NVIDIA NIM Fatigue and Fracture Mechanics: Why Parts Break Below Their Yield Strength I built a token-level debugger for comparing two LLMs VCP-Virtual Private Cloud Embedding sing-box in an iOS messenger to bypass Russian DPI (no VPN) Microsoft Copilot just exfiltrated a company's files. The attack was one email. Here's the mechanism. RAG 시스템 실전 구축 (v42)
AWS MCP Server Just Gave AI Agents Your Cloud Keys — Here's Why That Should Worry You
xu xu · 2026-05-26 · via DEV Community

You're reviewing your AWS bill. $14,000 this month — up from the usual $3,200. You trace it back to a Copilot session from last Tuesday where a dev asked the agent to "clean up old EC2 instances." It terminated 47 instances across three regions, including one that was handling a critical payment reconciliation job.

This is the future AWS MCP Server just handed you.

The Setup

AWS MCP Server went GA in May 2026, and the JP dev community (via a Qiita deep-dive by user hiyahyahyahyahoooi) published one of the first practical walkthroughs connecting it to GitHub Copilot's cloud agent mode. The promise: natural language cloud management. "Terminate unused instances." "Check S3 bucket policies." "Scale the ECS cluster." No console. No CLI. No terraform.

I tested it. Here's what the marketing didn't cover.

What AWS MCP Actually Does

The MCP (Model Context Protocol) server acts as a bridge between AI agents and AWS APIs. When Copilot Cloud Agent connects, it gets a structured toolset for interacting with your AWS environment — listing resources, describing configurations, modifying settings. In GA form, the scope has expanded significantly.

From the JP tutorial, the setup involves:

  1. Installing the AWS MCP Server package
  2. Configuring AWS credentials (IAM role with appropriate permissions)
  3. Connecting to Copilot's cloud agent mode
  4. Issuing natural language commands that translate to AWS API calls

The implementation detail that caught my eye: the tutorial uses a scoped IAM role approach. Good practice. But the agent's capability surface includes ec2:TerminateInstances, rds:DeleteDBInstance, and s3:DeleteBucket — operations that, once executed, are irreversible.

The Real Cost Nobody Talks About

In my local testing (M2 Max, 32GB RAM, sandbox AWS account), the Copilot agent correctly interpreted 8 out of 10 management commands. The 2 failures were edge cases around complex tag-based filtering.

But here's the number that matters: 0 out of 10 commands prompted for confirmation before execution.

That's not a bug. That's the intended behavior for "agentic" workflows. You give the agent a goal, the agent executes. The friction is gone.

And that's where I have to push back.

The Skeptical Take: Agentic Blast Radius

I've coined this term — Agentic Blast Radius — to describe the compounding risk when AI autonomy meets infrastructure permissions. The pattern is specific:

  1. You grant an AI agent AWS API access (necessary for the workflow)
  2. The agent interprets a vague or ambiguous instruction (unavoidable with natural language)
  3. The interpretation results in unintended infrastructure changes (probability > 0)
  4. Those changes cascade through dependencies you didn't model (inevitable at scale)

The Qiita article covers the happy path. I've seen enough production incidents to know: the happy path is not the default path.

In JP enterprise contexts, this matters even more. Japanese ops culture emphasizes gemba (現場 — on-site, hands-on) decision-making for infrastructure changes. The ritual of CLI commands, of manual verification, of "triple-check before execute" — that's not bureaucracy. That's the human circuit breaker that Agentic Blast Radius removes.

The Security Model Gap

Traditional AWS access requires human intent. Even with SSO and role assumption, there's a person in the loop. The MCP + Copilot integration fundamentally changes this:

  • The AI agent holds valid credentials
  • The AI agent can issue API calls without per-operation approval
  • The AI agent's "understanding" of your intent is probabilistic, not deterministic
  • Audit logs show "Copilot via MCP" but not the chain of reasoning that led to the action

I've seen this pattern play out in a different context: automated terraform pipelines that run on merge. The theory was "guardrails prevent mistakes." The practice was three production outages in six months before the team added manual approval gates back.

For MCP + Copilot, the question isn't "can we trust the AI?" It's "what's our recovery plan when the AI is wrong?" For EC2 termination, the answer is snapshots and backups. For RDS deletion, the answer is point-in-time recovery. But those recovery mechanisms assume you caught the error quickly. With agentic workflows, you might not notice until the morning standup.

What Gets Missed in Western Coverage

Western discourse on AI agents focuses on productivity gains. "Developers can move 3x faster." "Infrastructure management becomes accessible to non-specialists."

The JP coverage angle (as seen in the Qiita post) tends toward the genchi genbutsu (現物現場) approach: verify with your own eyes, understand the actual system before touching it. This isn't just cultural — it's a methodological hedge against the exact failure mode that Agentic Blast Radius enables.

The gap: English-language coverage celebrates the capability. Japanese-language coverage (particularly in the more cautious enterprise segments) asks "what happens when this goes wrong at 3 AM with $40k in hourly charges?"

Both questions are valid. The English discourse just isn't asking its question loudly enough.

The Teams This Is Actually Risky For

I'll be direct: if your team is under 10 engineers, you probably shouldn't use MCP + Copilot for write operations. Not because the technology is bad, but because your incident recovery capabilities are finite.

  • Single-person on-call rotation? High risk.
  • No AWS Config rules configured? High risk.
  • Production workloads mixed with dev environments? High risk.
  • No centralized billing alerts with per-service thresholds? Extreme risk.

For large orgs with mature governance: this might genuinely improve velocity. But "large org with mature governance" is a smaller population than the marketing suggests.

Forward-Looking Warning

By Q4 2026, I expect we'll see the first widely-reported incident where an AI agent (not necessarily Copilot) deleted cloud infrastructure worth six figures. When that happens, the vendor response will be "the customer had permissions to do that." Both statements will be true. Neither will be sufficient.

The pattern that protects you: treat MCP server permissions like you treat production database write credentials. Scoped, audited, and never handed to a system you don't fully understand.

Anti-Atrophy Survival Checklist

  1. Audit your IAM boundaries before enabling MCP — List every action your MCP role can perform. If you wouldn't hand those credentials to an intern, don't hand them to an AI.
  2. Set up cost anomaly alerts with sub-hourly granularity — Your current billing alerts probably check daily. AI agents can generate five-figure charges in minutes.
  3. Maintain a manual fallback procedure — Write down (yes, in writing) the steps to recover from unintended infrastructure changes. If you can't write it in 15 minutes, your recovery plan isn't actionable.
  4. Test in non-production first — Scope your MCP testing to a sandbox account for 30 days before touching anything real. Track every command the agent issues.
  5. Track your "authority delegation score" — For each AI tool you enable, rate how much autonomous authority you're granting: 1=fully reviewed, 5=fully delegated. If any tool hits a 4, schedule a review.

What's your take?

Has your team explored AI-native infrastructure management? What's the governance model that makes you comfortable — or have you decided the risk outweighs the velocity gain? I'd love to hear your framework for this.

Drop a comment below — I respond to every one.

Source: This analysis draws from a Qiita deep-dive (hayahyahyahyahoooi) on AWS MCP Server GA with Copilot integration — one of the first practical implementations documented in the JP dev community.


Based on Qiita article by hiyahyahyahyahoooi on AWS MCP Server GA and GitHub Copilot cloud agent integration

Discussion: Has your team explored AI-native infrastructure management? What's the governance model that makes you comfortable — or have you decided the risk outweighs the velocity gain?