惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

博客园 - 司徒正美
D
Darknet – Hacking Tools, Hacker News & Cyber Security
M
MIT News - Artificial intelligence
腾讯CDC
IT之家
IT之家
Microsoft Azure Blog
Microsoft Azure Blog
M
Microsoft Research Blog - Microsoft Research
阮一峰的网络日志
阮一峰的网络日志
H
Help Net Security
L
LangChain Blog
G
Google Developers Blog
Stack Overflow Blog
Stack Overflow Blog
人人都是产品经理
人人都是产品经理
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
博客园 - 【当耐特】
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
U
Unit 42
Recent Announcements
Recent Announcements
S
SegmentFault 最新的问题
大猫的无限游戏
大猫的无限游戏
博客园 - Franky
T
The Blog of Author Tim Ferriss
罗磊的独立博客
宝玉的分享
宝玉的分享
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
雷峰网
雷峰网
D
DataBreaches.Net
爱范儿
爱范儿
Schneier on Security
Schneier on Security
P
Palo Alto Networks Blog
Spread Privacy
Spread Privacy
Hugging Face - Blog
Hugging Face - Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
K
Kaspersky official blog
P
Privacy & Cybersecurity Law Blog
博客园_首页
T
Threat Research - Cisco Blogs
I
InfoQ
有赞技术团队
有赞技术团队
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
Recorded Future
Recorded Future
量子位
H
Hackread – Cybersecurity News, Data Breaches, AI and More
GbyAI
GbyAI
Cyberwarzone
Cyberwarzone
B
Blog
C
Check Point Blog
P
Proofpoint News Feed
S
Securelist
A
Arctic Wolf

DEV Community

Export all of Andoid apps GitHub Copilot Workspace Review: Task-Level AI Coding in the Browser Vibe Coding Is Fun Until Production NVIDIA CUTLASS: High-Performance CUDA Templates for AI Linear Algebra The creator told 2,000 people to ship in 30 days. Nobody built the structure for it. EC2 Beginner Guide: Launch Your First AWS Instance JaisCloud — A Free, Single-Binary AWS Emulator in Go Cursor IDE Review: What Makes It a Genuinely Different AI Code Editor How to Debug Complex Regex Patterns Offline Without Leaking Proprietary Data The Moment the Default Runtime Became the Payload HoneyCloud — Reviving My Final Year Cybersecurity Project Rust Kernel Modules, Ready-to-Ship: A cargo-generate Template with Tests, CI, and Zero-Panic… Genetec Security Center 5.14.0.0: What's Coming, What's Already Here, and Why I'm Still Telling Clients to Wait Day 9 - Sparse embedding continued - RAG I coded an Air Hockey game where a local SLM hacks the DOM to cheat (and trash-talks you) 🤖🏓 Docker Desktop Alternatives: OrbStack vs Colima vs Rancher Desktop The Veltrix Treasure-Hunt Engine Litmus Test Quantizing Gemma 4 on Mac with llama.cpp How I Built an AI-Powered Google Maps Scraper for Lead Generation Readsb ADS-B Aircraft Local State Archive The Veltrix Event Engine Blew Up Because We Trusted the Defaults MiniScript Weekly News — May 27, 2026 The benchmark that made me change my mind about Jakarta EE in 2026 El benchmark que me hizo cambiar de opinión sobre Jakarta EE en 2026 Building an Offline-First Bushfire Response Platform With Hermes Agent The Golang Trinity: Functions, Methods, Interfaces gitwink — a read-only tray git glance for the AI-agent era I built a free PHP & Laravel tutorial site — 49 lessons from scratch Sniffing Modbus Traffic with 5 Lines of Python (And Why It Should Scare Your OT Team) Why Hytale Treasure Hunts Explode In Production (And How We Fixed It) XGroundControlStation The Worst Time to Quit Software Engineering Might Be Right Now Closiq Discord Agent: An AI Customer Support Monolith 🚀 Why Does Using an ORM Decrease Database Performance? An Experience... Document photos are a tiny image-processing problem with sharp edges Meet the G2 Nano: A 1GHz Dev Board Built for Robotics The 34x Pricing Gap: Why AI Model Selection in 2026 Is a Math Problem, Not a Loyalty Problem I Built an Open-Source Multi-Agent Fact-Checker — Here's How It Works The Sovereign Privacy Illusion: Why GDPR Compliance Doesn’t Equal Data Control Mastering Azure Entra ID: A Hands-On Guide to User Management and Privileged Roles Solana Transactions Are Not What I Expected (Coming From EVM) How We Broke the Hytale Treasure Hunt Engine (And Fixed It at 3 AM) AI Memory Is Broken. Here's What's Finally Starting to Fix It State Management in Production Flutter Apps: What Actually Held Up at Scale The Role of QA in the New AI SDLC We spent 6 months feeding our compliance data to a major cloud AI. Here's what we got back. open-source coding agents need maintainers, not just models Oracle ORA-00018 오류 원인과 해결 방법 완벽 가이드 I built agmsg so Claude Code and Codex could stop using me as a copy-paste relay Gmail ate my formatting. So I built PasteClean. Amazon Bedrock AgentCore Payments: The Spending Limit Is the Product I Should Have Put Events in the Same Database as the Aggregate Root—Heres What Happened The SilentRecon Agent Loop Architecture: How We Build AI That Doesn’t Stall How I Built a Dark Cinematic Restaurant Landing Page Template and Listed It For Sale Claude Code Slash Commands You Should Know (I wasn't either) "7 Free GitHub Repos Every Beginner Should Star Right Now" Blockless Scope: JavaScript Shenanigans 🧠 Plano Físico vs Plano Funcional en AWS Networking: cómo los arquitectos senior resuelven lo que otros no ven. I Accidentally Built an AI Employee Out of Scripts and Bad Sleep Habits Stop Chasing Shiny Tools: A Minimalist AI Stack That Actually Makes You Money Day 7 of trying to get 20 paying customers in 40 days. Currently at 0. How We Blew Up Our Event Pipeline at 3 AM Because the Treasure Hunt Engine Had No Clear Operator Bounds Autograder - Finish-Up-A-Thon Federico@Cursor,Dimma@Fireworks深入探讨Composer2技术 Hermes Memory Providers: A Complete Breakdown for New Users How I Built a Cinematic Scroll Experience with GSAP and ScrollTrigger I Built a Free Spelling Bee Solver and Analysis Tool — Here's What It Does Stop Over-Engineering Your UI: Material 3 for Blazor (Without the JS State Management Nightmare) I just created the best web FullStack framework in Rust language: the Rullst! I did with the help of AI, but my tokens are over, can you help me? ASF Project Spotlight: Apache Iceberg babelForge TIL 5/27/2026 Broken Software I built a CLI that scaffolds agentic workflows for Claude Code Testing a LiveView App with Playwright: Fixing Navigation Timeouts I Turned on Agent Tracing for 30 Days. 4 Hidden Bottlenecks Were Eating 47% of My Tokens. How I monitor CVEs daily with a 50-line Python script Apache Geode 2.0, Part II: Rebuilding a Distributed System for the Modern Java Era HiTerm: A Free Remote Terminal for AI Coding Agents (Claude Code, Codex, Gemini CLI) 5 Free Online Tools You Didn't Know About — No Signup How I Finished My AI Code Reviewer Using GitHub Copilot The auth_rls_initplan linter has a blind spot: SECURITY DEFINER bodies Upgrading OtakuShelf to JHipster 9.1.0 Polishing the catalog (and reading the agent's receipts) Adding the anime side without holding my breath Pairing up: scaffolding OtakuShelf with an agent State.js Tutorial: Creating Reusable UI Components with Pure CSS Reactivity Deskbrid: A Linux Desktop HAL Built Entirely by AI Agents The Day the Treasure Hunt Engine Drowned in 300 ms Queries How I Built a Marriage Biodata PDF Generator in Next.js Supply Chain & AI Security: GlassWorm Takedown, Prompt Injection RCE, Ubuntu 24 Hardening AI Agent Production Challenges: Failures, Starlette Vulnerability, Code Gen SQLite Bugfix, PostgreSQL Migrations & Filesystem API Paradigm CUDA 13.3 Lands, AI Writes Blackwell Kernels, & FP4 VRAM Optimization for LLMs How I Found a Fake Job Assessment Repo Hiding Malware Inside SVG Files Building the Pipes: Core Data Engineering Concepts Explained Ultimate 1-Minute Xray/3x-ui Setup: VLESS, Hysteria2, Caddy Self-Steal & Smart Outbounds in One Script [Boost] Vamos falar de IA. Mas de outro lugar. Stop Duplicating Code! Is "Integration Hell" Just Laziness or a Systemic Architecture Failure?
Azure Container Apps Express: The Agent-First Platform You've Been Waiting For
Brian Spann · 2026-05-28 · via DEV Community

I've been running AI workloads on Azure Container Apps for over a year. Every time I spin up a new agent backend, the ritual is the same: create an environment, configure networking, set scaling rules, wire up health probes, then deploy the actual container. For a prototype agent that might live for a week, that's too much ceremony for what you get.

ACA Express, which hit public preview in May 2026, kills most of that ceremony. And a separate but related announcement, Docker Compose for Agents, brings MCP gateways and model serving to standard ACA environments. They solve different problems and run on different infrastructure, but together they cover the full spectrum of agent deployment on Azure.

Let me break down both.

ACA Express: What It Actually Is

Express is a new environment tier within Azure Container Apps. You bring a container image. Express handles provisioning, HTTPS, scaling (including scale-from-zero with subsecond cold starts), and resource allocation. No environment to manually provision through the portal. No networking to configure. No scaling rules to write.

Under the hood, Express is built on ACA Sandboxes, a platform primitive that uses prewarmed pools to deliver that subsecond startup. This isn't the standard ACA cold-start experience with a fresh coat of paint. It's a different architecture.

The tradeoffs are real. Express is HTTP workloads only, consumption CPU only. No GPU. No VNet integration. No Dapr. No service discovery between apps. No managed identity at runtime. No health probes. If you need any of those, standard ACA environments are still there. But for stateless HTTP agent backends, Express is dramatically faster to deploy and cheaper to run.

Here's what it takes to get a container running:

# Create an express environment
az containerapp env create \
  --name my-express-env \
  --resource-group rg-my-agents \
  --environment-mode express \
  --logs-destination none

# Deploy your app
az containerapp create \
  --name my-agent-api \
  --resource-group rg-my-agents \
  --environment my-express-env \
  --image mcr.microsoft.com/k8se/quickstart:latest \
  --target-port 80 \
  --ingress external \
  --min-replicas 0 \
  --max-replicas 1

Enter fullscreen mode Exit fullscreen mode

Your app is running in seconds. Not minutes. Seconds.

Express also has its own portal experience at containerapps.azure.com, separate from the Azure portal. If you're using the portal, you don't even need to create the environment yourself. It handles that automatically.

Why "Agent-First" Is the Right Framing

Microsoft is explicitly positioning Express for two audiences: developers who want to ship fast, and AI agents that deploy endpoints on demand. That second audience is the interesting one.

Think about how modern agent architectures work. An orchestrator spins up tool-use APIs, runs them for the duration of a task, and tears them down. The infrastructure needs to provision fast, scale from zero, and cost nothing when idle. That's exactly the Express model.

The platform is designed for MCP servers, tool-use endpoints, multi-step workflow APIs, and human-in-the-loop UIs that agents spin up dynamically. Scale-from-zero with subsecond cold starts means you're not paying for agent backends that aren't actively serving requests. And when a request does come in, the agent is ready almost instantly instead of waiting through a cold start.

Docker Compose for Agents: A Separate (and Complementary) Feature

Here's where a lot of early coverage got confused, and where I got it wrong in my first draft of this post. Docker Compose for Agents is not an Express feature. It deploys to standard ACA environments with workload profiles, not to Express.

Why? Because Compose for Agents supports GPU model serving, MCP gateway containers, sidecar processes, and multi-service stacks. All of those require capabilities that Express doesn't have (workload profiles, service discovery, sidecars). Different tool for a different job.

What Compose for Agents does is let you take the same compose.yml you use locally for development and deploy it directly to ACA. The CLI translates compose services into Container Apps resources automatically.

Here's what a compose file looks like for an agent stack:

services:
  my-agent-app:
    build: .
    ports:
      - "8080:8080"
    environment:
      - MCP_GATEWAY_URL=${MCP_GATEWAY_URL}

  mcp-gateway:
    image: docker/mcp-gateway
    x-azure-deployment:
      image: acateam.azurecr.io/preview-ai-compose/mcp-gateway:latest

models:
  gemma:
    model: ai/gemma3-qat
    x-azure-deployment:
      workloadProfiles:
        workloadProfileType: Consumption-GPU-NC8as-T4

Enter fullscreen mode Exit fullscreen mode

The x-azure-deployment directive is the bridge between local and cloud. Docker ignores it locally. ACA uses it during deployment. Same file, both environments.

What the CLI creates behind the scenes:

Your agent app as a Container App with ingress. An MCP gateway running as its own Container App with managed identity, dynamically managing MCP tool containers. Model serving via Docker's model runner on serverless GPU. The MCP gateway handles stdio-to-SSE translation, so your MCP servers run as standard Container Apps without modification.

To deploy it:

# Install the preview CLI extension
az extension remove --name containerapp
az extension add --source "<preview-extension-url>" --yes

# Deploy your compose file to a standard ACA environment
az containerapp compose create \
  --compose-file-path compose.yml \
  --resource-group rg-my-agents \
  --environment my-standard-env

Enter fullscreen mode Exit fullscreen mode

Notice that --environment flag. This deploys to a standard ACA environment, not Express. That's the distinction.

Where Each Fits in the Azure AI Stack

The Azure AI hosting landscape has gotten crowded. Here's how I think about the options as someone who's deployed on most of them:

Azure AI Foundry is for when you want managed model endpoints with built-in safety, content filtering, and enterprise governance. You're consuming models, not hosting infrastructure.

ACA Standard is for when you need GPU workloads (self-hosted Ollama, vLLM), microservices with Dapr, VNet isolation, or any enterprise feature that Express doesn't have yet. This is also where Docker Compose for Agents deploys.

ACA Express is for fast, cheap, stateless agent backends. Prototypes, MCP servers, tool-use APIs, webhook handlers, agent orchestrators that don't need GPU compute.

ACA Dynamic Sessions is for sandboxed code execution for AI-generated code. Hyper-V isolated, millisecond provisioning, MCP-integrated.

Express isn't replacing anything. It's filling the gap for lightweight agent infrastructure that's too simple for standard ACA but too complex for a serverless function.

What's Missing in Express (For Now)

This is a public preview, and the supported feature list reflects that. The "No" column is long:

No secrets management (no Key Vault integration). No managed identity at app runtime. No health probes. No custom domains or managed certificates. No VNet integration. No CORS, session affinity, or sidecar containers. No OpenTelemetry. No autoscaling rules (KEDA). Region-limited to West Central US and East Asia.

For production agent backends, these gaps matter. No managed identity means you're passing credentials through environment variables. No health probes means you're trusting the platform's defaults. No secrets means API keys sit in plain text config.

But for prototypes, internal tools, and agent backends in active development? These limitations are acceptable tradeoffs for the provisioning speed and cost model. And Microsoft is shipping features on what they describe as a "rapid cadence" through the preview period.

When to Use Which

If you're building a lightweight agent backend, an MCP server, or a tool-use API that handles HTTP requests and doesn't need GPU, go Express. You'll have a running endpoint in seconds with zero infrastructure decisions.

If you're building a full agent stack with model serving, an MCP gateway coordinating multiple tool containers, and GPU workloads, use Docker Compose for Agents on standard ACA. The compose file gives you local-to-cloud parity and the workload profiles give you the compute you need.

If you need both, use both. Express for the lightweight endpoints, standard ACA for the heavy lifting. They run on the same platform and can coexist in the same resource group.