惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
博客园 - 聂微东
Jina AI
Jina AI
Simon Willison's Weblog
Simon Willison's Weblog
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
博客园 - 叶小钗
P
Proofpoint News Feed
C
CXSECURITY Database RSS Feed - CXSecurity.com
N
Netflix TechBlog - Medium
WordPress大学
WordPress大学
B
Blog
D
Docker
MyScale Blog
MyScale Blog
The GitHub Blog
The GitHub Blog
S
Schneier on Security
G
Google Developers Blog
Microsoft Azure Blog
Microsoft Azure Blog
量子位
Security Latest
Security Latest
S
Secure Thoughts
T
Tor Project blog
E
Exploit-DB.com RSS Feed
D
DataBreaches.Net
N
News and Events Feed by Topic
B
Blog RSS Feed
IT之家
IT之家
N
News | PayPal Newsroom
Attack and Defense Labs
Attack and Defense Labs
C
Check Point Blog
V
V2EX
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Recorded Future
Recorded Future
Martin Fowler
Martin Fowler
S
SegmentFault 最新的问题
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
L
LangChain Blog
Hugging Face - Blog
Hugging Face - Blog
阮一峰的网络日志
阮一峰的网络日志
M
MIT News - Artificial intelligence
Last Week in AI
Last Week in AI
D
Darknet – Hacking Tools, Hacker News & Cyber Security
博客园_首页
The Hacker News
The Hacker News
The Register - Security
The Register - Security
T
Threat Research - Cisco Blogs
腾讯CDC
P
Privacy International News Feed
T
Troy Hunt's Blog
云风的 BLOG
云风的 BLOG
L
LINUX DO - 最新话题

The New Stack | DevOps, Open Source, and Cloud Native News

Agentic development hinges on verification. For cloud-native software, that is a runtime problem. AI agents need infrastructure: Why Europe’s regional cloud strategy matters Transform your AI coding agent into a deterministic Java Spring expert WeAreDevelopers is coming to the US to give unsung developers a bigger voice Cleaner AI training data, fewer bugs: Sonar’s SonarSweep explained Observability overload is drowning engineers Google’s DiffusionGemma is 4x faster than its other Gemma models Fable 5: Guardrails and burn rate are annoying users, who say it’s still better than Opus 4.8 The Anthropic leader who built Claude Code says he ditched prompting — now he just writes loops. AWS can now mathematically prove your VMs are isolated Microsoft pulled 73 GitHub repos after malware attack — but still won’t say who’s compromised Databricks wants to kill the “email me a file” problem for AI agent skills Ramp bets forward deployed engineers can do what off-the-shelf finance AI can’t Git real: AI agents aren’t just for solo developers anymore Anthropic launches Claude Mythos/Fable 5, but you better try it soon This AI agent startup ditched Anthropic for DeepSeek — and says it’s saving millions When your data model is the bottleneck: lessons from Medium’s feature store How long before we stop reading the code? The tokenmaxxing party is over, and Revenium is mopping up How AI is solving the memory crunch it created Microsoft’s pitch to enterprises: Ditch Azure Repos for GitHub, despite its rocky reliability record Claude Code’s biggest upgrade yet ran 5 agents at once — here’s what happened Why Anthropic just doubled Claude Cowork limits at no charge For years, Apache Cassandra handed this work to your team — 6.0 takes it back “A dangerous combination”: The 2 factors that can “corrupt” AI agent workflows With Foundry, Microsoft bets the enterprise AI battle is about reliability, not capability Microsoft unlocks Visual Studio for developers left behind by its own AI AI teams now deploy 1,000 times a month. Your pipeline wasn’t built for that. Microsoft just made the agent runtime free — and kept everything around it “Whoever builds the most joyous product wins”: The agent war begins Netlify CTO Dana Lawson: Writing code is no longer the job From Jupyter Notebook to production: How to ship AI systems that actually work OpenClaw used Gavriel Cohen’s code and exposed the AI Agent accountability problem Replit shows how vibe coding is getting its own financial stack — and a path to profit Cloudflare aqui-hires VoidZero: Did a piece of the open web just stabilize, or become more brittle? Cursor cuts prices and adds enterprise spend controls amid “tokenomics” reckoning Google Gemma 4 12B nearly matches 26B benchmarks — and runs on your laptop Snowflake thinks it knows what’s really slowing developers down Autonomous agents have met their biggest challenge yet: The database. Why agentic AI makes the ops platform the most important layer in the enterprise How to dramatically improve enterprise security alert tuning to battle cyberattacks Why the need for humans won’t disappear in the age of autonomous databases How to secure Kubernetes in the age of AI workloads Asana says its new AI “chief of staff” turns your Slack chaos into trackable work Nvidia’s best model is now live Mate Security’s Asaf Wiener made every backend engineer a model router. He’s right to. The AI cost crisis finally has a watchdog — just not the companies causing it How to get operational data off the factory floor without creating an IT breach Why CPUs still matter in the age of AI agents Rayfin: Microsoft’s answer to the gap between vibe coding and enterprise production Microsoft bets the enterprise AI race will be won on data context, not model power “A successful attack could be catastrophic”: Anthropic gives more groups access to Claude Mythos How GitHub plans to win developers back Microsoft really, really, really wants developers to love Windows again With Intelligent Terminal, Microsoft is reinventing the Windows terminal Microsoft debuts “Scout” at Build, a new personal agent for work OpenAI’s Codex adds new tools — Sites, Annotations, more plugins — for knowledge workers GitHub Copilot’s usage-based billing is live: Here’s what you need to know OpenAI, Anthropic, Google, Amazon, and xAI all fail on type of attack, study finds JetBrains open-sources Mellum2 to go where Claude Code can’t Claude Code vs. Cursor vs. Codex vs. Antigravity — six months in This coding agent doesn’t want your feedback — it ships without it “Blowing things up”: The one move vendors got wrong on AI agents At Sapphire, SAP makes the case that enterprise AI is a context problem Gavriel Cohen found his own code inside OpenClaw, so he walked away AI retrieval at scale is becoming a systems problem, not a tooling problem The DIY platform trap that’s burning out engineering teams I tested Cursor’s new Jira integration and it’s 5 stars, no notes. Here’s why. Why GPT-5.4, Claude, and Gemini can’t agree on basic, real-world facts Replit’s vibe coding platform just got a Visa-backed identity layer for AI agents — and it changes how agents spend money Opus 4.8 Made Claude Smarter. Token Discipline Got Urgent. Why Linux creator Linus Torvalds gets angry hearing “99% of code is AI” Vendor neutrality isn’t magic: A hard look at the OpenTelemetry ecosystem “The AI did it” won’t save you when EU regulators come knocking The fix for soaring AI cloud bills exists — so why won’t we trust it? AI is shipping code faster than security was built to handle Why AWS scrapped OpenSearch’s architecture to chase agent workloads Claude Opus 4.8 is here: effort controls, dynamic workflows, cheaper fast mode, better honesty, less deception Percona celebrates 20th birthday with new foundation — and a goat cake Why OpenAI and Anthropic are hiring forward deployed engineer teams Claw-style AI agents are coming to the enterprise. The governance infrastructure is still catching up. The agentic identity crisis: Why your security isn’t ready for the AI revolution Debugging the undebuggable: building observability into probabilistic AI systems Snowflake commits $6B to AWS as it pushes deeper into AI Why MotherDuck refuses to fork DuckDB Researcher “gave Claude Code ‘ADHD’… and it thinks 2x better now.” Outside experts want more proof. “There is no accountability”: AI coding agents are installing packages no one owns “Tokenmaxxing is real, expensive & it’s spreading”: AI budgets are exploding With Google’s debut, the most important AI agent feature is now the most boring one Why AI agents need a Context Lake Google ranks the best AI for building Android apps, and the winner isn’t Gemini Google pushes Pro, Ultra, and free users from open-source Gemini CLI to closed-source Antigravity CLI The reason enterprise outages almost never start where ops teams think Taming the agentic influx: a blueprint for AI business observability How the AC/DC framework helps teams govern AI coding agents GitLab 19.0 trades its string section for a full DevSecOps orchestra Who’s monitoring the agents? How Jaeger hit 8.6× compression on 10 million spans with ClickHouse What ClickHouse learned from a year of coding with AI agents OpenClaw passed 300,000 GitHub stars. Then Google launched Spark.
PagerDuty's CAIO says most AI incident tools are missing a critical layer
João Freitas · 2026-06-14 · via The New Stack | DevOps, Open Source, and Cloud Native News

AI is empowering software teams to ship code faster than ever. Given that an average of 70% of incidents stem directly from modifications and updates to live systems, higher velocity can also lead to more frequent incidents. 

As incident rates increase, we need to evolve from the traditional response approaches that were never designed for this speed.

“As incident rates increase, we need to evolve from the traditional response approaches that were never designed for this speed.”

The solution is to build an AI ecosystem that connects tools and draws on proprietary operational data to help teams diagnose, remediate, and even prevent incidents before they spiral out of control. Such a system requires a standardized way for AI tools to exchange information and perform actions, and the Model Context Protocol (MCP) has emerged as the leading standard for now.

However, simply having MCP connectors in place doesn’t guarantee success. MCP, by itself, is a standard protocol that allows agents to use various tools and access data resources. To do useful work in incident response, teams need AI agents that have access to the right data, can adapt to their incident response processes, and can leverage both short- and long-term memory. 

AI agents need to understand which data is relevant, how systems relate to one another, and which actions are safe to take. If teams can get the harness right, they will have agents that can meaningfully accelerate incident management. 

“AI agents need to understand which data is relevant, how systems relate to one another, and which actions are safe to take.”

In the case of incident management, AI agents will benefit from an agent harness that includes access to data points such as code changes, logs, metrics, events, traces, alerts, cloud infrastructure, past incidents and respective reviews, runbooks, service topology and dependencies, and on-call team information, as well as knowing the best person to respond to the issue at hand, amongst other items. 

Together, these assets provide the context necessary for the agent to triage, diagnose, and remediate the issue, accelerating incident response. Eventually, these signals can help prevent incidents before they occur, as common patterns emerge throughout the software development lifecycle.  

A practical use case is to use coding assistants, such as Claude Code or GitHub Copilot, to assess the risk of code changes before they get to production. Using agent skills (or similar) that leverage existing MCPs, coding assistants can leverage the incident management harness to deliver contextual risk scoring directly to teams as they work. The assistant can access weeks of historical incident data to identify common patterns that led to issues, previous incidents on the same service or adjacent services, and the target’s stability. 

The resulting score and recommendations help developers — or other AI agents — decide whether the code requires further improvements, additional verification, or, for example, that it can’t be pushed to production because an incident is taking place. 

Another important part of an agentic harness for incident management is the memory layer. Teams would want to enrich the context in meaningful ways and have the agent remember what happened during past incidents, what the distributed system and respective infrastructure look like, and specific service information. However, they don’t want to poison the context or fill it with irrelevant data. 

Thus, they need to create the appropriate structure for the agent to navigate and populate its memory with what is relevant to the ongoing investigation. Often, during an investigation, hypotheses change as new facts emerge from monitoring tools, customer tickets, or the experts’ brainstorming, so the memory layer needs to be able to create new semantic relationships, invalidate facts, and learn from new information.

Harnessing the potential

Even with the best tools in place, it’s not always possible to prevent incidents. However, it is possible for AI agents, with the right harness in place, to be the first to investigate an issue and escalate to a human, depending on their success during triage, diagnosis, and remediation, how far the team will trust the agent to go, and the severity of the issue. 

At a minimum, teams can provide incident responders with detailed context and a potential diagnosis to accelerate response and remediation. Eventually, for less critical services, they may trust the AI agent to act, use human escalations only when confidence is low, and avoid notifications in the middle of the night.

“For trust to take place, the agent’s harness needs to provide the right level of transparency and control.”

For trust to take place, the agent’s harness needs to provide the right level of transparency and control. This includes the user being able to configure which actions the agent can perform, which actions are forbidden, and in which cases the agent should request human approval. Additionally, when scaling to large enterprises with multiple teams and varying team permissions, they want the agent to inherit the permissions and privileges of those teams to avoid access and answers that include unauthorized data.

Toward continuous improvement

The real opportunity goes beyond faster incident management to building an AI agent harness that gets smarter over time. By combining shared agent memory, runbooks, incident history, and post-incident learning, teams can create agents that continuously improve their ability to prevent and resolve incidents. The organizations that start building on top of that harness now will be the ones with the edge tomorrow.

YOUTUBE.COM/THENEWSTACK

Tech moves fast, don't miss an episode. Subscribe to our YouTube channel to stream all our podcasts, interviews, demos, and more.

Created with Sketch.