惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
V
Visual Studio Blog
P
Privacy & Cybersecurity Law Blog
博客园 - 【当耐特】
爱范儿
爱范儿
T
The Exploit Database - CXSecurity.com
小众软件
小众软件
博客园_首页
Scott Helme
Scott Helme
G
GRAHAM CLULEY
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
有赞技术团队
有赞技术团队
T
Threat Research - Cisco Blogs
NISL@THU
NISL@THU
P
Privacy International News Feed
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
月光博客
月光博客
博客园 - Franky
博客园 - 叶小钗
D
Darknet – Hacking Tools, Hacker News & Cyber Security
S
SegmentFault 最新的问题
T
Threatpost
Recent Announcements
Recent Announcements
Cisco Talos Blog
Cisco Talos Blog
B
Blog
IT之家
IT之家
Security Latest
Security Latest
Apple Machine Learning Research
Apple Machine Learning Research
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
S
Schneier on Security
P
Palo Alto Networks Blog
L
Lohrmann on Cybersecurity
Simon Willison's Weblog
Simon Willison's Weblog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
K
Kaspersky official blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
L
LINUX DO - 最新话题
T
Tailwind CSS Blog
Stack Overflow Blog
Stack Overflow Blog
P
Proofpoint News Feed
Help Net Security
Help Net Security
T
Tenable Blog
N
Netflix TechBlog - Medium
P
Proofpoint News Feed
V
V2EX - 技术
MyScale Blog
MyScale Blog
罗磊的独立博客
C
Check Point Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More

WhatIs

Hims & Hers launches AI agent for lab results Twilio revamps, updates customer engagement platform Most patients find appointment scheduling, billing overly complex Teradata's latest targets putting agentic AI into production AHA, Joint Commission launch cyber resilience program Tableau in transition as AI forces BI vendors to evolve California hospitals sue Elevance over out-of-network penalty CMS Health Tech Ecosystem adds electronic prior auth pledge Atlassian MCP updates take aim at AI token usage Leapfrog: Hospitals improved in 17 patient safety measures United promises another 30% cut to prior auths in 2026 AI outperforms docs on clinical reasoning, but not ready for solo work ServiceNow's Autonomous CRM takes aim at Salesforce ServiceNow reintroduces itself as an AI 'security company' New Tableau leader talks vendor's evolution in era of AI Deloitte warns of a "bubble effect" caused by the GLP-1 boom Tableau repositions for AI, unveils new knowledge layer IBM Bob AI coding agent ships, HashiCorp AIOps previewed DOJ forms West Coast Strike Force to stop healthcare fraud Most people benefit from the ACA's free preventive services SAP acquisitions of Dremio, Prior Labs target AI development Bridging the gap: Legacy tools gain enterprise AI support Amazon Connect Talent: AWS enters AI interviewing market AHA, West Health launch health tech adoption initiative How are states preparing for Medicaid work requirements? Medical device security improves, but cyberattacks remain pervasive Weekly news roundup: Musk vs. Altman, Google’s Pentagon AI deal, China and EU hit Meta Skin substitute spending driven by patients, products, prices Clinical AI company Aidoc snags $150M in new funding Qlik's Capone departs after eight years as CEO OIG: CMS paid millions in improper virtual care payments FDA moves toward real-time review of clinical trial data FQHCs in low-income neighborhoods have lower cancer screening rates Solving quantum computing's longstanding no-cloning problem Qdrant boosts performance, reliability to meet AI needs Racial health disparities still impact U.S. as policy changes loom Agentforce Operations tackles workflow orchestration Boehringer's dual agonist obesity drug spurs up to 16.6% weight loss Legacy architecture, awareness gaps stifle microsegmentation adoption in healthcare AMA alerts officials of health plans' No Surprises Act abuse Latest SAS capabilities focus on fostering reliable AI AHA calls for TEFCA individual access SOP delay, citing patient privacy concerns Actian targets secure, compliant AI with new vector database Payers promise standardized electronic prior auths MIT EmTech: 2026 is the year AI goes to work As Claude Design debuts, Adobe users -- and buyers -- shrug GoodData joins agentic AI development mix with Agent Builder Comfort, affordability top drivers of digital mental health tool use CMS accelerates Medicare coverage for breakthrough medical devices Weekly news roundup: Tim Cook exits Apple, Meta layoffs intensify and Anthropic investigates Claude Merck inks $1 billion AI drug development deal with Google Cloud OCR settles four HIPAA investigations, prioritizes risk analysis OpenAI launches ChatGPT for Clinicians 90% of patients re-check AI chatbot health info with other sources Gemini Enterprise Agent Platform adds 'connective tissue' to Vertex AI AMA urges greater oversight of AI mental health chatbots CMS benches BALANCE Model for Medicare Former ransomware negotiator pleads guilty to BlackCat conspiracy New Google TPUs multiply AI infrastructure efficiency When brand-name drugs need a prior auth, brace for delays Google unveils data cloud purpose built for agentic AI Snowflake updates further goal of being control pane for AI UnitedHealthcare eliminates prior authorization for rural providers Yelp launches appointment scheduling button from Zocdoc Oracle takes steps toward CMS Health Tech Ecosystem goals OpenAI debuts AI model GPT-Rosalind to speed up drug discovery Which patient care access barriers deter cancer screening? Redis unveils Feature Form to improve AI, ML workloads Adobe defines its AI-powered customer experience platform How to escape agentification pilot purgatory for scalable AI New HSCC guidance tackles third-party AI risk Data quality, fast failures and quick wins key to AI success Stop Overpaying for Storage: A FinOps Guide for CIOs AWS launches AI-driven tool to speed up early-stage antibody discovery AMA: Clinician burnout in specialties persists as overall rates drop Mental health parity remains elusive in 43 states Before revenue cycle AI, payers and providers need to get along Edge and physical AI poised to upend enterprise networks Salesforce releases Agentforce dev tools, updates Agent Fabric Cyberattack continues to disrupt operations at Signature Healthcare FDA reminds sponsors, researchers to report clinical trial results AI arms race leading to prior auth problems, reimbursement cuts Abridge dives deeper into clinical decision support with NEJM, AMA AI provider search is here. How can health orgs stay visible? Judge dismisses No Surprises Act lawsuit against HaloMD What IT leaders should know from Nutanix .NEXT HubSpot builds answer engine optimization into its platform Sutter Health, MemorialCare face class action lawsuit over AI scribe use Latest Qlik tools target helping users achieve AI goals CMS taps Verily, Noom, 150+ others to participate in ACCESS model Starburst intros AI assistant to boost analysis, exploration Payers face faster prior authorization approvals under CMS proposal Lenovo deploys AI data agent for marketing, UX, e-commerce Cisco Galileo buy reflects blurring lines in AI observability CMS proposes 2.4% IPPS bump, joint replacement model expansion Patients unsure what to trust amid health information overload Nutanix expands flexibility by building out external storage Amazon Pharmacy adds Lilly's obesity pill with same-day delivery ServiceNow AI pricing change takes on enterprise ROI struggles Oracle's Sudha Raghavan on AI's infrastructure renaissance
Datadog shops: AI incident management needs platform engineers | TechTarget
TechTarget · 2026-06-16 · via WhatIs

AI-driven incident management will only be as good as the human-designed platforms they run on, according to conference presenters at Datadog DASH.

AI incident management agents like those touted by Datadog at its recent DASH conference aren't magic. Behind every successful IT automation is a human platform engineer, according to the presenters at last week's DASH breakout sessions.

Similar to attendees at Cisco Live earlier this month, the DevOps and platform engineers whose sessions were livestreamed on June 10 are taking a cautious approach to adopting autonomous AI agents for incident management. While some engineers are experimenting with Datadog AI tools, their focus remains on building a clean platform foundation for production reliability before expanding agentic automation.

Krafton Inc., makers of the PUBG: Battlegrounds game played by millions of users worldwide, has spent nine years refining its incident management process, according to Junghun Kim, DevOps engineer at the Seoul-based video game publisher, in a DASH breakout session presentation.

Part of that refinement has included consolidating its original set of five observability tools into a single platform with Datadog, but Kim said changes to how application and SRE teams handled incident workflows were more important.

"The incident platform is not some kind of in-house product or internal platform tool," Kim said. "It's more like a mental model and an ecosystem around the incident response."

That model has three foundational layers before AI even enters the picture,  let alone autonomous AI agents, Kim said,.

"We want to reach that goal someday, but today AI can still make wrong judgments during incidents, and if it takes critical action that's hard to roll back, the risk is too high for production reliability," Kim said. "So, for now, our focus is on AI assistance. AI doesn't replace the human responder, and it helps responders work faster on top of the system we already built, which includes [context data about] code ownership, [incident] severity, and guardrails."

This result came from the response platform that we built, not some sort of AI magic.
Junghun Kim, DevOps engineer, Krafton Inc.

Krafton uses Datadog's MCP server and newly released Pup CLI to give coding agents access to its incident context data in Datadog, Kubernetes, Atlassian Jira and Slack. They assist human operators with tasks including cross-source debugging for root-cause analysis, drafting incident postmortems and runbooks, and documenting weekly on-call handoffs. The company's total incident volume has shrunk from 107 in 2024 to 24 so far in 2026. Time to detect an incident has dropped from 8.8 minutes on average to 1.6 minutes, and mean time to repair has dropped from 53.5 minutes to 10.3 minutes.

"The trend is clear across every number: earlier detection, faster repair, and less user impact," Kim said. "This result came from the response platform that we built, not some sort of AI magic."

Swish: No substitute for platform foundations

Platform engineers from Getswish AB, makers of the Swish Swedish payment processing app, told a similar story of incident management improvement in another livestreamed session. These engineers spearheaded the creation of a new internal incident management platform for the company after a 2021 outage was reported in the press before the company's IT management outsourcing provider was aware of it.

By late 2024, another high-profile incident made national news but was detected and resolved by an SRE team within five hours. Getswish is now working on extending observability and incident management practices to its financial transaction partners.

Between those two points came a comprehensive effort to rearchitect the company's code development process using GitOps and incident management practices using Google's seminal manual on SRE, said Jonas Cronholm-Lundin, head of platform at Getswish AB, during the June 10 DASH session.

"The SRE book that Google published [10] years ago, has a prioritization triangle," Cronholm-Lundin said, referring to the book's Service Reliability Hierarchy, represented by a pyramid, with monitoring at the base "If you have problems with reliability in your system, where do you start? … Adding on release procedures or performance testing and whatnot, will not do any good if you don't build your base in the triangle first. You need to actually understand how production runs and how it works. That is where everything like this needs to start."

The speakers also emphasized the importance of laying platform groundwork before AI agents can be effective. The company's revamped incident management process also now includes runbooks structured according to the OODA loop approach to decision making, Cronholm-Lundin said.

"When agents are coming in, like Bits [AI] SRE, and so on, if you have gathered your post-mortem reports [and] curated good runbooks, that is excellent for your agent to consume in the future when you have future incidents, and also for onboarding purposes," he said. "So it's super important to do the groundwork of actually bootstrapping your internal documentation in this area."

Data classification agents raise fears, opportunities

Datadog made more than 100 updates to its observability platform during DASH, several of them offering an AI-assisted approach to data gathering. These include auto-tagging features in its new Runtime Prioritization Engine for security vulnerabilities and a new Auto-Processing feature for log management with its Observability Pipelines.

Reached by email following the presentation, Cronholm-Lundin told TechTarget that his team is following Datadog's agentic updates with interest and plans to evaluate some, but it's too early to say which will be relevant to the company.

Another DASH attendee who saw a demo of Auto-Processing said he was concerned about a potential loss of skills among engineers as AI-driven automation takes hold.

"My takeaway was that developers no longer need to have discipline in what they generate for log output or how they format it; AI will take care of all the details so humans can focus on iterating faster and shipping more code," said Eric Swanson, senior site reliability engineer at MagicSchool AI, makers of a generative AI platform for education in Denver, Colo., in an online interview during DASH.

"My concern with these types of solutions is that we are choosing to hand our agency over to Agents, and blunting the sharpness of our skills honed through critical thinking," Swanson said, emphasizing that he was speaking on behalf of himself, and not his employer. "The skills themselves may become obsolete, but we run the risk of dulling our thinking next."

 However, one breakout session presenter who has worked extensively with Observability Pipelines noted that AI can offer new opportunities for engineers to improve incident management efficiency.

Not every log generated by applications at US Bank gets a "first-class seat" in Datadog's back end, but Observability Pipelines enables the platform to route the data to other repositories to manage retention costs, such as Amazon's S3, said Bryan Pierson, vice president of enterprise observability engineering at US Bank, during his DASH presentation.

But Pierson's team also came up with a way to route logs to Datadog in the event developers need its AI features, he said.

"Right before DASH, we ended up putting out a proof-of-concept application called Kickflip that puts the power back in the application team's hands to enable their debug logs if they do need them on the platform for use with things like Bits AI," Pierson said.

Beth Pariseau, senior news writer for Informa TechTarget, is an award-winning veteran of IT journalism. Have a tip? Email her or connect on LinkedIn.

Dig Deeper on IT systems management and monitoring