Agentic AI tooling: Why runtime security is the missing layer

Sysdig Blog

Masterclass: AI is more than ChatGPT and LLMs CVE-2026-39987 update: How attackers weaponized marimo to deploy a blockchain botnet via HuggingFace 5 steps to securing AI workloads Marimo OSS Python Notebook RCE: From Disclosure to Exploitation in Under 10 Hours Security briefing: March 2026 The Sysdig MCP server is now available in AWS Marketplace Risk isn’t reduced until you take action: How teams resolve issues in the cloud AI infrastructure security: Why it deserves its own category Three pillars for building effective runtime-powered cloud defense, the right way Closing the cloud security gap with runtime security Seeing risk isn’t stopping it: Why visibility alone isn’t enough TeamPCP expands: Supply chain compromise spreads from Trivy to Checkmarx GitHub Actions AI coding agents are running on your machines — Do you know what they're doing? Runtime security for AI coding agents: Protecting AI-assisted development How runtime insights power every cloud security use case CVE-2026-33017: How attackers compromised Langflow AI pipelines in 20 hours Inline Cloud Response: Accelerating AWS threat containment for SOC teams Runtime malware detection for AWS Fargate Detecting CVE-2026-3288 & CVE-2026-24512: Ingress-nginx configuration injection vulnerabilities for Kubernetes Malware detection with Sysdig Security briefing: February 2026 Leveling up Kubernetes Posture: From baselines to risk-aware admission Eliminating runtime blind spots: How CleanStart and Sysdig build continuous trust across the container lifecycle LLMjacking: From Emerging Threat to Black Market Reality Real risks live at runtime: Why CISOs must care about deep telemetry in 2026 Sysdig named a Leader in the Forrester Wave™: Cloud Native Application Protection Solutions, Q1 2026 How to run rootless containers AI-assisted cloud intrusion achieves admin access in 8 minutes Security briefing: January 2026 Securing GPU-accelerated AI workloads in Oracle Kubernetes Engine Bringing OSS runtime security to AWS: Falco integration with AWS Security Hub CSPM Our customers have spoken: Sysdig rated a Strong Performer in Gartner® Voice of the Customer for Cloud-Native Application Protection Platforms Protecting sensitive business data in preparation for the organization's Gen AI VoidLink threat analysis: Sysdig discovers C2-compiled kernel rootkits AI is still a workload: A practical guide to securing AI workloads How threat actors are using self-hosted GitHub Actions runners as backdoors How Sysdig Sage delivers AI-powered, real-world vulnerability management Security briefing: December 2025 Top 10 ways to get breached in 2026 EtherRAT dissected: How a React2Shell implant delivers 5 payloads through blockchain C2 Introducing runtime file integrity monitoring and response with Sysdig FIM How to detect multi-stage attacks with runtime behavioral analytics EtherRAT: DPRK uses novel Ethereum implant in React2Shell attacks Detecting React2Shell: The maximum-severity RCE vulnerability affecting React Server Components and Next.js The rise of AI agents: How autonomous AI Is transforming cloud security Kubernetes 1.35 - New security features The Urgency of Securing AI Workloads for CISOs Security briefing: November 2025 Quantum and the cloud: Science fiction turned security strategy Cloud security, the right way: What the industry should demand (and why "good enough" isn't) Return of the Shai-Hulud worm affects over 25,000 GitHub repositories Detecting CVE-2024-1086: The decade-old Linux kernel vulnerability that’s being actively exploited in ransomware campaigns What’s old is new again: How to demystify AI security with AIBOMs Securing Kubernetes with agentic cloud security How agentic cloud security reduces real risks Hunting reverse shells: How the Sysdig Threat Research Team builds smarter detection rules Shifting left with AI and MCP: Sysdig + Amazon Q Developer How Falco and Stratoshark close the gap between open source runtime detection and deep forensic analysis Investigating security issues with ChatGPT and the GitHub MCP server New runc vulnerabilities allow container escape: CVE-2025-31133, CVE-2025-52565, CVE-2025-52881 Harden your LLM security with OWASP Security briefing: October 2025 How agentic AI is changing cloud security Kubernetes Incident Response: Detect, investigate, and contain in under 10 minutes Sysdig recognized as a Cloud Security Leader in Latio Tech Cloud Security Market Report AI echolocation of cloud risks using Sysdig & Snyk MCP servers Sysdig MCP Server: Bridging AI and cloud security insights Understanding CVE-2025-49844: “RediShell” Critical Remote Code Execution in Redis How Sysdig secures your containers and Kubernetes Sysdig Security Briefing: September 2025 Cloud security, the right way: The 3 pillars of real-time defense Open source spotlight: Bringing web application security to Falco with Falcoya's Nginx plugin Malicious NPM packages: Are you exposed? AI for SOC teams: 5 cloud security prompts to start your day with Sysdig Sage™ Shai-Hulud: The novel self-replicating worm infecting hundreds of NPM packages ZynorRAT technical analysis: Reverse engineering a novel, Turkish Go-based RAT Modern vulnerability management, built for the cloud Build your AWS incident response playbook with open source tools 2025 Gartner® CNAPP Market Guide: Runtime visibility is no longer optional Threat hunting with Sysdig: Uncovering “IngressNightmare” Open source spotlight: From alerts to action with AI-powered Falco Vanguard From triage to action: How Sysdig’s agentic cloud security platform slashes noise and accelerates remediation The vision comes to life: Agentic cloud security with Sysdig Sage™ Data security findings: A technical deep dive Connecting runtime to source: Sysdig and Semgrep integration Fix what matters, faster: How Sysdig and Semgrep are unifying security without silos – from code to runtime Defending sensitive data with Sysdig Secure Redefining cloud security, the right way Join the movement: The Sysdig Open Source Community is live A smarter, safer cloud in the age of AI Unifying detection and response: Sysdig + Cortex XSOAR for security at cloud speed The future of security is open, and it needs a unified hub: The Sysdig Open Source Community is here CVE-2025-53104: Command injection via GitHub Actions workflow in gluestack-ui Why MCP server security is critical for AI-driven enterprises What’s new in Sysdig — June 2025 AI-powered CNAPP with Sysdig Sage™ Revolutionizing Cybersecurity Search with Sysdig Sage™ Sysdig Threat Bulletin: Iranian Cyber Threats The end of the prioritization-only era: Vulnerability management needs action Dangerous by default: Insecure GitHub Actions found in MITRE, Splunk, and other open source repositories

Alejandro Magallon · 2026-05-19 · via Sysdig Blog

A developer asks an AI coding agent to refactor a microservice. Within seconds, the agent opens source files, executes shell commands, calls external APIs, and modifies cloud configuration. No human approves each step. No security tool monitors what the agent does between receiving the prompt and delivering the result.

This is not theoretical. Over the past year, researchers and security teams have disclosed a growing set of real-world attacks against agent infrastructure (from MCP tool poisoning to credential theft through coding agents) and industry surveys consistently find that most organizations have little visibility into the machine-to-machine traffic their agents generate. The tooling ecosystem powering these agents is maturing fast. The security controls governing them are not.

The stack at a glance

Agent infrastructure spans five layers, each introducing distinct attack surface:

Model Context Protocol (MCP): Standard interface for agents to discover and interact with external systems, each exposing a set of tools (callable functions the agent can invoke). Every MCP server an agent can reach is part of its effective attack surface.
Skills: Handbooks that shape agent behavior (methodology, tool ordering, escalation paths). Effectively policy-as-code. Tampering with skills changes what the agent does, not just what it can do.
Agent SDKs (Anthropic, LangChain, CrewAI): Frameworks where guardrails live or are delegated entirely to the developer. The SDK determines what's enforced by default.
Managed Agent Platforms (Claude Managed Agents, Amazon Bedrock Agents): Solve orchestration and sandboxing but not behavioral monitoring. Sandbox boundaries are not visibility boundaries.
Orchestration Layers: Coordinate multi-agent workflows. Delegation chains create lateral movement paths invisible to traditional network monitoring.

What this ecosystem does not solve is visibility into what agents do after they start executing.

How agents get compromised

The attack surface is not theoretical. MITRE ATLAS, extended through a 2025 collaboration with Zenity Labs, now catalogues a set of agent-specific techniques (AML.T0080–T0086), and the 2025 OWASP Top 10 for LLM Applications dedicates entry LLM06 to Excessive Agency, the risks of granting LLMs unchecked autonomy and tool access.

MCP tool poisoning

Invariant Labs disclosed this class of attack in April 2025. MCP tool descriptions are free-text fields that the LLM reads to understand how to use a tool. A malicious server can embed adversarial instructions directly in those descriptions, invisible to the user, processed by the model:

The user sees weather_lookup and approves. The LLM reads the full description, including the hidden payload, silently reads the SSH key, and passes it to the attacker-controlled server. Tool descriptions are fetched dynamically: they can change after user approval. A related variant, tool shadowing, injects descriptions that modify agent behavior with respect to trusted tools on other connected servers: cross-server exfiltration through the agent's own trust chain.

Indirect prompt injection via tool responses

An agent calls a legitimate tool, like a web browser, a file reader or a database query. The returned content contains hidden adversarial instructions:

Tool outputs and instructions share the same context window. The agent then uses a legitimate tool (email, HTTP request, Slack message) to exfiltrate data. This turns prompt injection from an information leak into an action execution vulnerability.

The EchoLeak incident, disclosed in June 2025 by Aim Security as CVE-2025-32711, demonstrated exactly this pattern: a crafted email injected instructions into Microsoft 365 Copilot, which then exfiltrated sensitive data via markdown rendering to an attacker-controlled server, bypassing Content Security Policy through a Microsoft-approved domain.

Credential theft via coding agents

AI coding agents in CI/CD pipelines routinely have shell access and run with the credentials of the CI system. A malicious code comment in a pull request can exploit this:

‍

The agent, following its training to be helpful, executes the command. The script reads environment variables, harvests credentials from .git/config, exfiltrates everything, and cleans up.

This pattern was publicly demonstrated in July 2025 by researchers at General Analysis against a Supabase + Cursor setup: a prompt injection embedded in a support ticket caused the Cursor agent (running with Supabase service_role privileges) to read private tables and exfiltrate integration tokens back through the same support channel.

Threat model: Attack vectors by ecosystem layer

Every layer of the agentic stack introduces specific attack vectors. Mapping them to detection strategies is the first step toward a security posture that matches the architecture.

Ecosystem Layer	Attack Vector	MITRE ATLAS	Detection Strategy
MCP	Tool poisoning via hidden description payloads	AML.T0081	Monitor for sensitive file reads unrelated to task; audit tool descriptions at registration
MCP	Cross-server exfiltration through tool shadowing	AML.T0086	Detect outbound connections to unexpected endpoints from agent containers
Skills	Skill manipulation to alter agent behavioral policy	AML.T0081	Version-control skills; detect runtime modification of skill files
Agent SDKs	Prompt injection exploiting framework-level trust	AML.T0051	Tool-call auditing at SDK layer; input/output sanitization
Managed Platforms	Anomalous behavior within sandbox boundaries	AML.T0055	Runtime syscall monitoring inside sandbox; capability-scoped containers
Orchestration	Privilege escalation via agent delegation chains	AML.T0086	Monitor agent-to-agent credential passing; enforce delegation scope limits
Orchestration	Memory poisoning for persistent manipulation	AML.T0080	Audit agent memory stores; detect unexpected memory writes between sessions

Why baselines fail (and what works instead)

There is a hard truth most vendor content avoids: traditional behavioral baselines do not work for AI agents.

Container security relies on deterministic profiling. A web server always calls the same syscalls, reads the same files and connects to the same hosts. Deviations from baseline are anomalies. This is the foundation of drift detection and most runtime security products, including Sysdig's, which is why we’re extending the model for agentic workloads.

AI agents break this model. The same agent given the same task takes different action sequences each time. Temperature settings introduce randomness. User-driven prompts create unbounded variability. And the very actions you want to detect, like shell execution, file reads or outbound connections, are the agent's normal operating behavior. Worse: unlike traditional code, the decision-making lives in model weights and a context window you cannot inspect. There is no static analysis path, no policy you can write upfront that anticipates every input. What the agent does at runtime is the only ground truth.

So, agent behavior is nondeterministic, and traditional controls assume predictable execution. But this doesn't mean runtime security is useless for agents. It means the approach must be layered:

Known-bad detection: An agent reading SSH keys, connecting to known-malicious IPs, or executing post-exploitation tools is suspicious regardless of non-determinism. Syscall-level rules catch these reliably.
Capability scoping: Restricting agent containers with seccomp, AppArmor, network policies and read-only filesystems means detection monitors sandbox violations rather than behavioral deviations. The baseline becomes the sandbox boundary itself.
Tool-call-level auditing: Monitoring at the agent framework layer (which tools were called, with what arguments, in response to what prompt) provides the semantic context that syscall monitoring lacks.

Here's what known-bad detection looks like in practice using Falco, the open-source runtime security engine. An agent process reads cloud credentials unrelated to its task, a pattern consistent with MCP tool poisoning or prompt injection:

- rule: AI Agent Reads Sensitive Credentials
  desc: AI agent process reading cloud credentials, SSH keys, or API tokens.
  condition: >
    evt.type in (open, openat, openat2) and evt.dir = <
    and container.image.repository contains "ai-agent"
    and (fd.name startswith /root/.aws
         or fd.name startswith /root/.ssh
         or fd.name endswith .env
         or fd.name contains credentials)
  output: >
    AI agent read sensitive file
    (file=%fd.name command=%proc.cmdline container=%container.name)
  priority: CRITICAL
  tags: [container, ai, mitre_credential_access, AML.T0055]

Similar rules catch unexpected outbound connections from agent containers and writes outside the designated workspace, each mapping to the attack vectors in the table above. Agent behavior might be non-deterministic at the LLM layer, but every agent action ultimately produces deterministic infrastructure artifacts, like syscalls, file operations or network connections. Runtime security operates at that deterministic layer.

What security teams should do now

Instrument the runtime, especially inside sandboxes. Deploy syscall-level detection on every environment where agents operate. Managed platforms don't solve this for you.
Audit your MCP surface. Inventory every MCP server your agents can reach. Treat tool descriptions as untrusted input. Monitor for description changes after initial approval.
Scope capabilities aggressively. Use seccomp, AppArmor, network policies, and read-only filesystems to constrain agent containers. When baselines are unreliable, the sandbox boundary becomes the baseline.
Treat agent delegation as lateral movement. Multi-agent architectures create trust chains invisible to network monitoring. Log what agents delegate, to whom, and whether delegated tasks exceed the original scope.

Operationalizing this at scale with Sysdig

The Falco rule shown above is illustrative, a starting point, not a production ruleset. Running detection across a real agent fleet means keeping rules current as attack patterns evolve (new MCP servers, new SDK releases, fresh CVEs), mapping each alert to the right MITRE ATLAS technique, and correlating runtime signals with the rest of your cloud security posture. This is where Sysdig AI Workload Security extends what open-source Falco provides.

Managed detection rules for agentic workloads. Sysdig's Threat Research Team curates Falco Feeds, which is a continuously updated ruleset mapped to MITRE ATLAS (including the AML.T0080–T0086 agent-specific techniques). Rules tuned for MCP tool poisoning, prompt injection exfiltration, and coding-agent credential theft arrive pre-packaged, pre-tested, and kept up to date.
Agent runtime inventory. Sysdig AI Workload Security shows where AI engines and packages are running in your environment (across providers like OpenAI, Anthropic, and Bedrock). Each workload is correlated with its in-use AI packages, their exploitable vulnerabilities, public-exposure status, and the runtime events firing on it.
Response integrated with your existing workflow. Detections route through the same incident pipeline your team already uses: isolate the container, revoke the agent's credentials, page the right on-call, or trigger an automated playbook.

Open-source Falco remains the foundation: the detection layer, the rule language and the community. What Sysdig adds is the scale, curation, and integration needed to run it as a production-grade program rather than a project.

The layer where anomalies become visible

Every other layer of your stack assumes deterministic behavior. Agents break that assumption. Security tooling has to break its assumptions too: not by chasing baselines that no longer hold, but by moving detection to the layer where agents, opaque at the LLM level but observable at the system level, still produce observable artifacts.

See how Sysdig applies runtime detection to AI agent workloads. Request a demo.

此内容由惯性聚合(RSS阅读器)自动聚合整理，仅供阅读参考。原文来自 — 版权归原作者所有。

推荐订阅源

Sysdig Blog

The stack at a glance

How agents get compromised

MCP tool poisoning

Indirect prompt injection via tool responses

Credential theft via coding agents

Threat model: Attack vectors by ecosystem layer

Why baselines fail (and what works instead)

What security teams should do now

Operationalizing this at scale with Sysdig

The layer where anomalies become visible