
























Organizations are rapidly deploying autonomous and semi-autonomous AI agents that can make decisions, execute tasks and interact directly with systems without constant human oversight. That shift is driving investment, with the global agentic AI in cybersecurity market projected to grow to $322.39 billion by 2033.
The surge represents enormous gains in efficiency and agility — and also signals a dramatic increase in risk.
As AI agents become embedded in critical workflows, they gain access to sensitive data and operational systems. Without the right safeguards, those capabilities can be manipulated, turning productivity tools into attack paths.
In 2026, security teams need to govern AI agents with the same rigor as any privileged user, and account for agents taking action on their own.
Agentic AI refers to autonomous systems capable of executing tasks with minimal human intervention. Unlike traditional generative AI, which responds to prompts, agentic AI takes action across multiple steps and interacts with external tools and workflows.
Organizations are swiftly integrating agentic AI into day-to-day operations, with 35% adopting it within a two-year period, according to MIT Sloan Management Review. While this enables rapid efficiency gains, it also presents new security risks that organizations must address.
Rapid agentic AI adoption expands access points to sensitive systems and data, which cybercriminals can exploit. Attackers are also using agentic AI tools to scale and refine their tactics.
Even more concerning, agentic adoption creates an additional “agent layer” of risk. Agents behave in ways that are harder to predict, validate and trace than traditional software, and that means you can’t rely solely on controls built for rule-based systems. You also need to account for a dynamic environment where human intent and autonomous agent behavior both influence outcomes.
Without clear oversight into how agent activity unfolds, you risk exposure to threats. Securing this layer requires an integrated approach that brings human and agent activity into a single, manageable security framework.
Agentic AI risks stem from how these systems behave in real environments and how people use, trust and rely on them. Some of the most pressing agentic AI security risks include:
These risks commonly appear in the gaps between systems, such as a prompt that seems harmless, an integration that was approved for convenience, or an agent that is given too much freedom to act on its own.
Although they’re autonomous, AI agents can be influenced, which makes them vulnerable to manipulation. Attackers can embed malicious instructions in prompts, documents, messages, or webpages in an attempt to alter the agent’s behavior or override its intended purpose.
In many cases, the malicious content is hidden inside ordinary-looking text or layered into a workflow the agent already trusts. If the agent is not built to distinguish between user intent and attacker-controlled instructions, it may follow the wrong directive without obvious signs of compromise.
AI agents frequently handle private information such as proprietary or customer data. If safeguards aren’t in place, a single interaction can become a problem. For example, an agent could reveal too much context, copy data into the wrong system, or include sensitive details in an output that was not meant for broad distribution.
AI agents are surprisingly hungry for resources. Left unchecked, they may keep pulling data, making calls, or repeating tasks in ways that create activity that’s hard to spot. That lack of guardrails can easily drive up costs. It may seem efficient at first, but it can quickly become waste if an agent keeps working beyond what it was meant to do.
An AI agent is only trustworthy when it knows what should never be exposed. Without controls, agents may produce inappropriate, misleading, or off-brand content — or surface information that doesn’t belong in the output at all. That creates risk for both the organization and the people relying on the agent’s response.
When an agent is given access to tools, systems, or data, its permissions matter just as much as its instructions. If those permissions are too broad, a manipulated or misconfigured agent may reach farther than intended, opening the door to unauthorized actions and larger-scale impact. This is where least privilege becomes essential.
Even a well-designed agent can overstep when it’s allowed to act with too much freedom. It might complete a task that seems useful on the surface but moves outside the original workflow, approval path, or business intent.
That kind of behavior can create confusion, undermine trust, and introduce new risk. Keeping agents within clearly defined boundaries helps ensure they stay helpful without becoming a problem of their own.
Cybercriminals are adapting their tactics to take advantage of how agents operate.
Instead of sending obvious spam, cybercriminals can use AI to craft phishing messages that sound natural, reflect the target’s role or industry, and mimic the tone of a trusted sender. This helps them better target organizations with personalized messages based on publicly available information.
It also supports multilingual campaigns, voice-based impersonation, and other forms of social engineering limited by time or language barriers.
As AI-assisted phishing becomes more convincing, organizations need to assume that many malicious messages will look professionally written and contextually accurate, not sloppy or generic.
Cybercriminals no longer need to spend hours of manual research on targeted attacks. AI agents instantly scan sources like websites, social media, leaked datasets, and more to gather information. From there, cybercriminals can identify which systems are in use and where human behavior might create an opening.
For example, a cybercriminal might use AI to map an organization’s structure, learn the names of executives or finance staff, or identify vendors and platforms that could be used in a delivery chain.
Cybercriminals are using agentic AI to automate attack workflows, from reconnaissance and target identification to message generation and follow-up interactions. It produces faster, more scalable campaigns with shorter time to compromise. It can also make incidents harder to investigate as activity becomes distributed across multiple systems and adapts based on outcomes.
Although agentic AI introduces new risks, it can also change how you defend against threats.
Instead of relying on single tools or static automation, you can deploy specialized AI agents that independently coordinate toward a shared outcome. This “team of experts” model mirrors how security teams work, but at machine speed and scale.
Adoption is rising. In fact, Cyber Security Tribe reports that as of early 2026, 73% of organizations are already using or actively developing agentic AI within their cybersecurity programs. Used correctly, agentic AI can help enhance detection, automate workflows and support continuous monitoring.
AI agents detect anomalies and threats beyond human capacity, reducing investigation times from tens of minutes to near-real time. Gartner predicts that AI agents will reduce detection time by 50% before 2027.
Agents are able to specialize in tasks such as identifying phishing patterns, spotting unusual user behavior or flagging emerging indicators, which provides a complete view of risk across the organization. After catching anomalies, agents can automatically update firewalls or clear the cache, for example, to prevent risks.
Traditional automation typically handles isolated tasks. Agentic AI can orchestrate end-to-end workflows, with multiple agents coordinating actions such as triaging alerts, investigating incidents and initiating responses with less manual effort.
For example, one agent might analyze an incoming alert, another validates severity and a third triggers the appropriate response. This reflects how a human security team collaborates, but with more speed and consistency.
Agentic systems work around the clock, providing continuous monitoring across environments. This ensures threats are detected and addressed more quickly than they would be by human security teams.
This shift moves security operations toward continuously adaptive systems, where AI agents don’t just monitor environments but actively interpret signals and adjust responses in real time.
Securing agentic AI requires focus on visibility, governance and real-time intervention, not just static controls.
Apply least-privilege access. Agents should only have access to the systems and data required for their function. But permissions alone are not enough. Agents can still be manipulated within allowed scope, so additional layers of security are vital.
Track and analyze agent activity in real time. Maintain a centralized inventory of all agents your organization uses, while staying aware of unsanctioned or shadow AI. Also, monitor for anomalies such as unusual data access, privilege changes or new integrations.
Treat integrations as part of the active attack surface. Ensure connections are authenticated, monitored and regularly assessed for risk.
Human behavior still influences agentic AI security. Employees need guidance when they interact with AI systems. Provide them with ongoing security training as well as contextual feedback at the moment of interaction to reduce risk and improve decision-making.
Digital workforce security extends beyond traditional phishing awareness. It includes how employees interact with AI agents, how agents respond and how those interactions impact overall risk.
Effective digital workforce security can:
This integrated approach is essential to successfully secure AI agents and build trust in the workforce.
KnowBe4 secures the entire human-AI workforce by combining real-time agent governance with behavior-driven risk reduction.
With this approach, you can:
This goes beyond traditional security by actively governing agent behavior and improving human decision-making.
Learn how KnowBe4 reduces human and AI risk through agent governance and behavior-driven security
AI agents introduce risks such as prompt injection, data exposure, integration vulnerabilities and automated attack execution. These risks stem from technical weaknesses and human-agent interactions.
Cybercriminals can use agentic AI to automate phishing, conduct reconnaissance and execute multi-step attacks at scale with less human effort.
No. AI agents improve efficiency and detection, but human oversight remains essential for decision-making, governance and risk management.
To secure AI agents, organizations should implement least-privilege access, monitor behavior, secure integrations and invest in Human Risk Management alongside technical controls.
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。