惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

C
Cybersecurity and Infrastructure Security Agency CISA
月光博客
月光博客
Apple Machine Learning Research
Apple Machine Learning Research
量子位
Hugging Face - Blog
Hugging Face - Blog
罗磊的独立博客
小众软件
小众软件
T
Tailwind CSS Blog
博客园 - 聂微东
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
IT之家
IT之家
V
Visual Studio Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
T
The Exploit Database - CXSecurity.com
T
Tenable Blog
博客园 - 叶小钗
宝玉的分享
宝玉的分享
P
Privacy International News Feed
T
Tor Project blog
博客园_首页
AWS News Blog
AWS News Blog
雷峰网
雷峰网
C
Cisco Blogs
Help Net Security
Help Net Security
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
博客园 - 【当耐特】
T
Threat Research - Cisco Blogs
Last Week in AI
Last Week in AI
K
Kaspersky official blog
人人都是产品经理
人人都是产品经理
Recent Commits to openclaw:main
Recent Commits to openclaw:main
S
Schneier on Security
博客园 - Franky
W
WeLiveSecurity
L
LINUX DO - 热门话题
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
博客园 - 三生石上(FineUI控件)
WordPress大学
WordPress大学
爱范儿
爱范儿
酷 壳 – CoolShell
酷 壳 – CoolShell
P
Proofpoint News Feed
大猫的无限游戏
大猫的无限游戏
腾讯CDC
L
Lohrmann on Cybersecurity
J
Java Code Geeks
美团技术团队
博客园 - 司徒正美
The Cloudflare Blog
V
V2EX

Human Risk Management Blog

An Overview of Email Compliance Regulations and Reporting Report: AI-Assisted Fraud is Surging Attackers Use Spoofed ChatGPT Site to Deliver Malware I Love Device-Bound Session Credentials, But They Are Still Phishable and Hackable Nearly Two-Thirds of CEOs Cite Cyberattacks as Their Top Concern A Look at Spam vs. Phishing: 4 Key Differences KnowBe4 Wins Multiple 2026 TrustRadius Top Rated Awards Cyber Insurance for Mid‑Market Organizations in Southeast Asia KnowBe4 Earns Multiple 2026 Buyer's Choice Awards from TrustRadius The New Frontier: Securing Japan’s Hybrid Digital Workforce (2026 & Beyond) CyberheistNews Vol 16 #23 Now Phishing Attacks Use Real Hotel Reservations to Target Travelers Report: AI-Enabled Social Engineering Attacks Are on the Rise Your KnowBe4 Fresh Compliance Plus Content Updates from May 2026 FBI: Kali365 Phishing Kit is Targeting Microsoft 365 Accounts KB4-CON - AI Is Everything How to Secure AI Adoption In Your Organization Your KnowBe4 Fresh Content Updates from May 2026 The Silent Invitation: A Deep Dive into Calendar Invite Phishing Cyber Insurance for Mid‑Market Organizations in Southeast Asia Chinese-Language Phishing Kits Are Growing More Advanced Phishing Attacks Are Using Real Hotel Reservation Info to Target Travelers Warning: Scammers are Exploiting Geopolitical Unrest Athletes Are Increasingly Targeted by Social Engineering Attacks AI Agent Governance Part 3 - Runtime Governance: The Hidden Performance Cost of Agentic AI AI Agent Governance Part 2 - What Good Looks Like: Governing AI Agents in Practice 8 Ways to Reduce False Positives in Email Security Ransomware Attacks Drive a Surge in Cyber Insurance Claims My Favorite 5 KnowBe4 Agents Perry Carpenter KB4-CON 2026 Q&A: Deepfakes & Deception Free Gift Fallacy: How Attackers Harvest Credit Cards via Fake Surveys When Global Conflict Becomes a Cyber Weapon: How Iran Tensions and Other Stressful Events Fuel Social Engineering Attacks CyberheistNews Vol 16 #21 [Heads Up] GitHub Breach Shows Developer Tools Are Social Engineering Targets Alert: Extortion Groups Are Using Phishing Kits to Automate Their Attacks Beyond the Chatbot: Why Your AI Agents are Your Newest (and Most Vulnerable) Colleagues Report: Adversarial Use of AI is Evolving
How to Secure AI Agents: 4 Best Practices
KnowBe4 Team · 2026-06-13 · via Human Risk Management Blog

Imagine you give an AI agent permission to triage support tickets. A few weeks later, it’s accessing a system no one intended it to reach, putting the data within at risk of exposure or misuse.

Nothing dramatic happens at the moment. That’s what makes the risk tricky. AI agents don’t wait for approval the way traditional systems do, and they move faster than the controls you’ve set around them. That autonomy can be useful, but it also creates new ways for things to go wrong — and they aren’t always easy to spot in the moment.

Figuring out how to secure AI agents means tightening control without giving up the speed and efficiency that made them useful in the first place. This guide breaks down how to do exactly that.

Key Takeaways

  • AI agents can create useful speed, but autonomy also expands risk if no one defines the guardrails.
  • Security teams need visibility into what agents can access, what they can do, and where human review still matters.
  • Monitoring should continue after deployment, since risk can emerge as use cases, permissions and workflows change.
  • Strong controls work best when technical safeguards and human oversight reinforce each other over time.

What Are AI Agents and Why Is It Important to Secure Them?

AI agents are autonomous systems that can interpret goals, use tools like web browsers, retrieve information, and take action to complete tasks. Unlike traditional generative AI tools, which need a prompt to get started, AI agents can initiate actions on their own based on a loose set of instructions.

In terms of securing a digital workforce, agents’ greater autonomy requires more oversight. That means tracking what the agent is doing, defining what it should not do, and flagging when human intervention is needed.

Why AI Agents Can Create New Security Concerns

Implementing AI agents can boost your organization’s productivity. However, because they follow a long chain of actions across systems you may not watch closely, they also create risk.

If they reach data they shouldn’t or follow the wrong instruction, they could cause significant damage before anyone has a chance to catch the error.

When an agent has access to multiple tools, it also inherits each of their risks. A compromised SaaS account, a misconfigured API, or a malicious input can harm everything the agent touches.

That’s why securing agentic AI isn’t only about the technology. It also comes down to training humans and AI agents to minimize risk.

4 Best Practices for Securing AI Agents

There’s no single control that can fully secure AI agents, because the risk shows up in more than one place. An agent can be over-permissioned, fed bad input, make the wrong decision, act on the wrong system, or go too far before anyone notices.

A layered approach prevents one weak point in your system from becoming a larger problem. The best practices below align with four core pillars of agent risk protection: discover, monitor, detect, and protect.

  1. Discover every AI agent in your environment
  2. Monitor agent activity with searchable audit trails
  3. Detect risky behavior and policy drift early
  4. Protect sensitive workflows with flexible controls

1. Discover Every AI Agent in Your Environment

Don’t assume your IT department knows about every AI agent deployed within your organization’s systems. When many teams are experimenting with AI independently, it’s easy to end up with shadow deployments that never make it into a formal inventory.

These different avenues create blind spots that create unseen risk, with no one being responsible for tracking the full picture.

Security teams need to know where each agent lives, what systems it connects to, what data it can reach, and which users or teams rely on it.

A complete inventory creates the foundation for human risk management, by revealing where automation is already shaping behavior and where governance still needs to catch up.

2. Monitor Agent Activity with Searchable Audit Trails

Once agents are active, you need a clear record of what they’re doing. To effectively monitor agent activity, you must capture:

  • Prompts
  • Tool calls
  • System interactions
  • Outputs
  • Any escalations or exceptions

Without that trail, it’s hard to solve for missteps, especially if an agent has been acting across multiple systems or workflows.

For example, an agent may reach into a system it shouldn’t access, respond to a prompt too literally, or carry out a task that was never intended. A searchable audit trail gives security teams the ability to investigate those events quickly and understand whether they represent a one-off mistake or part of a larger pattern.

3. Detect Risky Behavior and Policy Drift Early

Early detection identifies when a workflow has become too dependent on an agent operating without enough guardrails. It also catches risky behaviors that suggest:

Humans often create risks unintentionally. People adapt to convenience, overlook warning signs, or assume a system is behaving correctly because it has worked before. Detecting drift early gives security teams a chance to intervene before a workflow becomes an exposure.

4. Protect Sensitive Workflows with Flexible Controls

Visibility and detection are essential, but they are not enough on their own. Organizations also need controls that respond differently depending on the sensitivity of the task and the level of risk involved.

For example, some agents may only require monitoring, and others may need approval, step-up verification, restrictions on data access, or a hard stop before they reach a sensitive system.

Flexible controls are necessary because AI agents do not operate in a fixed-risk environment. A workflow that is safe today may become risky tomorrow if the data changes or the agent is connected to a new system. Strong protection needs to account for that variability without making the technology unusable.

Why Human Risk Management Matters for Safer AI Agent Adoption

To effectively manage agent-related risk, just controlling the agents is not enough. People ultimately shape how AI is used day to day, by choosing what tools to adopt and what to do with them. That makes human risk management (HRM) just as important to AI agent adoption as it is for other security risks.

Technical Controls Alone Are Not Enough

No set of controls is perfect. Because AI agents can behave in unexpected ways, it’s impossible to design systems that anticipate every risk. Ultimately, securing AI agents depends just as much on human activity as it does on automated protections.

Safer AI Adoption Depends on User Behavior

Employees influence AI outcomes more than they realize. They decide what inputs to provide, when to override an agent, and whether to question unusual behavior. If they treat agents like infallible systems, risk increases quickly.

Employees need training to approach AI critically, so they can judge which outputs need double-checking, and which are okay to trust.

Adaptive Reinforcement Can Support Safer Outcomes

Ongoing, adaptive reinforcement helps people recognize when something feels off and avoid repeating mistakes over time. When you engage employees in the moment, while they’re interacting with an AI agent, you turn everyday guidance into a learning opportunity that sticks. Instead of generic follow-up, personalized training pinpoints exactly where someone needs reinforcement, then delivers the right message at the right time.

Secure AI Agents With Strong Guardrails and Reduced Human Risk

AI adoption should not create unmanaged human or workflow risk. Securing AI agents requires governance with clear guardrails, least-privilege access, data protection, ongoing monitoring and human oversight where it matters.

However, technical safeguards are not enough. Organizations also need to reduce the human risk that surrounds AI adoption.

Learn how KnowBe4’s AI Defense Agents strengthen security behavior with personalized training, adaptive phishing simulations, and intelligent automation.