惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

T
Threat Research - Cisco Blogs
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
V
Vulnerabilities – Threatpost
GbyAI
GbyAI
P
Proofpoint News Feed
L
LINUX DO - 热门话题
P
Palo Alto Networks Blog
A
About on SuperTechFans
T
Tenable Blog
M
MIT News - Artificial intelligence
IT之家
IT之家
I
Intezer
D
DataBreaches.Net
爱范儿
爱范儿
T
Threatpost
C
CERT Recently Published Vulnerability Notes
云风的 BLOG
云风的 BLOG
博客园 - 三生石上(FineUI控件)
WordPress大学
WordPress大学
K
Kaspersky official blog
大猫的无限游戏
大猫的无限游戏
A
Arctic Wolf
Y
Y Combinator Blog
Cyberwarzone
Cyberwarzone
酷 壳 – CoolShell
酷 壳 – CoolShell
D
Darknet – Hacking Tools, Hacker News & Cyber Security
H
Help Net Security
Microsoft Security Blog
Microsoft Security Blog
Spread Privacy
Spread Privacy
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
AWS News Blog
AWS News Blog
博客园 - 聂微东
C
Check Point Blog
S
Securelist
有赞技术团队
有赞技术团队
雷峰网
雷峰网
aimingoo的专栏
aimingoo的专栏
Last Week in AI
Last Week in AI
Stack Overflow Blog
Stack Overflow Blog
MongoDB | Blog
MongoDB | Blog
D
Docker
G
GRAHAM CLULEY
T
The Exploit Database - CXSecurity.com
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tailwind CSS Blog
L
Lohrmann on Cybersecurity
G
Google Developers Blog
C
Cyber Attacks, Cyber Crime and Cyber Security
L
LangChain Blog

Google adds end-to-end Gmail encryption to Android, iOS devices for enterprises | CSO Online

Flowise’s MCP implementation can run ghost commands CSO30 ASEAN & Hong Kong Awards 2026 open for nominations GDPR set the tone for regulatory action — and the AI fine pushback to come 6 critical security gaps every CISO must address Notepad++ vulnerabilities could enable arbitrary code execution on Windows systems The Gentlemen are coming for your files, and then your network Cybersecurity trends in SEC filings Russia-aligned crime group Greyvibe extensively uses AI in attacks Microsoft and security researcher’s dueling posts about cybersecurity disclosures get nasty DNS-AID will make AI agents easier to discover, says Linux Foundation Certifiably random: Swiss researchers claim perfect random number source Indian CERT urges firms to contain exploited internet-facing flaws within 12 hours GlassWorm falls, but the repo problem is far from solved The AI governance imperative you can’t afford to ignore What the industrialization of exploitation means for defenders Employees are unknowingly inviting tech support impersonators into firms, says FBI IBM and Red Hat want to become the ‘security clearinghouse’ for open source applications in the enterprise Lack of response to critical vulnerability in Gogs is a reminder of the limits of open source projects AI models more vulnerable than claimed when faced with iterative attacks The NSA, ‘Mythos’ and the quiet emergence of AI cyber doctrine DSPM buyer’s guide: Top 10 data security posture management tools Another IT governance headache: AI-enabled sanction evasion Vulnerabilities have become cyber attackers’ No. 1 door to the enterprise Security experts caution MFA alone can no longer stop threat actors Stop treating AI governance as a review layer. Make it release infrastructure TrapDoor malware campaign puts developer workstations in CISO spotlight GitHub Actions abused by Megalodon attack to slip malicious commits into 5,500 repos To pay, or not to pay: 58% of CISOs say they would pay the ransom for their data AI security needs a shift from models to systems, researchers argue Project Glasswing has uncovered 10,000 vulnerabilities: Anthropic Identity as the primary attack surface: What modern breaches are really exploiting Why your AI strategy stops where the PLC starts: Hard lessons from the OT frontlines Google leaks details for Chromium bug that can turn browsers into bots Microsoft patches two zero-day flaws in Defender Unpatched ChromaDB flaw leaves servers open to remote code execution AI becoming an SOC imperative for curtailing emerging cyber threats SHub Reaper impersonates Apple, Google, and Microsoft in one MacOS attack chain Why some security fixes never reach your vulnerability dashboard Drupal admins rushing to patch maximum severity SQL injection vulnerability Internet Explorer may be dead, but its ghost still runs malware 7 tips for accelerating cyber incident recovery Schwachstellen managen: Die besten Vulnerability-Management-Tools SIEM-Kaufratgeber Security-Infotainment: Die besten Hacker-Dokus Contractor’s public GitHub account exposed GovCloud and CISA credentials Microsoft disrupts malware code-signing service used by ransomware gangs AI cyberattackers are getting better faster New image-based prompt injection attack targets multimodal AI models ‘Patched’ Windows bug resurfaces 6 years later as working SYSTEM-level exploit Why the best security investment a board can make in 2026 isn’t another tool AI coding is fueling a secrets-sprawl crisis few CISOs are containing Expired domain leads to supply chain attack on node-ipc npm package Exchange Server zero-day vulnerability can be triggered by opening a malicious email Autonomous systems are finally working. Security is next EU’s Cyber Resiliency Act will put IT leaders to the test The economics of ransomware 3.0 AI agent finds 18-year-old remote code execution flaw in Nginx Meet Fragnesia, the third Linux kernel vulnerability in a month FlowerStorm phishing gang adopts virtual-machine obfuscation to evade email defenses PraisonAI vulnerability gets scanned within 4 hours of disclosure What CISOs need to land a board role Fired employee sought AI help to hide deletion of hosting firm’s customer data Fortinet fixes two critical RCE flaws in FortiAuthenticator and FortiSandbox What happens when China’s AI catches up to Mythos? Palo Alto bets on identity security for autonomous AI with Idira launch ClickFix finds a backup plan in PySoxy proxy chains CISA’s AI SBOM guidance pushes software supply-chain oversight into new territory 2026 CSO Award winners showcase business-enabling cyber innovation Google entdeckt erstmals KI-basierten Zero-Day-Exploit Der Kaufratgeber für Breach & Attack Simulation Tools May Patch Tuesday roundup: Critical holes in Windows Netlogon, DNS, and SAP S/4HANA Fake Claude Code takes the IElevator to your browser secrets cPanel flaw exposes enterprises to hosting supply-chain risks Developer workstations are the new beachhead CISOs step into the AI spotlight Why patching SLAs should be the floor, not the strategy Cybersicherheitsvorschriften: So erfüllen Sie Ihre Compliance-Anforderungen Customer Identity & Access Management: Die besten CIAM-Tools Entries now open for the 2026 CSO30 Australia Awards Google discovers weaponized zero-day exploits created with AI New ‘Dirty Frag’ exploit targets Linux kernel for root access AI security is repeating endpoint security's biggest mistake 8 guiding principles for reskilling the SOC for agentic AI 1,800+ MCP servers exposed without authentication: How zero trust can secure the AI agent revolution Five new holes, one exploited, found in Ivanti Endpoint Manager Mobile Claude in Chrome is taking orders from the wrong extensions Your CTEM program is probably ignoring MCP. Here’s how to fix it Pen tests show AI security flaws far more severe than legacy software bugs Your refresh plan has a CVE blind spot Become a millionaire by bug hunting on Android Ollama vulnerability highlights danger of AI frameworks with unrestricted access Bots in translation: Can AI really fix SIEM rule sprawl across vendors? Critical Palo Alto Networks software bug hits exposed firewalls CISOs: Align cyber risk communication with boardroom psychology Ten years later, has the GDPR fulfilled its purpose? Iranian state-backed spies pose as ransomware slingers in false flag attacks New malware turns Linux systems into P2P attack networks Poisoned truth: The quiet security threat inside enterprise AI Train like you fight: Why cyber operations teams need no-notice drills Die besten DAST- & SAST-Tools
Prompt injection breaks today’s AI agents, study warns
by Gyana Swain · 2026-06-12 · via Google adds end-to-end Gmail encryption to Android, iOS devices for enterprises | CSO Online

Researchers say current AI agents fail to consistently resist prompt injection attacks, exposing enterprises to failures that conventional security testing may overlook.

Today’s AI web agents have no dependable defenses against prompt injection, according to new research showing that not a single attack scenario was consistently blocked across leading systems powered by GPT‑5 and Gemini.

The findings come from StakeBench, a stakeholder-centric benchmark developed by researchers from Nanyang Technological University, ST Engineering, IBM Research, and the University of Illinois Urbana-Champaign to evaluate prompt injection attacks against AI agents operating in realistic web environments.

The researchers executed 3,168 adversarial runs across NanoBrowser and BrowserUse using 264 benchmark cases. Indirect prompt injection attacks, where malicious instructions are hidden inside ordinary web content such as product reviews and metadata, achieved attack success rates ranging from 41.67% to 68.16%, while direct prompt injection exceeded 79% across all tested configurations.

“Crucially, these failures exhibit distinct patterns when analysed through a stakeholder lens: some attacks succeed without disrupting the user’s delegated task while disproportionately harming third parties (stealthy parasitism), whereas others disrupt task completion without realizing the adversarial objective (misaligned disruption),” the researchers wrote in a paper.

OpenAI and Google did not immediately respond to requests for comment.

Every attack objective exposed at least one failure mode

The benchmark evaluated web agents across four possible outcomes: Robust Behavior, Stealthy Parasitism, Misaligned Disruption, and Compounded Failure. Robust Behavior represents the ideal state in which an agent completes a user’s task without advancing an attacker’s objective or exhibiting execution instability.

The researchers argue that the findings reveal a broader problem than high attack success rates.

“The Robust Behavior region remains unpopulated across all evaluated configurations,” they wrote, meaning every tested attack objective resulted in at least one meaningful failure dimension, whether successful adversarial manipulation, disruption of the user’s intended task, or execution instability.

The authors say this demonstrates that “prompt-injection vulnerability in deployable web agents cannot be characterized by any single metric in isolation,” because attack success and task disruption are “weakly coupled in practice.”

Attacks can succeed while users see nothing wrong

One of the failure modes identified by the benchmark is what the researchers call “stealthy parasitism,” in which an AI agent completes the user’s delegated task while simultaneously advancing an attacker’s objective.

The paper illustrates the risk with an online shopping scenario: “A malicious prompt injected into product reviews may bias an agent toward a specific item: although the user may still receive an acceptable recommendation, the same behaviour can disadvantage competing sellers and undermine platform integrity.”

The researchers argue that prompt injection has evolved into “a system-level security problem with multi-party harm,” rather than a model safety issue affecting only the end user.

Different stakeholders face different risks

Unlike existing benchmarks that primarily measure attack success, StakeBench evaluates harm across three stakeholder groups: end users, third-party sellers, and platforms.

The results show that those groups experience materially different risks.

Seller-targeted attacks recorded the highest attack success rates across both evaluated web agents. User-targeted attacks, however, produced the lowest task deviation rates, suggesting they may be harder to detect because workflows continue to appear normal even when adversarial objectives are achieved.

According to the researchers, “the same agent can simultaneously appear stealthy on user-targeted attacks, susceptible on seller-targeted attacks, and unstable on platform-targeted attacks.”

That, they argue, makes “aggregate ASR alone insufficient to characterize stakeholder-specific vulnerability.”

Models and architectures influence outcomes

The benchmark also found meaningful differences between AI models and agent architectures.

Replacing GPT-5 with Gemini-2.5-Flash increased indirect prompt injection success rates by 26.49 percentage points on NanoBrowser and by 6.2 percentage points on BrowserUse, the paper said. BrowserUse also consistently exhibited higher task deviation and behavioral irregularity than NanoBrowser, it added.

According to the researchers, the findings suggested prompt injection resilience depends not only on the language model but also on how it is implemented within an autonomous agent.

“These results indicate that prompt-injection security in deployable web agents is not a scalar property of the backbone model but a distribution of harm whose realisation is jointly determined by the affected stakeholder, the semantic alignment between the injected objective and the user’s task, and the architectural context in which the backbone is deployed,” the paper added.

Images may emerge as the next attack vector

The researchers also explored whether prompt injection could extend beyond text.

In a preliminary multimodal experiment, they modified only a product image while leaving accompanying text, ratings, and page structure unchanged. The manipulated product’s selection rate increased from 10% to 76.67% without rating signals, suggesting visual content alone may significantly influence AI agent decisions.

While the experiment was limited in scope, the researchers said the results indicate “the IPI surface relevant to deployable web agents may extend beyond textual channels to visual ones,” pointing to another emerging attack vector as enterprises increasingly deploy autonomous AI systems.

SUBSCRIBE TO OUR NEWSLETTER

From our editors straight to your inbox

Get started by entering your email address below.