惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Fox-IT International blog
Recent Announcements
Recent Announcements
D
Docker
IT之家
IT之家
B
Blog
Jina AI
Jina AI
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 【当耐特】
Google DeepMind News
Google DeepMind News
F
Fortinet All Blogs
量子位
C
Check Point Blog
Microsoft Azure Blog
Microsoft Azure Blog
罗磊的独立博客
博客园 - 司徒正美
李成银的技术随笔
美团技术团队
Blog — PlanetScale
Blog — PlanetScale
雷峰网
雷峰网
The GitHub Blog
The GitHub Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
J
Java Code Geeks
T
The Blog of Author Tim Ferriss
酷 壳 – CoolShell
酷 壳 – CoolShell
MongoDB | Blog
MongoDB | Blog
P
Proofpoint News Feed
L
LangChain Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Y
Y Combinator Blog
大猫的无限游戏
大猫的无限游戏
有赞技术团队
有赞技术团队
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
V
Visual Studio Blog
T
Tailwind CSS Blog
H
Help Net Security
Engineering at Meta
Engineering at Meta
小众软件
小众软件
B
Blog RSS Feed
Stack Overflow Blog
Stack Overflow Blog
月光博客
月光博客
M
Microsoft Research Blog - Microsoft Research
宝玉的分享
宝玉的分享
人人都是产品经理
人人都是产品经理
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
GbyAI
GbyAI
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Last Week in AI
Last Week in AI
Martin Fowler
Martin Fowler
Stack Overflow Blog
Stack Overflow Blog

The Register - Security: Research

Kids say they can beat age checks by drawing on a fake mustache Kids say they can beat age checks by drawing on a fake mustache What type of 'C2 on a sleep cycle' do they leave behind? Novel Chinese spy group found in critical networks in Poland, Asia Researchers move in the right direction, develop powerful GPS interference alarm ORNL builds more sensitive GPS interference detector GitHub: Woah, a genuinely helpful AI-assisted bug report that isn't total slop. Here, Wiz, take this wad of cash Researchers find cyber-sabotage malware that may predate Stuxnet by five years Researchers find cyber-sabotage malware that may predate Stuxnet by five years Weak security means attackers could disable all of a city's public EV chargers Vibe coding upstart Lovable denies data leak, cites 'intentional behavior,' then throws HackerOne under the bus Agents hooked into GitHub can steal creds – but Anthropic, Google, and Microsoft haven't warned users Security researchers tricked Apple Intelligence into cursing at users. It could have been a lot worse Anthropic: All your zero-days are belong to Mythos Don't open that WhatsApp message, Microsoft warns Don't open that WhatsApp message, Microsoft warns Security boffins scoured the web and found hundreds of valid API keys Security boffins scoured the web and found hundreds of valid API keys Scammers have virtual smartphones on speed dial for fraud 1K+ cloud environments infected following Trivy supply chain attack Claude attacks were 'Rorschach test' for infosec community Lightning-fast exploits mean patch fast, says Cisco Talos AI agents are 'gullible' and easy to turn into your minions Smooth criminals talking their way into cloud environments, Google says Snoops plant info-stealing malware on iPhones, Google warns Snoops plant info-stealing malware on iPhones, Google warns Cybercrime up 245% since the start of the Iran war Rogue AI agents can work together to hack systems and steal secrets Rogue AI agents can work together to hack systems and steal secrets Fake job applications pack malware that kills endpoint detection before stealing data Fake job applications pack malware that kills endpoint detection before stealing data AI vs AI: Agent hacked McKinsey's chatbot and gained full read-write access in just two hours Kaspersky dismisses claims Coruna iPhone exploit kit is connected to NSA-linked operation Until last month, attackers could've stolen info from Perplexity Comet users just by sending a calendar invite Until last month, attackers could've stolen info from Perplexity Comet users just by sending a calendar invite Denizens of DEF CON are 'fed up with government' DEF CON hackers 'fed up with government,' Jake Braun says Ransomware payments cratered in 2025, but attacks surged to record highs Ransomware payments cratered in 2025 – attacks did not Claude collaboration tools left the door wide open to remote code execution Claude collaboration tools left the door wide open to remote code execution AI takes a swing at online anonymity Fake 'interview' repos lure Next.js devs into running secret-stealing malware Threat intelligence supply chain is full of weak links Threat intelligence supply chain is full of weak links RAT disguised as an RMM costs crims $300 a month Android malware taps Gemini to navigate infected devices Android malware taps Gemini to navigate infected devices Posting AI-generated caricatures on social media is risky, infosec killjoys warn Posting AI caricatures on social media is bad for security Payroll pirates conned the help desk, stole employee’s pay Microsoft boffins show LLM safety can be trained away For the price of Netflix, crooks can now rent AI to run cybercrime For the price of Netflix, crooks can now rent AI to run cybercrime Fast Pair, loose security: Bluetooth accessories open to silent hijack Fast Pair flaw exposes Bluetooth devices to hijacking A simple CodeBuild flaw put every AWS environment at risk – and pwned 'the central nervous system of the cloud' A simple CodeBuild flaw put every AWS environment at risk – and pwned 'the central nervous system of the cloud' 'Imagination the limit': DeadLock ransomware gang using smart contracts to hide their work 'Imagination the limit': DeadLock ransomware gang using smart contracts to hide their work Popular Python libraries used in Hugging Face models subject to poisoned metadata attack Mandiant open sources tool to prevent leaky Salesforce misconfigs OpenAI putting bandaids on bandaids as prompt injection problems keep festering OpenAI patches déjà vu prompt injection vuln in ChatGPT Fake Windows BSODs check in at Europe's hotels to con staff into running malware Hotel staff tricked into installing malware by bogus BSODs Your car’s web browser may be on the road to cyber ruin Your car’s web browser may be on the road to cyber ruin China's Ink Dragon hides out in European government networks China's Ink Dragon hides out in European government networks Browser 'privacy' extensions have eye on your AI, log all your chats Honeypots can help defenders, or damn them if implemented badly 10K Docker images spray live cloud creds across the internet 10K Docker images spray live cloud creds across the internet As humanoid robots enter the mainstream, security pros flag the risk of botnets on legs As humanoid robots enter the mainstream, security pros flag the risk of botnets on legs Apache warns of 10.0-rated flaw in Tika metadata ingestion tool Novel clickjacking attack relies on CSS and SVG Novel clickjacking attack relies on CSS and SVG 'Exploitation is imminent' as 39 percent of cloud environs have max-severity React hole Swiss government says give M365, and all SaaS, a miss as it lacks end-to-end encryption Zendesk users targeted as Scattered Lapsus$ Hunters spin up fake support sites Zendesk users targeted as Scattered Lapsus$ Hunters spin up fake support sites HashJack attack shows AI browsers can be fooled with a simple ‘#’ Fresh ClickFix attacks use Windows Update trick-pics to steal credentials Years-old bugs in open source tool left every major cloud open to disruption LLM-generated malware is improving, but don't expect autonomous attacks tomorrow LLM-generated malware improving, but not operational (yet) Researchers claim 'largest leak ever' after uncovering WhatsApp enumeration flaw Researchers claim 'largest leak ever' after uncovering WhatsApp enumeration flaw Tens of thousands more ASUS routers pwned by suspected, evolving China operation Overconfidence is the new zero-day as teams stumble through cyber simulations LLM side-channel attack could allow snoops to guess topic Landfall spyware used in 0-day attacks on Samsung phones MIT Sloan quietly shelves AI ransomware study after researcher calls BS This security hole can crash billions of Chromium browsers, and Google hasn't patched it yet Researchers exploit OpenAI's Atlas by disguising prompts as URLs Devs are writing VS Code extensions that blab secrets by the bucketload AI chatbots that butter you up make you worse at conflict, study finds Tile trackers are a stalker's dream, say Georgia Tech researchers Beijing's RedNovember hacked critical US, global orgs
AI agents abound, unbound by rules or safety disclosures
2026-02-20 · via The Register - Security: Research

AI agents are becoming more common and more capable, without consensus or standards on how they should behave, say academic researchers.

So says MIT’s Computer Science & Artificial Intelligence Laboratory (CSAIL), which analyzed 30 AI agents for its 2025 AI Agent Index, which assesses machine learning models that can take action online through their access to software services.

AI agents may take the form of chat applications with tools (Manus AI, ChatGPT Agent, Claude Code), browser-based agents (Perplexity Comet, ChatGPT Atlas, ByteDance Agent TARS), or enterprise workflow agents (Microsoft Copilot Studio, ServiceNow Agent).

REG AD

The paper accompanying the AI Agent Index observes that despite growing interest and investment in AI agents, "key aspects of their real-world development and deployment remain opaque, with little information made publicly available to researchers or policymakers."

REG AD

The AI community frenzy around open source agent platform OpenClaw, and its accompanying agent interaction network Moltbook – plus ongoing frustration with AI-generated code submissions to open source projects – underscores the consequences of letting agents loose without behavioral rules.

In the paper, the authors note that the tendency of AI agents to ignore the Robot Exclusion Protocol – which uses robots.txt files to signal no consent to scraping websites – suggests that established web protocols may no longer be sufficient to stop agents.

It's a timely topic. Anthropic, one of the main providers of AI agents, on Wednesday published its own analysis of AI agent autonomy, focused more on how agents are used than the consequences of their use.

"AI agents are here, and already they're being deployed across contexts that vary widely in consequence, from email triage to cyber espionage," the company said. "Understanding this spectrum is critical for deploying AI safely, yet we know surprisingly little about how people actually use agents in the real world."

According to consultancy McKinsey, AI agents have the potential to add $2.9 trillion to the US economy by 2030 – assuming the vast capital expenditures by OpenAI and other tech firms haven't derailed the hype train. We note that enterprises aren't yet seeing much of a return on their AI investments. And researchers last year found AI agents could only complete about a third of multi-step office tasks. But AI models have improved since then.

MIT CSAIL's 2025 AI Agent Index covers 30 AI agents. It is smaller than its 2024 predecessor, which looked at 67 agentic systems. The authors say the 2025 edition goes into greater depth, analyzing agents across six categories: legal, technical capabilities, autonomy & control, ecosystem interaction, evaluation, and safety. The AI Agent Index site makes this information available for every listed agent, each with 45 annotation fields.

According to the researchers, 24 of the 30 agents studied were released or received major feature updates during the 2024-2025 period. But the developers of agents talk more about product features than about safety practices.

"Of the 13 agents exhibiting frontier levels of autonomy, only four disclose any agentic safety evaluations (ChatGPT Agent, OpenAI Codex, Claude Code, Gemini 2.5 Computer Use)," according to the researchers.

REG AD

Developers of 25 of the 30 agents covered provide no details about safety testing and 23 offer no third-party testing data.

To complicate matters, most agents rely on a handful of foundation models – the majority are harnesses or wrappers for models made by Anthropic, Google, and OpenAI, supported by scaffolding and orchestration layers.

The result is a series of dependencies that are difficult to evaluate because no single entity is responsible, the MIT boffins say.

Delaware-incorporated companies created 13 of the agents evaluated by the authors. Five come from China-incorporated organizations, and four come have non-US, non-China origins: specifically Germany (SAP, n8n), Norway (Opera), and Cayman Islands (Manus).

Among the five Chinese-incorporated agent makers, one has a published safety framework and one has a compliance standard.

For agents originating outside of China, 15 point to safety frameworks like Anthropic's Responsible Scaling Policy, OpenAI's Preparedness Framework, or Microsoft's Responsible AI Standard. The other ten lack safety framework documentation. Enterprise assurance standards are more common, with only five of 30 agents having no compliance standards documented.

Twenty-three of the evaluated agents are closed-source. Developers of seven agents open-sourced their agent framework or harness – Alibaba MobileAgent, Browser Use, ByteDance Agent TARS, Google Gemini CLI, n8n Agents, OpenAI Codex, and WRITER.

All told, the Index found agent makers reveal too little safety information, and that a handful of companies dominate the market. Other major findings include the difficulty of analyzing agents given their layers of dependencies, and that agents aren't necessarily welcome at every website.

REG AD

The paper lists the following authors: Leon Staufer (University of Cambridge), Kevin Feng (University of Washington), Kevin Wei (Harvard Law School), Luke Bailey (Stanford University), Yawen Duan (Concordia AI), Mick Yang (University of Pennsylvania), A. Pinar Ozisik (MIT), Stephen Casper (MIT), and Noam Kolt (Hebrew University of Jerusalem). ®