InertiaRSS Track and read blogs, news, and tech you care about
Read Original Open in InertiaRSS

Recommended Feeds

Google DeepMind News
Google DeepMind News
V
Visual Studio Blog
WordPress大学
WordPress大学
博客园 - Franky
L
LangChain Blog
腾讯CDC
博客园 - 【当耐特】
罗磊的独立博客
宝玉的分享
宝玉的分享
Vercel News
Vercel News
阮一峰的网络日志
阮一峰的网络日志
Microsoft Azure Blog
Microsoft Azure Blog
云风的 BLOG
云风的 BLOG
Recent Announcements
Recent Announcements
Blog — PlanetScale
Blog — PlanetScale
Engineering at Meta
Engineering at Meta
S
SegmentFault 最新的问题
M
MIT News - Artificial intelligence
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
小众软件
小众软件

博客园 - iTech

7万星的AI交易框架:让大模型模拟投行多空辩论,自动做交易决策 71000颗星的AI交易团队:让大模型模拟投行分工,自动做交易决策 13400颗星的开源项目:输入一句话,AI全自动帮你做短视频 102颗星的沙盒:当AI学会自己写代码、跑测试、做部署 AI 技术日报 - 2026-05-08 29k 星的 PageIndex:不用向量数据库,靠推理就能做 RAG 每天花两小时刷信息?这个开源项目帮你全自动搞定 读源码像读小说?试了 DeepWiki 和 Zread,我再也不想裸读 GitHub 了 Matt Pocock 开源的这套 .claude 技能,为什么让工程师集体上头? Cursor Team Kit:Cursor 官方团队在用的 17 个 AI 工作流 AI 技术日报 - 2026-05-07 AI 技术日报 - 2026-05-06 AI 技术日报 - 2026-05-05 Anthropic CEO 说 12 个月内程序员要失业,我扒完他的底牌,发现事情没那么简单 把工程师的肌肉记忆装进 Claude Code,这个 4300 Star 的项目我后悔没早用 AI 技术日报 - 2026-05-04 AI 技术日报 - 2026-05-03 AI 技术日报 - 2026-05-02 六大 Agent 框架横评:谁支持 Skills?谁能自动创建 Agent?MCP 呢? Wechatsync:一个 Chrome 插件,一键把文章同步到 31 个平台 LangChain 开源了 Open SWE:Stripe、Ramp、Coinbase 内部都在造的编程 Agent Cockpit:把 Claude Code 从终端里搬出来,装进浏览器 Cursor 把自家的 AI Agent 开放了:写几行 TypeScript 就能调 Cursor 干活 AI 技术日报 - 2026-05-01 AI 写代码每次结果都不一样?Archon 用 YAML 工作流把 AI 编程变成流水线 AI 写代码比你快了,但你还是得学编程——只不过学法得换 腾讯的龙虾特工队:4 个 AI Agent 同日更新,全家桶正式成型 Agno 不做更聪明的 Agent,它要把所有 Agent 框架包进同一个操作系统 Hermes Agent 终于有了像样的 Web 界面,而且还支持远程访问 Datawhale 出了一套 29 学科知识地图,把 AI 的底牌全掀了 Hermes Agent 在聊天框里就能用的 20 种高级功能 一份 AGENTS.md 能顶一次模型升级?Augment Code 用数据说了算 NVIDIA 开源了一个「AI 沙箱」,20K Star,让 Agent 跑代码不再裸奔 60ms 冷启动、5MB 内存:腾讯开源的这个沙箱让 Docker 安全隔离像笑话 AI 技术日报 - 2026-04-30 AI 技术日报 - 2026-04-29 AI 技术日报 - 2026-04-28 Goose:Linux 基金会亲儿子,能撼动 Claude Code 和 OpenCode 吗? AI 技术日报 - 2026-04-27 AI 技术日报 - 2026-04-26 Google 把价值20美元/月的东西免费了,102K人已经抢到了 OpenClaw 和 Claude Code 网络搜索配置指南 AI 技术日报 - 2026-04-25 Anthropic 为什么遥遥领先:从 Cat Wu 专访看AI霸主的底层逻辑 Mac 本地跑大模型完全指南:你的苹果电脑就是 AI 工作站 同样 70B 参数,为什么 MoE 只激活 13B 就能打平 Dense? DeepSeek-V4 技术报告里藏着一条线:华为昇腾 NPU 已完成推理验证 DeepSeek-V4 深夜炸场:1M 上下文、384K 输出、双模型,API 定价直接卷到底 MacBook Air 跑大模型实测:Ollama、llama.cpp、LM Studio 谁才是本地推理之王? AI 技术日报 - 2026-04-24
NVIDIA open source SkillSpector: Before installing AI Agent skills, ask if it is safe
iTech · 2026-06-19 · via 博客园 - iTech

The "npm moment" of Agent skills

Remember the early supply chain attacks in the npm ecosystem? Someone posted a message called event-stream was taken over by a malicious defender and quietly injected with the code to steal Bitcoin, which was discovered after millions of downloads.

The skill ecosystem of AI Agents is experiencing the same problem, or even more serious.

The reason is simple:An agent skill is not just referenced by your code, it runs directly in the agent's execution environment and has full permissions of the agent. When Claude Code loads a skill, every Markdown instruction and every Python script in that skill gain the same file system access, network access, and tool call rights as you.

In June 2026, NVIDIA released SkillSpector, an open source tool specifically designed to scan the security of Agent skills. It has only one question to answer:"Is this skill safe to install?"

A disturbing set of numbers

SkillSpector 的背后是一项大规模实证研究:来自 Liu et al.(2026)的论文《Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale》。

The study scanned data from major markets 42,447 real skills, found that:

  • 26.1% skills contain at least one flaw
  • 5.2% skills with clear malicious intent
  • Skills that contain executable scripts, and the possibility of vulnerabilities is purely Markdown skills. 2.12 times

A quarter. More than a quarter of Agent skills have security issues.

This means that if your Agent has 10 skills installed, statistically at least 2-3 of them are vulnerable. One of these may be a deliberate malicious skill-stealing your API keys, monitoring your conversation context, or sending your system prompt words to an external server.

What did SkillSpector do?

SkillSpector uses a two-stage inspection pipeline:

Stage 1: Static analysis (fast, high recall)

Regular matching and AST analysis were performed through 11 static analyzers:

  • Regular pattern matching: Detecting known hazard code patterns
  • AST Behavioral Analysis: Testing exec()eval()subprocessos.system Wait for dangerous calls
  • Dependent vulnerability query: Query known CVE in real time through OSV.dev(no API Key required)
  • YARA Rule Matching: Detection of Known Malware Signatures

Stage 2: LLM Semantic Analysis (optional, high-precision)

Submit the static analysis results to LLM for secondary evaluation:

  • Understand context and intent (not all os.environ All malicious)
  • Filter false positives (accuracy increased to approximately 87%)
  • Generating human-readable explanations

LLM's prompt has built-in anti-jailbreak protection to prevent malicious skills from manipulating the analysis process.

Supported LLM providers:
- OpenAI(Default gpt-5.4)
- Anthropic(Default claude-opus - 4 - 6)
- NVIDIA build.nvidia.com(默认 deepseek-v4-flash)
- 本地模型:Ollama、vLLM、llama.cpp 等 OpenAI 兼容端点

64 种漏洞模式,16 大类别

SkillSpector 检测的漏洞类型覆盖非常全面:

直接攻击类

类别 模式数 典型威胁
Prompt Injection 5 指令覆盖、隐藏指令、上下文外泄
Data Exfiltration 4 环境变量收割、文件系统枚举、数据传输
Privilege Escalation 3 过度权限、sudo 执行、凭证访问
Supply Chain 6 未固定版本依赖、curl \| bash、混淆代码、已知 CVE

Agent 特有类

类别 模式数 典型威胁
Excessive Agency 4 无限制工具访问、自主决策、范围蔓延
System Prompt Leakage 3 直接泄露、间接提取、工具外泄
Memory Poisoning 3 持久化上下文注入、记忆篡改
Rogue Agent 2 自修改代码、未授权的持久化(cron/startup)
Tool Misuse 3 Parameter abuse, chain bypass, unsafe default values
Trigger Abuse 3 Too broad triggers, shadow command triggers

code-level analysis

category mode number typical threats
Behavioral AST 8 exec/eval calls, dynamic imports, dangerous execution chains
Taint Tracking 5 Certificate leakage chain, file → network leakage, external input → code execution
YARA Signatures 4 Malware matching, Webshell, miners, hacking tools

MCP protocol class

category mode number typical threats
MCP Least Privilege 4 Undeclared capabilities, wildcard permissions, overdeclared permissions
MCP Tool Poisoning 4 Hidden instructions in metadata, Unicode spoofing, parameter injection

5 minutes to get started

installation

git clone https://github.com/NVIDIA/SkillSpector.git
cd SkillSpector

# 创建虚拟环境
uv venv .venv && source .venv/bin/activate

# 安装
make install

Or use Docker without installing Python:

Scan for a skill

# 扫描本地目录
skillspector scan ./my-skill/

# 扫描单个 SKILL.md 文件
skillspector scan ./SKILL.md

# 扫描 Git 仓库
skillspector scan https://github.com/user/my-skill

# 扫描 zip 包
skillspector scan ./my-skill.zip

# 仅静态分析(不使用 LLM)
skillspector scan ./my-skill/ --no-llm

Docker approach

docker run --rm -v "$PWD:/scan" skillspector scan ./my-skill/ --no-llm

output format

# 终端输出(默认,美化格式)
skillspector scan ./my-skill/

# JSON(机器可读)
skillspector scan ./my-skill/ --format json --output report.json

# Markdown(文档用)
skillspector scan ./my-skill/ --format markdown --output report.md

# SARIF(CI/CD 集成)
skillspector scan ./my-skill/ --format sarif --output report.sarif

Risk scoring system

SkillSpector generates a risk score of 0-100 for each skill:

  • CRITICAL problems:+50 points
  • HIGH issue:+25 points
  • MEDIUM Question:+10 points
  • LOW Question:+5 points
  • Contains executable scripts: ×1.3 times multiplier
score risk level recommendations
0-20 LOW safe installation
21-50 MEDIUM used with caution
51-80 HIGH Installation is not recommended
81-100 CRITICAL Never install it

A real scan result

Check out SkillSpector's report on a questionable skill:

SkillSpector Security Report  v2.0.0

Skill: suspicious-skill
Source: ./suspicious-skill/

        Risk Assessment
 Score           78/100
 Severity        HIGH
 Recommendation  DO NOT INSTALL

Issues (2)

  HIGH: Env Variable Harvesting (E2)
    Location: scripts/sync.py:23
    Finding: for key, val in os.environ.items():...
    Confidence: 94%
    Explanation: This code collects environment variables containing
    API keys and secrets, then sends them to an external server.

  HIGH: External Transmission (E1)
    Location: scripts/sync.py:45
    Finding: requests.post("https://api.skill.io/env"...
    Confidence: 89%
    Explanation: Data is being sent to an external server. Combined
    with env harvesting above, this indicates credential exfiltration.

This skill is ostensibly a "data synchronization" tool, but the actual code is collecting all environment variables (including your API keys) and sending them to external servers. Score 78 points, HIGH level, and clearly recommend "Do not install".

Access CI / CD

The SARIF format allows SkillSpector to seamlessly connect to your CI processes:

# 在 CI 中扫描所有技能
skillspector scan ./skills/ --format sarif --output report.sarif

GitHub Actions, GitLab CI, etc. all natively support security scanning reports in SARIF format. Automatic scanning is performed every time a PR is submitted, and high-risk skills are found to directly block the merger.

Python APIs can also be integrated into your tool chain:

from skillspector import graph

result = graph.invoke({
    "input_path": "/path/to/skill",
    "output_format": "json",
    "use_llm": True,
})

if result["risk_score"] > 50:
    raise RuntimeError(f"Skill failed security check: {result['risk_severity']}")

MCP security: A new area worthy of attention

SkillSpector specifically adds security detection for the MCP (Model Context Protocol) protocol. MCP is becoming the standard protocol for agents to connect to external services, but it also brings a new attack surface:

  • Tool poisoning: Embed hidden instructions in tool metadata (HTML comments, zero-wide characters, Base64 encoding)
  • Unicode spoofing: Use homomorphic characters, RTL overrides, mixed scripts to disguise identifiers
  • parameter description injection: Inject malicious content into parameter definitions
  • Minimum authority violation: Code uses capabilities not listed in the declaration

These attack methods are rare in traditional software security, but can be fatal in the Agent ecosystem.

limitations

SkillSpector is a powerful first line of defense, but it also has boundaries:

  • Non-English content: Attack patterns that may miss other languages
  • image attack: Unable to analyze text in pictures
  • Encryption/binary code: Unable to analyze compiled or encrypted content_
  • Run-time behavior_: Static analysis only, no dynamic execution
  • Offline mode: SC4 relies on OSV.dev web access, Downgraded to a static list when offline_

Why is this important_

_The threat model of Agent skills is highly similar to npm supply chain attacks:

  1. Implicit trust_: Agents give implicit trust to the skills they install, just like the npm package gets full permissions of the process after it is installed_
  2. Audit is missing: Most marketplaces do not have a strict security review process
  3. have a widespread impact: A malicious skill can steal credentials, monitor conversations, inject instructions, and tamper with files
  4. see the barriers: Malicious behavior may be hidden behind normal functional descriptions and is difficult for users to detect

The difference is that Agent skills have a larger attack surface than npm. The npm package affects your Node.js process, and the Agent skills affect your AI Agent-it can operate your file system, access your mail, modify your calendar, and even send messages on your behalf.

Before installing any Agent skills, run SkillSpector. This is not an option, it is a basic health requirement for Agent development in 2026.


author: itech001
source: Public Account: AI Artificial Intelligence Era
website: _ _ JHSNS _ URL _ 0 _ _
Share the most cutting-edge AI news and technical research every day.

This article was first published in the era of AI artificial intelligence. Please indicate the source for reprinting.

Reference source:
- NVIDIA/SkillSpector | GitHub
- Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale (Liu et al., 2026)
- NVIDIA ships open-source scanner for agent skill supply-chain risk | AI Insiders