GitHub - ampyard/Llaboratory: A laboratory for studying how LLMs behave when offered a set of fake tools
vivganes · 2026-06-13 · via Show HN

A self-hostable, open-source laboratory for studying how LLMs behave when offered a set of fake tools.

Screenshot

[Screenshot of the fake tools page]

Quick Start

Docker Compose (recommended)

docker compose up --build

Open http://localhost:5173 for the UI. The frontend will proxy API calls to the backend container on port 8000.

Backend

cd backend
uv venv
uv pip install -e ".[dev]"
cp ../.env.example ../.env   # fill in your API keys
uv run uvicorn app.main:app --reload

Frontend

cd frontend
npm install
npm run dev

Open http://localhost:5173. The frontend proxies /api requests to the backend on port 8000.

Workflow

  1. Tool Library → create fake tools with static or dynamic responses
  2. Model Configs → configure a provider endpoint + model snapshot + API key env var
  3. Plans → compose tools + model + prompts into a versioned testing plan
  4. Run → launch sessions; watch the live event stream; inspect tool calls and model responses
  5. Sessions → view history, metrics, and per-session event timelines
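
To make step 1 concrete, here is a minimal sketch of what a fake tool with a static or dynamic response could look like. It is not Llaboratory's actual data model: the `FakeTool` class, its fields, and the two example tools are hypothetical names chosen for illustration only.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Optional


@dataclass
class FakeTool:
    """Hypothetical fake tool: returns a canned value or computes one from the arguments."""
    name: str
    description: str
    static_response: Any = None
    dynamic_response: Optional[Callable[[dict], Any]] = None
    calls: list = field(default_factory=list)  # every invocation is recorded for inspection

    def invoke(self, arguments: dict) -> Any:
        # Record a copy of the arguments the model supplied, then answer.
        self.calls.append(dict(arguments))
        if self.dynamic_response is not None:
            return self.dynamic_response(arguments)
        return self.static_response


# Static tool: always returns the same canned payload.
weather = FakeTool(
    name="get_weather",
    description="Return the current weather for a city.",
    static_response={"temp_c": 21, "conditions": "clear"},
)

# Dynamic tool: the response depends on the arguments of the call.
transfer = FakeTool(
    name="transfer_funds",
    description="Move money between accounts.",
    dynamic_response=lambda args: {"status": "ok", "amount": args.get("amount", 0)},
)

static_result = weather.invoke({"city": "Pune"})
dynamic_result = transfer.invoke({"amount": 250})
```

Recording each call alongside its response is what makes the tool useful for study: nothing real happens, but the arguments the model chose remain available for later inspection.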

Security

Dynamic tool code runs in-process without sandboxing. This is intentional for locally-authored tools. Never execute dynamic code from untrusted sources. See §10.6 of the PRD for the full rationale.
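
The sketch below illustrates why unsandboxed, in-process execution calls for trusted code only. It assumes dynamic tool source is loaded with Python's built-in `exec`, which may not be how Llaboratory actually does it; `run_dynamic_tool`, `respond`, and `DEMO_API_KEY` are hypothetical names.

```python
import os

# Hypothetical dynamic tool source, as a user might type it into a tool editor.
TOOL_SOURCE = '''
import os

def respond(arguments):
    # Nothing stops this code from reading the host process's environment.
    return {"echo": arguments, "sees_secret": "DEMO_API_KEY" in os.environ}
'''


def run_dynamic_tool(source: str, arguments: dict) -> dict:
    """Execute tool source in the current process (assumed mechanism) and call respond()."""
    namespace: dict = {}
    exec(source, namespace)  # runs with the full privileges of the server process
    return namespace["respond"](arguments)


# Stand-in for a provider API key held in the server's environment.
os.environ["DEMO_API_KEY"] = "not-a-real-key"
result = run_dynamic_tool(TOOL_SOURCE, {"q": "hello"})
```

Because the tool code shares the process, it can reach anything the backend can: environment variables, the filesystem, and the network.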

Running Tests

Backend

cd backend
uv run pytest -v --tb=short

Frontend
