惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Hacker News - Newest:
Hacker News - Newest: "LLM"
阮一峰的网络日志
阮一峰的网络日志
博客园 - 聂微东
S
SegmentFault 最新的问题
Jina AI
Jina AI
T
Tailwind CSS Blog
月光博客
月光博客
NISL@THU
NISL@THU
WordPress大学
WordPress大学
Google Online Security Blog
Google Online Security Blog
云风的 BLOG
云风的 BLOG
Cisco Talos Blog
Cisco Talos Blog
小众软件
小众软件
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
S
Security @ Cisco Blogs
P
Proofpoint News Feed
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
罗磊的独立博客
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cisco Blogs
Scott Helme
Scott Helme
S
Securelist
H
Help Net Security
S
Schneier on Security
Martin Fowler
Martin Fowler
AWS News Blog
AWS News Blog
Security Archives - TechRepublic
Security Archives - TechRepublic
S
Secure Thoughts
酷 壳 – CoolShell
酷 壳 – CoolShell
博客园 - 【当耐特】
Last Week in AI
Last Week in AI
T
Tor Project blog
F
Fortinet All Blogs
S
Security Affairs
TaoSecurity Blog
TaoSecurity Blog
Schneier on Security
Schneier on Security
Cloudbric
Cloudbric
C
Cyber Attacks, Cyber Crime and Cyber Security
The GitHub Blog
The GitHub Blog
V
V2EX
SecWiki News
SecWiki News
C
CERT Recently Published Vulnerability Notes
Hacker News: Ask HN
Hacker News: Ask HN
博客园 - 司徒正美
T
Threatpost
T
Tenable Blog
W
WeLiveSecurity
B
Blog RSS Feed
V
Vulnerabilities – Threatpost
Attack and Defense Labs
Attack and Defense Labs

Hacker News - Newest: "LLM"

GitHub - lechmazur/position_bias: A benchmark for testing whether LLM judges keep the same preference when two lightly edited versions of the same story are shown in opposite orders. Flex routing (EU and EFTA) Dark Factories: Retooling for LLM Velocity Ask HN: What would be the impact of a LLM output injection attack? GitHub - AronDaron/dataset-generator: No-code desktop app for generating high-quality synthetic datasets to fine-tune LLMs — plan-then-execute pipeline, LLM-as-judge, HuggingFace upload. GitHub - Oaklight/llm-rosetta: Production-ready LLM API translation layer for Python — bidirectional conversion between OpenAI, Anthropic & Google formats via hub-and-spoke IR. Optional API gateway. Streaming & non-streaming. Zero core deps. Contributions welcome! GitHub - browser-use/browser-harness: Self-healing browser harness that enables LLMs to complete any task. GitHub - moeen-mahmud/remen: Remen turns thoughts into something you can return to Analyzing 156 LLM Launch Posts on Hacker News ChatGPT vs Gemini vs Claude: The Best LLM Subscription You Should Buy GitHub - salaamalykum/quran-semantic-search: High-density RAG Semantic Search Engine & Quran Corpus (GEO/SEO Architecture) GitHub - NVIDIA/TensorRT-LLM: TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way. The State of LLM Bug Bounties in 2026 Operational Readiness Criteria for Tool-Using LLM Agents Meshcore: Architecture for a Decentralized P2P LLM Inference Network How an LLM becomes more coherent as we train it GitHub - seetrex-ai/laimark GitHub - Jossifresben/BibCrit: AI-assited biblical textual criticism GitHub - wastedcode/memex: File system based wiki, maintained by Claude 99helpers.com GitHub - cliver-project/AITrigram GitHub - unbody-io/adapt: A self-evolving memory layer for AI agents. GitHub - hb20007/awesome-gen-ai-fails: A list of incidents where reliance on generative AI and LLMs resulted in harm to companies, individuals, or society GitHub - nevenkordic/localmind: Run any local LLM with persistent memory and context. CLI agent over Ollama with SQLite-backed hybrid recall. No cloud. Ask HN: What are the machine requirements for a LLM like Llama-3.1-8B? Faster LLM Inference via Sequential Monte Carlo grpo explained: group relative policy optimization for llm finetuning - cgft Stop comparing price per million tokens: the hidden LLM API costs · TensorZero Andrej Karpathy's LLM Wiki Is a Bad Idea GitHub - GG-QandV/mnemostroma: Offline RAM-first cognitive leer/coprocessor for AI agents and robotics. Solves "Context Abandonment" with 20-80ms latency using a dual-thread biomimetic memory architecture (ONNX + SQLite WAL). mempalace/agent at agent · skorotkiewicz/mempalace GitHub - Nyquest-ai/nyquest-rust-fullstack-pub: Nyquest — Semantic Compression Proxy for LLMs. 350+ rules, local LLM stage, 15-75% token savings. Full Rust stack. GitHub - TheoV823/mneme: Enforce architectural decisions in AI-assisted development. GitHub - klemenvod/TokenBrawl: A 1v1 Bomberman-style game where two LLM agents play autonomously against each other. No human plays — you watch the AIs fight. Each agent receives a text description of the board state, reasons about it, and outputs a move as JSON. The game engine executes it. Introducing the Common AI Provider: LLM and AI Agent Support for Apache Airflow Power Circuit AI: Designing Power Electronic Circuits for Motor Drives with Generative Artificial Intelligence Ask HN: How to program with IDE and LLM on CPU locally? Show HN: Agent-cache – Multi-tier LLM/tool/session caching for Valkey and Redis Bonsai 1-bit WebGPU - a Hugging Face Space by webml-community The LLM Fallacy: Misattribution in AI-Assisted Cognitive Workflows Ask HN: Simple tooling for local LLM code critique without IDE integration? Can a General LLM Diagnose a DICOM Slice? A 10-Case Public Benchmark Charts-of-Thought: Enhancing LLM Visualization Literacy (PDF, 2026) GitHub - Mesh-LLM/mesh-llm: Distributed AI/LLM for the people. Share compute privately or publicly to power your agents and chat. GitHub - seamus-brady/springdrift: A persistent runtime for long-lived LLM agents Writing an LLM from scratch, part 32k -- Interventions: training a better model locally with gradient accumulation Ask HN: Which LLM model and agentic CLI are you using for local development? GitHub - wayneColt/modelcascade: Route local. Escalate smart. Never overspend. Open-source multi-model cascade routing for autonomous agents. LLM pricing is 100x harder than you think GitHub - asakin/llm-primer: Pre-warmed Claude Code sessions in tmux. No startup wait. GitHub - EggerMarc/chat-rs: A multi-provider LLM framework for Rust. GitHub - SynapseKit/SynapseKit: Minimal, async-first Python framework for production LLM apps- 2 hard deps, no magic, no SaaS. A Claude Skill that Makes LLM Paragraphs More Bearable Does Gas Town 'steal' usage from users' LLM credits & paid services to improve itself? What's Claude Code Actually Doing? Open the Black Box with the Arthur Engine Milla Jovovich's New Open Source LLM Memory App and the Dark Code Problem Your intuition of LLM token usage might be wrong Show HN: Bloomberg Terminal for LLM ops – free and open source GitHub - 0xchamin/mcptube: Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Works as an MCP server for Claude, Copilot & more. Show HN: Open KB: Open LLM Knowledge Base Your LLM is a compiler, not a runtime GitHub - sapountzis/Unslop: A Web Feed That Deserves You crates.io: Rust Package Registry Beyond Karpathy's LLM-Wiki: The Necessity of Cognitive Governance GitHub - amitshekhariitbhu/llm-internals: Learn LLM internals step by step - from tokenization to attention to inference optimization. GitHub - parallem-ai/parallem: An expressive library for running agents with the Batch API. GitHub - stfurkan/pi-llm LLM-Wiki Show HN: Formal – Formal verification for AI-generated code using Lean 4 LRTS – Regression testing for LLM prompts (open source, local-first) LLM Wiki Skill: Build a Second Brain with Claude Code and Obsidian I built an LLM Wiki and RAG solution: here's a demo for a security KB The biggest advance in AI since the LLM Predict-Rlm: The LLM Runtime That Lets Models Write Their Own Control Flow the-synthetic-library/the-synthetic-mind at main · joshferrer1/the-synthetic-library GitHub - yisding/reviewwiggum GitHub - Donnyb369/mcp-spine: Context Minifier & State Guard — Local-first MCP middleware proxy GitHub - Beledarian/wgpu-llm: A from-scratch LLM inference engine that uses wgpu (the cross-platform WebGPU implementation) to dispatch WGSL compute shaders for every math operation a Transformer needs. No CUDA. No Python. No massive framework dependencies. Just Rust, raw shaders, and your GPU. GitHub - anitiue/Hindsight: An experience-driven self-improvement framework for LLM agents — 基于经验的 LLM Agent 自我改进框架 GitHub - stef41/lmscan: 🔍 Detect AI-generated text and fingerprint which LLM wrote it. Open-source GPTZero alternative. Zero dependencies, works offline. GitHub - alainnothere/AmdPerformanceTesting: Amd Performance Testing Ask HN: Is a purely Markdown-based CRM a terrible idea? Optimized for LLM agents Context Engineering - LLM Memory and Retrieval for AI Agents | Weaviate little_helper_tui/letter.md at main · sleepyeldrazi/little_helper_tui GitHub - EvanZhouDev/umr: The Unified Model Registry for all your local AI apps. GitHub - JordanCT/VigIA-Orchestrator Your Agent Is Mine: Measuring Malicious Intermediary Attacks on the LLM Supply Chain A Taxonomy of RL Environments for LLM Agents Llama LLM Network Feture GitHub - genedeng-ca/ai-mac-migration: AI-powered Mac-to-Mac migration tool - replace Apple Migration Assistant with intelligent, selective transfer using local LLMs GitHub - lunargate-ai/gateway: High-performance self-hosted AI gateway (OpenAI-compatible) with routing, retries, and streaming GitHub - AuthBits/webmcp: A lightweight, prompt-driven MCP web research server for high-quality LLM powered information extraction. Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering Springdrift: An Auditable Persistent Runtime for LLM Agents with Case-Based Memory, Normative Safety, and Ambient Self-Perception High-Stakes Personalization: Rethinking LLM Customization for Individual Investor Decision-Making From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents HUOZIIME: An On-Device LLM-enhanced Input Method for Deep Personalization TIDE: Token-Informed Depth Execution for Per-Token Early Exit in LLM Inference Characterizing WebGPU Dispatch Overhead for LLM Inference Across Four GPU Vendors, Three Backends, and Three Browsers LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users
Introducing Lightpanda Agent and PandaScript: LLM at buildtime not runtime - Blog | Lightpanda
Francis Bouvier · 2026-06-17 · via Hacker News - Newest: "LLM"

Francis Bouvier

Francis Bouvier

Cofounder & CEO

Introducing Lightpanda Agent and PandaScript: LLM at buildtime not runtime

TL;DR

A browser agent today is four separate tools wired together: Chrome, a CDP library, an LLM, and an agent framework. Our native agent collapses those four layers into one binary.

You talk to it in natural language and it does the work against a real browser. “Go to this website”, “login”, “extract this data”, “tell me that”: language is the new native interface for the browser. And if you want to replay your session, it hands you a reproducible script. No CDP, no heavy browser server, no complex setup, no LLM at runtime: everything is built in into Lightpanda.

To run a browser agent today, you install Chrome, drive it over CDP with Puppeteer or Playwright, wire in an LLM to make decisions, then wrap the whole thing in an agent framework to orchestrate the calls. It works. Well, kind of: it’s a lot of moving parts for what’s often, “load a page and read some elements off it.”

Each of those layers exists to translate between a human-facing browser and a machine that wants to drive it. CDP (the Chrome DevTools Protocol) was built so a developer could inspect a running browser from the outside, not so a program could operate one. And to wire an LLM, you need an MCP-to-CDP layer. The agent wraps a model around a tool that was never meant to be driven by a model. You are paying a translation tax at every layer.

When we started Lightpanda four years ago, the question was: “what would a browser look like if you built it for machines instead of people looking at a screen?” is that bet applied to the agentic stack. It is one binary that contains the browser, the runtime, and the agent.

The browser is the same engine behind and : it loads webpages, runs JavaScript, and handles the DOM.

The runtime consists of a small set of native tools (, , , , , ), that let you drive the browser (slash commands). The LLM that reads your request and picks tools is optional. It runs against Anthropic, OpenAI, Gemini, Hugging Face or local Ollama, or with no key at all in slash-only mode.

The LLM runs at buildtime, not runtime

This is the idea that everything else hangs off. Every other browser agent is a black box that calls a model on every step. With , the model figures out the task, we capture that work as code, and generate a script (we call it PandaScript) that is reproducible and deterministic. When you replay it, you don’t need a model and the LLM is gone from the loop.

There’s no model call sitting between you and the next action at replay time, no Chrome process to host, no NodeJS/Python environment to setup, and no Playwright/Puppeteer code to write. You just pass the script to Lightpanda binary, so a run is bound only by how fast the engine drives the page. And because nothing non-deterministic runs at replay, the same script produces the same result every time. You pay the model once to write the file. After that, it’s plain JavaScript that you own.

Here’s what that PandaScript looks like. This script grabs the top 3 Hacker News stories, then visits each thread for its top comments:

Readable vanilla JavaScript with loops, map, filter, and try/catch. And a few built-in primitives for our native tools. That’s it. The last top-level expression auto-prints as JSON.

And of course you can also write or edit a PandaScript manually, or ask your AI coding assistant to do so if you prefer.

PandaScript doesn’t use CDP, by design

Lightpanda browser still speaks CDP with , and we are actively developing it. The decision here is narrower: we chose not to put CDP inside the agent.

Traditional headless automation marshals every action across CDP, with hundreds of methods running against a browser in a separate process.. Our agent skips that. It runs in-process against Lightpanda’s engine and calls a small set of native commands directly.

This gives you two things:

  1. There is no serialization overhead, because a native in-process call replaces marshalling every click and fill across a wire protocol.
  2. Setup is easier, because one binary drives itself: there is no separate browser to launch, no debugging port to wire up, no NodeJS/Python environment to setup, and no CDP client to manage.

As a bonus, the native commands are the same tool surface whether you drive them with natural language instructions, with commands, or with an external LLM through .

That’s one (of many) advantages of developing a browser from scratch instead of forking Chromium: it allows us to build new AI features natively.

Get started

Point an API key at it, or run it with none:

In the REPL, explore in English or with , , and the rest. You can generate a reproducible PandaScript from the current session with (alpha feature), and then , then replay it with .

Try the end-to-end walkthrough

The agent tutorial  takes you through the whole loop: log in to Hacker News, extract stories, save, and replay. If you want the full reference, the agent docs cover every provider, slash command, and flag, and the script format docs explain the JavaScript API.

FAQ

What is Lightpanda Agent?

It is a built-in agent that drives a headless browser by translating your requests into native browser actions, in natural language or as slash commands. It runs in-process against Lightpanda’s own engine and can use a model from Anthropic, OpenAI, Gemini, Hugging Face or local Ollama, or no model at all.

What is a PandaScript?

PandaScript  is a script to automate browser actions and workflows, designed to replace Puppeteer or Playwright. It can be run directly on the Lightpanda binary without needing a separate client setup with NodeJS/Python. It’s vanilla JavaScript with a small set of native primitives.

Does the agent use CDP?

No, and that is deliberate. The agent calls native in-process commands instead. Lightpanda is still actively developing CDP for , so existing Puppeteer and Playwright workflows are unaffected.

Can I run it without an LLM or API key?

Yes. Use for slash-commands-only mode, where you type , , and directly. Replaying a saved script doesn’t require a model or an API key, so recorded sessions run token-free and deterministically.

How does work?

During a session, every state-changing call is recorded. With an LLM connected, generates a reproducible PandaScript from the session intent (this is an alpha feature, we are actively developing and improving it). Without an LLM, it transcribes the recorded calls verbatim.

How are credentials handled?

Secrets are written as placeholders, like . They are resolved at runtime inside Lightpanda, so they never enter the model context or the saved script file.

Can I drive Lightpanda from an external agent instead?

Yes. The native MCP server  exposes the same tool surface to any MCP-aware client, like Claude Code, with no model running inside Lightpanda. See the MCP server guide  for setup.


Francis Bouvier

Francis Bouvier

Cofounder & CEO

Francis previously cofounded BlueBoard, an ecommerce analytics platform acquired by ChannelAdvisor in 2020. While running large automation systems he saw how limited existing browsers were for this kind of work. Lightpanda grew from his wish to give developers a faster and more reliable way to automate the web.