LLM-Powered Server Setup from Markdown

2026-04-10 · via Hacker News - Newest: "LLM"

LLM-Powered Server Setup

Describe your server in Markdown. Provision does the rest.

Provision converts a plain-English server configuration into a verified execution plan, runs it on most Unix-like systems (Debian, RHEL, macOS, etc.), and automatically repairs common failures. Bring one API key (Anthropic, OpenAI, or Google). Typical provisioning runs cost very little in API usage. Config can come from a URL or a local Markdown file on the server.

Open source. Single Python script. Read it first, then run it on fresh VMs/LXCs.

curl -sSL http://provision.sh/provision.py | python3 -

Config URL: https://example.com/my-server.md
API Key (Anthropic/Google/OpenAI): •••••••••••
Planning tasks... ✓
Executing... [14/14] ✓

Complete: 14/14 tasks succeeded

What Provision Is

Provision is an open-source, single Python script. You can inspect the whole runtime path before execution: Markdown input, planned tasks, generated commands, verification, and repair logic. Review it first, then run it.

Single Script, Fully Reviewable

Everything lives in one Python file so you can audit behavior end to end before use.

Open Source by Design

The tool is meant to be transparent, inspectable, and easy to validate in your own environment.

Human-Readable Input

Define setup in Markdown using sections and bullets. No rigid schema required.

URL or Local File

Use a hosted Markdown config URL or point Provision at a local config file already on the server.

Distro Support

Works on most Unix-like systems (Debian, Ubuntu, RHEL, CentOS, macOS, etc.).

Bring Any Major API Key

All you need is a valid Anthropic, Google, or OpenAI API key.

How It Works

The runtime flow follows a fixed sequence: gather input, plan discrete tasks, execute safely, verify, repair on failure, and ask for help only when absolutely needed.

1. Bootstrap Python and dependencies, then collect the config source (URL or local file) and API key.

2. Load the Markdown config and prompt for {{VARIABLE}} placeholders.

3. Send the config to the planner prompt to produce an ordered JSON checklist.

4. Generate non-interactive commands for each task and execute them with timeout controls.

5. Run a verification command; on failure, invoke self-repair for up to 3 attempts.

6. Summarize all actions, logs, and final success status in a clear report.
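Steps 3 and 4 can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Provision's actual code: the checklist schema (`id`, `description`, `command`), the helper names `parse_plan` and `run_task`, and the 300-second default timeout are all hypothetical.

```python
import json
import subprocess


def parse_plan(raw: str) -> list[dict]:
    """Parse a planner response into an ordered task list (hypothetical schema)."""
    tasks = json.loads(raw)
    if not isinstance(tasks, list):
        raise ValueError("plan must be a JSON array of tasks")
    for task in tasks:
        if not isinstance(task, dict):
            raise ValueError("each task must be a JSON object")
        missing = {"id", "description", "command"} - set(task)
        if missing:
            raise ValueError(f"task missing fields: {sorted(missing)}")
    # Execute in the order the planner assigned, not the order received.
    return sorted(tasks, key=lambda t: t["id"])


def run_task(command: str, timeout_s: float = 300.0) -> tuple[bool, str]:
    """Run one shell command non-interactively; return (success, output)."""
    try:
        proc = subprocess.run(
            command,
            shell=True,
            stdin=subprocess.DEVNULL,  # never block waiting for user input
            capture_output=True,
            text=True,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return False, f"timed out after {timeout_s}s"
    return proc.returncode == 0, proc.stdout + proc.stderr
```

Closing stdin and bounding the runtime are two simple ways to keep a generated command from hanging a run, for example when a package manager prompts for confirmation.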

Config Example

Provision accepts straightforward Markdown specs. Full example shown below, or open: rs_config.md

# Robert's standard setup

## System
- Set the timezone to America/Los_Angeles
- Ensure the system is fully updated

## Packages
- Install: openssh-server, joe, htop, curl, wget, git, fish, cifs-utils

## Users
- Create user rcs1000 with password {{secret:RCS_PASSWORD}}
- Add rcs1000 to the sudo group
- Set rcs1000's default shell to fish

## SSH Configuration
- Disable root login via SSH
- Allow password login
- Restart the SSH service

## Custom
- Install Tailscale using the official Tailscale apt repository
- Do not start the Tailscale service, as it requires logging in
- Install uv using the official uv repository
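A config like this one contains a `{{secret:RCS_PASSWORD}}` placeholder that must be filled in before planning. The sketch below shows one way such placeholders might be discovered. The exact placeholder grammar is not documented here, so the regular expression, the function name `find_placeholders`, and the `{{HOSTNAME}}` example in the usage note are assumptions.

```python
import re

# Assumed grammar: {{NAME}} for plain variables, {{secret:NAME}} for secrets.
PLACEHOLDER = re.compile(r"\{\{(secret:)?([A-Za-z_][A-Za-z0-9_]*)\}\}")


def find_placeholders(markdown: str) -> list[tuple[str, bool]]:
    """Return (name, is_secret) pairs in first-seen order, without duplicates."""
    seen: dict[str, bool] = {}
    for match in PLACEHOLDER.finditer(markdown):
        name = match.group(2)
        is_secret = match.group(1) is not None
        seen.setdefault(name, is_secret)
    return list(seen.items())
```

For a config containing `{{secret:RCS_PASSWORD}}` followed by a hypothetical `{{HOSTNAME}}`, this returns `[("RCS_PASSWORD", True), ("HOSTNAME", False)]`. A caller could then read secret values with `getpass.getpass` and the rest with `input`.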

Security and Guardrails

Provision is intended for fresh installs on trusted infrastructure. It includes practical controls like preview mode, bounded retries, explicit user escalation, and a fully reviewable single-script runtime: provision.py.

Single Script You Can Audit

Provision is one Python file. Read it end-to-end before execution: provision.py.

Plan Before Run

--plan shows intended commands before any changes are executed.

Controlled Retries

Self-repair is capped (3 attempts) to prevent runaway loops and hidden repeated failures.
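A capped repair loop might look like the sketch below. It is illustrative only: the function `run_with_repair` and its callback parameters are hypothetical, and it assumes the cap of 3 counts repair attempts made after the initial run. In practice `suggest_fix` would be an LLM call that receives the failed command and its output.

```python
from typing import Callable

MAX_REPAIR_ATTEMPTS = 3  # the cap stated in the text


def run_with_repair(
    command: str,
    execute: Callable[[str], tuple[bool, str]],
    verify: Callable[[], bool],
    suggest_fix: Callable[[str, str], str],
) -> tuple[bool, int]:
    """Run a command, then retry with suggested fixes up to the cap.

    Returns (succeeded, repair_attempts_used).
    """
    ok, output = execute(command)
    if ok and verify():
        return True, 0
    for attempt in range(1, MAX_REPAIR_ATTEMPTS + 1):
        # Ask for a replacement command based on what just failed.
        command = suggest_fix(command, output)
        ok, output = execute(command)
        if ok and verify():
            return True, attempt
    # Cap reached: stop and let the caller escalate to the user.
    return False, MAX_REPAIR_ATTEMPTS
```

Because the loop always terminates after a fixed number of attempts, a task that cannot be fixed surfaces as a visible failure instead of retrying indefinitely.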

Secret Handling

Secret placeholders can remain tokenized in LLM prompts and be substituted only at execution time.
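One way to get this behavior is to keep the `{{secret:NAME}}` token in every string sent to the LLM and swap in the real value only when the command is about to run. The sketch below assumes that approach. The function names `render_for_execution` and `redact`, and the assumption that tokens appear unquoted in generated commands, are hypothetical and not taken from provision.py.

```python
import re
import shlex

SECRET_TOKEN = re.compile(r"\{\{secret:([A-Za-z_][A-Za-z0-9_]*)\}\}")


def render_for_execution(command: str, secrets: dict[str, str]) -> str:
    """Replace {{secret:NAME}} tokens with shell-quoted values just before running."""
    return SECRET_TOKEN.sub(lambda m: shlex.quote(secrets[m.group(1)]), command)


def redact(text: str, secrets: dict[str, str]) -> str:
    """Put the tokens back before text is logged or sent to an LLM."""
    for name, value in secrets.items():
        if value:
            text = text.replace(value, "{{secret:" + name + "}}")
    return text
```

With this split, the planner and repair prompts only ever see the token, while logs and error output pass through `redact` before leaving the machine.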

Get Started

Step 1 is to read the script. Step 2 is to run it as root on a fresh system, provide a config URL or local file path, and enter an Anthropic/Google/OpenAI API key.

curl -sSL http://provision.sh/provision.py | python3 -

# Optional flags
# --config URL_OR_LOCAL_PATH --provider anthropic|google|openai --plan --log /var/log/provision.log --verbose

Recommended: run --plan first, review the output, then execute.
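For orientation, the optional flags listed above map naturally onto a standard `argparse` definition. The sketch below is an assumption about how such a parser could be declared, not a copy of the one in provision.py. The flag names and provider choices come from the list above, while the help strings and `metavar` values are illustrative.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the optional flags named in the text (illustrative sketch)."""
    parser = argparse.ArgumentParser(prog="provision.py")
    parser.add_argument("--config", metavar="URL_OR_LOCAL_PATH",
                        help="Markdown config URL or local file path")
    parser.add_argument("--provider", choices=["anthropic", "google", "openai"],
                        help="which API provider the key belongs to")
    parser.add_argument("--plan", action="store_true",
                        help="show intended commands without executing them")
    parser.add_argument("--log", metavar="PATH",
                        help="write a log file, e.g. /var/log/provision.log")
    parser.add_argument("--verbose", action="store_true",
                        help="print more detail while running")
    return parser
```

Parsing `["--config", "https://example.com/my-server.md", "--plan"]` with this parser yields `plan=True` and leaves `provider` and `log` as `None`, which a caller could treat as "prompt interactively".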
