GitHub - OSbiotools/BioPetals: 🌸 Run BIOxAI models at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
shpran · 2026-05-27 · via Hacker News: Show HN


Run a biology-focused LLM on your own network.
Distributed inference and fine-tuning powered by Petals.

Biology Model (OpenBioLLM)

BioPetals is a specialized fork of Petals for aaditya/Llama3-OpenBioLLM-8B, a biology-oriented LLM built on the Llama 3 architecture. Run it distributed across your own network for fast inference and fine-tuning.

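from petals.client import load_biology_model

# Load the tokenizer and the OpenBioLLM model through BioPetals.
tokenizer, model = load_biology_model()
# Tokenize the prompt and keep only the tensor of input IDs.
inputs = tokenizer("Summarize the role of ribosomes in translation", return_tensors="pt")["input_ids"]
# Generate up to 80 new tokens.
outputs = model.generate(inputs, max_new_tokens=80)
# Decode the generated IDs back into text, dropping special tokens.
print(tokenizer.decode(outputs[0], skip_special_tokens=True))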

Run the bundled example script:

python examples/run_biology_inference.py

Or run the Colab notebook from this fork: examples/run_biology_inference_colab.ipynb

If you prefer copy-pasting cells into Colab, use %pip (not plain pip) in the install cell:

%pip -q install -U pip setuptools wheel
%pip -q install --upgrade --force-reinstall --no-cache-dir "numpy==1.26.4" "scipy==1.14.1"
%pip -q install --upgrade --force-reinstall --no-cache-dir "protobuf==5.29.6" "grpcio-tools==1.71.2" "grpcio-status==1.71.2" "jedi>=0.19.2"
%pip -q install --upgrade --no-cache-dir "bitsandbytes==0.41.1" "speedtest-cli==2.1.3" "tensor_parallel==1.0.23" "peft==0.8.2"
%pip -q install --upgrade --no-cache-dir "hivemind==1.1.12" "transformers==4.43.1" "accelerate>=0.27.2" "huggingface-hub>=0.11.1,<1.0.0" "tokenizers>=0.13.3" "sentencepiece>=0.1.99" "packaging>=20.9" "humanfriendly" "async-timeout>=4.0.2" "Dijkstar>=2.6.0" "safetensors>=0.3.1"
%pip -q install --upgrade --no-deps --no-cache-dir "git+https://github.com/Pranesh950/BioPetals.git"

After installing packages in Colab, restart the runtime once before running inference cells.

Host this biology checkpoint in Petals:

python -m petals.cli.run_server aaditya/Llama3-OpenBioLLM-8B

Private biology-only swarm

To run a network that serves only the biology checkpoint, start one or more servers announcing that model and do not connect to the public swarm (use --new_swarm). The simplest option is the bundled helper:

./examples/run_bio_server.sh --num-blocks 8 --port 31337
  • Minimum peers: 1. A single server that hosts all blocks is enough for inference.
  • Distributed mode: if you split the model across several machines, the peers together must host every model block. How many peers that takes depends on the model's block count and on how many blocks each peer's GPU memory can hold.

Recommended: start with one server (or three for redundancy), verify inference locally, then invite more peers if you want to distribute serving across multiple machines.
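
The peer count described above can be estimated with a little arithmetic. The sketch below is illustrative only: peers_needed is a hypothetical helper, and it assumes 32 transformer blocks (the layer count of Llama 3 8B), equal-sized shares per peer, and no redundancy. It also assumes that --num-blocks 8 in the helper script means each server hosts 8 blocks.

```python
import math


def peers_needed(total_blocks: int, blocks_per_peer: int) -> int:
    """Smallest number of peers that covers every block exactly once."""
    if total_blocks <= 0 or blocks_per_peer <= 0:
        raise ValueError("block counts must be positive")
    return math.ceil(total_blocks / blocks_per_peer)


# Assumption: 32 transformer blocks, the layer count of Llama 3 8B.
TOTAL_BLOCKS = 32

for per_peer in (8, 16, 32):
    count = peers_needed(TOTAL_BLOCKS, per_peer)
    print(f"{per_peer} blocks per peer -> {count} peer(s)")
```

Under these assumptions, servers that each host 8 blocks need four peers for full coverage. Any redundancy requires extra peers on top of that.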

🔏 Privacy. BioPetals is designed for private, community-run swarms. Your data stays within your network. See Security & Privacy under Resources for details.

💬 Questions? Open an issue or check the Petals wiki.

Host a Server

BioPetals networks are community-run — help by sharing your GPU capacity to serve the biology model:

Access: The OpenBioLLM model is open-access. Run huggingface-cli login if you want to save credentials locally.

Setup:

# Linux or macOS
pip install git+https://github.com/Pranesh950/BioPetals.git

# Join a private swarm
./examples/run_bio_server.sh --num-blocks 8 --port 31337

Or manually:

python -m petals.cli.run_server aaditya/Llama3-OpenBioLLM-8B --new_swarm --public_ip <YOUR_IP> --port 31337
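
For scripted launches, the manual command above can be assembled in Python. This is a minimal sketch: build_server_command is a hypothetical helper, not part of Petals or BioPetals, and 203.0.113.10 is a placeholder address. It uses only flags that appear in this README.

```python
import shlex
from typing import List, Optional

MODEL = "aaditya/Llama3-OpenBioLLM-8B"


def build_server_command(public_ip: str, port: int = 31337,
                         public_name: Optional[str] = None) -> List[str]:
    """Assemble the manual run_server invocation for a private swarm."""
    cmd = ["python", "-m", "petals.cli.run_server", MODEL,
           "--new_swarm", "--public_ip", public_ip, "--port", str(port)]
    if public_name:
        # Optional display name, as described in this README.
        cmd += ["--public_name", public_name]
    return cmd


# 203.0.113.10 is a documentation-only placeholder; use your own address.
cmd = build_server_command("203.0.113.10", public_name="my-lab")
print(shlex.join(cmd))
```

The returned list can be passed directly to subprocess.run(cmd, check=True), which avoids shell quoting issues.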

For Windows, AMD GPUs, Docker, or multi-GPU setups, see the Petals wiki for detailed instructions.

📚  Learn more (how to use multiple GPUs, start the server on boot, etc.)

🔒 Security. Hosting a server does not allow others to run custom code on your computer. See Security & Privacy under Resources for details.

💬 Any questions? Ping us in our Discord!

🏆 Thank you! Help maintain the network by hosting blocks. You can optionally specify --public_name YOUR_NAME for recognition.

How does it work?

  • You load a small part of the model locally, while peers host the remaining blocks. Inference runs efficiently across the distributed network.
  • Use any fine-tuning and sampling methods, access hidden states, and enjoy the flexibility of PyTorch and 🤗 Transformers with distributed execution.
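
The block-splitting idea above can be illustrated with a toy pipeline. This is not Petals code: the "blocks" are trivial stand-in functions, the "peers" are just ranges of block indices, and every name here is hypothetical. It only shows that passing a hidden state through contiguous spans held by different peers gives the same result as running all blocks in one place.

```python
from typing import Callable, List

Block = Callable[[float], float]


def make_block(index: int) -> Block:
    # Stand-in for a transformer block: a cheap, deterministic transform.
    return lambda hidden: hidden * 2 + index


def split_into_spans(num_blocks: int, num_peers: int) -> List[range]:
    """Assign contiguous block ranges to peers as evenly as possible."""
    base, extra = divmod(num_blocks, num_peers)
    spans, start = [], 0
    for peer in range(num_peers):
        size = base + (1 if peer < extra else 0)
        spans.append(range(start, start + size))
        start += size
    return spans


def run_pipeline(hidden: float, blocks: List[Block], spans: List[range]) -> float:
    # Each outer iteration stands for one hop to the peer that owns the span.
    for span in spans:
        for i in span:
            hidden = blocks[i](hidden)
    return hidden


blocks = [make_block(i) for i in range(8)]
spans = split_into_spans(num_blocks=8, num_peers=3)
print([list(s) for s in spans])  # [[0, 1, 2], [3, 4, 5], [6, 7]]

distributed = run_pipeline(1.0, blocks, spans)
local = run_pipeline(1.0, blocks, [range(8)])
print(distributed == local)  # True
```

In a real swarm each hop is a network call carrying activations between machines, so latency and peer availability matter in ways this sketch does not model.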

📜 Read the paper (see Citations) · 📚 See the Petals FAQ

📚 Resources

Examples:

  • Inference script: examples/run_biology_inference.py
  • Colab notebook: examples/run_biology_inference_colab.ipynb
  • Server helper: examples/run_bio_server.sh

Documentation:

  • Petals Wiki — general Petals setup, troubleshooting, and advanced configurations
  • Security & Privacy — learn how BioPetals keeps your data safe

Benchmarks

Please see Section 3.3 of our paper.

🛠️ Contributing

Contributions are welcome! Please see the Petals FAQ for contribution guidelines, or open an issue to report bugs and suggest features.

📜 Citations

Alexander Borzunov, Dmitry Baranchuk, Tim Dettmers, Max Ryabinin, Younes Belkada, Artem Chumachenko, Pavel Samygin, and Colin Raffel. Petals: Collaborative Inference and Fine-tuning of Large Models. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations). 2023.

@inproceedings{borzunov2023petals,
  title = {Petals: Collaborative Inference and Fine-tuning of Large Models},
  author = {Borzunov, Alexander and Baranchuk, Dmitry and Dettmers, Tim and Riabinin, Maksim and Belkada, Younes and Chumachenko, Artem and Samygin, Pavel and Raffel, Colin},
  booktitle = {Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)},
  pages = {558--568},
  year = {2023},
  url = {https://arxiv.org/abs/2209.01188}
}

Alexander Borzunov, Max Ryabinin, Artem Chumachenko, Dmitry Baranchuk, Tim Dettmers, Younes Belkada, Pavel Samygin, and Colin Raffel. Distributed inference and fine-tuning of large language models over the Internet. Advances in Neural Information Processing Systems 36 (2023).

@inproceedings{borzunov2023distributed,
  title = {Distributed inference and fine-tuning of large language models over the {I}nternet},
  author = {Borzunov, Alexander and Ryabinin, Max and Chumachenko, Artem and Baranchuk, Dmitry and Dettmers, Tim and Belkada, Younes and Samygin, Pavel and Raffel, Colin},
  booktitle = {Advances in Neural Information Processing Systems},
  volume = {36},
  pages = {12312--12331},
  year = {2023},
  url = {https://arxiv.org/abs/2312.08361}
}

This project is a part of the BigScience research workshop.