GitHub - bogdanr/fono: Press a key, speak, text lands at your cursor. Press another, get a spoken answer. Local-first, lightweight voice dictation and assistant for Linux.
bogdan_r · 2026-05-26 · via Hacker News: Show HN

A lightweight dictation tool for Linux. Press a key, speak, and the text lands at your cursor.

Fono dictation demo: press a hotkey, speak, the text appears at the cursor

Install one-liner

curl -fsSL https://fono.page/install | sh

The script picks the right binary for your CPU (switching to the Vulkan GPU build if Vulkan is available on your machine), runs sudo fono install to place it on $PATH, starts the daemon, and opens the fono setup wizard in the same terminal.

Different styles

Four overlay visualisation styles: bars, oscilloscope, FFT, heatmap

While you're speaking, a small overlay shows what the microphone is hearing. Four styles ship: bars, oscilloscope, fft, heatmap. Switch via the tray (Preferences → Waveform style) or set [overlay].style in ~/.config/fono/config.toml.
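As a rough illustration of the config route, the sketch below renders the fragment you might place in ~/.config/fono/config.toml. The section and key come from the [overlay].style notation above, and the four style names are the ones listed. That the value is written as a lowercase quoted string is an assumption, and render_overlay_config is a hypothetical helper, not part of Fono.

```python
# The four style names come from the README; the helper itself is illustrative.
OVERLAY_STYLES = ("bars", "oscilloscope", "fft", "heatmap")


def render_overlay_config(style: str) -> str:
    """Return a TOML fragment that sets [overlay].style to one of the shipped styles."""
    if style not in OVERLAY_STYLES:
        raise ValueError(
            f"unknown overlay style {style!r}; expected one of {OVERLAY_STYLES}"
        )
    # Assumption: the value is stored as a lowercase quoted string.
    return f'[overlay]\nstyle = "{style}"\n'


if __name__ == "__main__":
    # Prints the fragment for the oscilloscope style.
    print(render_overlay_config("oscilloscope"), end="")
```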

What Fono does

  • Dictation, push-to-talk or toggle. Tap F7 to toggle recording; hold F7 for push-to-talk. The same key works either way — the press duration decides.
  • Voice assistant on F8. Talk to Ollama, OpenAI, Anthropic, Groq, Cerebras, or OpenRouter; the reply is streamed sentence-by-sentence into TTS so audio starts before the model has finished thinking.
  • Talk to your coding agent (early preview). Drive Forge, Claude Code, Cursor, Codex CLI, Gemini CLI — any MCP-capable agent — entirely by voice. fono agent-setup <name> wires it up in one shot; after that the agent speaks its replies, listens for your follow-ups, and offers A/B/C choices you pick with your voice. See docs/coding-agents.md.
  • Lands in any X11 or Wayland window. Fono types straight into the focused window and mirrors to the clipboard as a safety net. Hotkeys register through the Wayland portal where it's available, with automatic fallbacks for GNOME 46 and X11 — see docs/wayland.md for the per-compositor story.
  • Local or cloud speech-to-text. Whisper runs on your machine by default. Or switch to Groq, OpenAI, or Deepgram with one command (fono use stt …).
  • Local or cloud text-to-speech. Wyoming-piper runs locally, with more local options on the roadmap. Or switch to a cloud provider if you want the most natural-sounding voices.
  • Automatic model selection. The first-run wizard probes your CPU and GPU, then picks the heaviest local Whisper model that runs faster than real time on your hardware, with no manual tuning. The decision matrix was built from several days of benchmarking on older and newer machines.
  • Optional cleanup pass. A small LLM can tidy up the transcript before it's injected — locally with llama.cpp, or via Cerebras / Groq / OpenAI / OpenRouter / Anthropic / Ollama.
  • Visualisation overlay during recording. Bars, oscilloscope, FFT, or heatmap. Live-dictation mode adds a small VU bar.
  • Optional GPU acceleration. fono update probes your host for Vulkan and pulls the matching CPU or Vulkan build automatically.
  • LAN-friendly. Speaks the Wyoming protocol as both client and server, so Fono can route through (or host for) a Home Assistant satellite or another Fono on the network. mDNS finds peers automatically.
  • Two small builds. The CPU build is about 22 MB and the GPU build about 60 MB. No Electron, no Node, no Python, no WebKit. Four glibc dependencies.
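The sentence-by-sentence streaming mentioned for the voice assistant can be pictured with a small sketch. This is not Fono's implementation; it only shows the general idea of buffering streamed text chunks and releasing each sentence as soon as its boundary is seen, so a TTS engine could start speaking early. The function name and the naive boundary rule are assumptions.

```python
import re
from typing import Iterable, Iterator

# Sentence boundary: '.', '!' or '?' followed by whitespace. Deliberately naive
# (it also splits after abbreviations such as "e.g. "); illustration only.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


def sentences_from_stream(chunks: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a stream of text chunks."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _BOUNDARY.split(buffer)
        # Every part except the last is a finished sentence.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        # The last part may still be growing, so keep it buffered.
        buffer = parts[-1]
    tail = buffer.strip()
    if tail:
        yield tail


if __name__ == "__main__":
    streamed = ["Hel", "lo there. How", " are you? I am", " fine."]
    for sentence in sentences_from_stream(streamed):
        # A real pipeline would hand each sentence to a TTS engine here.
        print(sentence)
```

Each sentence is emitted once the start of the next one arrives, which is what lets audio begin before the full reply exists.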

First run

sudo fono install installs the files and starts the setup wizard.

Default hotkeys are F7 (dictation) and F8 (voice assistant). Both keys auto-detect how you press them: a quick tap toggles recording on (tap again to stop); holding for more than a second turns the key into push-to-talk and recording ends on release. Escape cancels a recording or shuts up an assistant reply.
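The tap-versus-hold rule can be sketched as a small state machine. This is a hedged illustration, not Fono's code: the class name, the method names, and the choice to stop on the second key press (rather than on its release) are assumptions, and the one-second threshold is taken from the description above.

```python
from dataclasses import dataclass
from typing import Optional

# "More than a second" per the README; the exact value Fono uses is an assumption.
HOLD_THRESHOLD_S = 1.0


@dataclass
class HotkeyState:
    recording: bool = False
    pressed_at: Optional[float] = None
    started_by_press: bool = False

    def key_down(self, now: float) -> None:
        """Start recording, or stop it if a previous tap left it running."""
        self.pressed_at = now
        if self.recording:
            # Second tap in toggle mode: stop recording.
            self.recording = False
            self.started_by_press = False
        else:
            self.recording = True
            self.started_by_press = True

    def key_up(self, now: float) -> None:
        """Decide between toggle and push-to-talk from how long the key was held."""
        if self.pressed_at is None:
            return
        held = now - self.pressed_at
        self.pressed_at = None
        if self.started_by_press and held > HOLD_THRESHOLD_S:
            # Long hold: push-to-talk, so recording ends on release.
            self.recording = False
        self.started_by_press = False


if __name__ == "__main__":
    key = HotkeyState()
    key.key_down(0.0)
    key.key_up(0.2)       # quick tap: recording stays on
    print(key.recording)
    key.key_down(5.0)     # tap again: recording stops
    key.key_up(5.1)
    print(key.recording)
```

Timestamps are passed in explicitly so the logic can be exercised without a real keyboard; key-repeat events are ignored in this sketch.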

The setup wizard hot-reloads the running daemon when it finishes, so you don't need to restart anything. Reconfigure with fono setup.
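For the wizard's automatic model selection, one way to read "heaviest model that runs faster than real time" is in terms of a real-time factor: transcription time divided by audio length, where a value below 1.0 means faster than real time. The sketch below assumes that reading. The function name, the model names in the example, and the timings are illustrative only and do not come from Fono's benchmark data.

```python
from typing import Optional, Sequence, Tuple


def pick_heaviest_realtime_model(
    timings: Sequence[Tuple[str, float]], audio_seconds: float
) -> Optional[str]:
    """Pick the heaviest model whose real-time factor is below 1.0.

    timings holds (model_name, seconds_to_transcribe) pairs ordered from the
    lightest model to the heaviest. Returns None if no model keeps up.
    """
    chosen = None
    for name, elapsed in timings:
        real_time_factor = elapsed / audio_seconds
        if real_time_factor < 1.0:
            # Later entries are heavier, so the last passing entry wins.
            chosen = name
    return chosen


if __name__ == "__main__":
    # Hypothetical timings for a 10-second clip on some machine.
    measured = [("tiny", 1.0), ("base", 2.5), ("small", 6.0), ("medium", 14.0)]
    print(pick_heaviest_realtime_model(measured, audio_seconds=10.0))
```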

Switching providers

fono setup asks for a primary cloud provider. With OpenAI or Groq, a single API key covers STT, cleanup, the assistant, and TTS. Narrower providers (Anthropic, Cerebras, OpenRouter) cover what they offer; the wizard only prompts for follow-on keys if you opt in to capabilities they don't cover.

fono use cloud groq           # paired preset (Groq STT + Groq LLM)
fono use stt openai           # change just STT
fono use tts cartesia         # swap TTS backend
fono use local                # back to whisper-local + skip polish
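The follow-on key prompts described above amount to a set difference between the capabilities you want and the ones your primary provider covers. The sketch below is illustrative: only the OpenAI and Groq rows are stated in this README, the rows for the narrower providers are guesses, and follow_on_capabilities is a hypothetical helper.

```python
from typing import Dict, Iterable, List, Set

ALL_CAPABILITIES: Set[str] = {"stt", "cleanup", "assistant", "tts"}

# OpenAI and Groq covering all four capabilities is stated in the README.
# The entries for the narrower providers are assumptions for illustration.
PROVIDER_COVERAGE: Dict[str, Set[str]] = {
    "openai": {"stt", "cleanup", "assistant", "tts"},
    "groq": {"stt", "cleanup", "assistant", "tts"},
    "anthropic": {"cleanup", "assistant"},
    "cerebras": {"cleanup", "assistant"},
    "openrouter": {"cleanup", "assistant", "tts"},
}


def follow_on_capabilities(primary: str, wanted: Iterable[str]) -> List[str]:
    """Return the wanted capabilities the primary provider does not cover."""
    return sorted(set(wanted) - PROVIDER_COVERAGE[primary])


if __name__ == "__main__":
    # With a full-coverage provider there is nothing left to prompt for.
    print(follow_on_capabilities("groq", ALL_CAPABILITIES))
    # With a narrower provider, only the opted-in gaps remain.
    print(follow_on_capabilities("anthropic", {"stt", "assistant"}))
```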

Keys live in ~/.config/fono/secrets.toml:

fono keys add GROQ_API_KEY    # paste at the prompt
fono keys check               # reachability probe per stored key

TTS works with OpenAI, Groq, OpenRouter (Kokoro), Cartesia, Deepgram, and any Wyoming server you have on the LAN.

Other ways to install

  • Distro packages. .deb, .pkg.tar.zst, and .txz files are built by CI and attached to each release, but they are not regularly tested. They may work; please file an issue if they don't.
  • macOS and Windows. Planned, not shipping.

Privacy

Local-first. Nothing leaves your machine unless you pick a cloud provider.

Status

Linux-first; used daily by the maintainer. Rough edges exist — issues and patches are welcome. See ROADMAP for what's next.

Contributing

Pull requests welcome. See CONTRIBUTING.md for the workflow (DCO sign-off required).

License

GPL-3.0-only. See LICENSE.