
SoupaWhisper: Free SuperWhisper Alternative for Linux (Open Source)
Kyle Redelinghuys · 2025-12-07 · via Kyle Redelinghuys

I've been using my Linux desktop a lot more recently. The prenatal testing project I'm working on has me spending more time on the machine, and I wanted to make sure my development environment was as productive as possible. When you're deep in a project, small friction points add up quickly.

One thing I missed immediately: voice dictation.

On Mac, I use SuperWhisper constantly. I dictate to Claude, to Grok, and especially when working with AI coding tools like Claude Code. The workflow is simple - hold a key, speak, release, and the text appears. It keeps me in flow without breaking to type out longer prompts or explanations. When you're working with AI tools, you're often explaining context, describing what you want, or talking through problems. Typing all of that out breaks the conversational rhythm.

The problem? SuperWhisper doesn't exist for Linux. And the built-in dictation options are either cloud-based (privacy concerns, latency, requires internet) or ancient tools that barely work. Cursor has some built-in dictation, but Claude Code and the other CLI tools have none. I needed something that would work everywhere - in the terminal, in the browser, in any application.

So I built my own. It took less than an hour.

The solution

SoupaWhisper is a ~250-line Python script that does exactly what SuperWhisper does, powered by OpenAI's Whisper model running entirely locally via faster-whisper.

The workflow is identical to what I was used to: hold F12, speak, release. The text gets typed into whatever window is active and copied to the clipboard if I want to paste it elsewhere. Nothing fancy - just local speech-to-text that actually works.
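
To make the hold-speak-release behaviour concrete, here is a minimal sketch of a push-to-talk controller. It is not the actual SoupaWhisper code: the class name and the three callbacks (`start_recording`, `stop_and_transcribe`, `deliver`) are hypothetical placeholders for whatever captures audio, runs Whisper, and outputs the text.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PushToTalk:
    """Hold-to-record controller: press starts capture, release delivers text.

    All three callbacks are hypothetical hooks, injected so the logic can be
    exercised without a microphone, a model, or a display server.
    """

    start_recording: Callable[[], None]     # begin audio capture
    stop_and_transcribe: Callable[[], str]  # stop capture, return recognised text
    deliver: Callable[[str], None]          # type the text and copy it
    recording: bool = False

    def on_press(self) -> None:
        # Key auto-repeat may deliver extra press events while the key is held,
        # so a press that arrives during a recording is ignored.
        if self.recording:
            return
        self.recording = True
        self.start_recording()

    def on_release(self) -> None:
        # A release without a matching press has nothing to transcribe.
        if not self.recording:
            return
        self.recording = False
        text = self.stop_and_transcribe().strip()
        if text:  # skip empty results, e.g. silence
            self.deliver(text)
```

A keyboard listener would call `on_press` and `on_release` for the hotkey (F12 here); keeping the state machine separate from the listener means the same logic works with any hotkey library.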

I built this using Claude Code, which is part of why it came together so quickly. The combination of Claude Code for the implementation and Linux's straightforward tooling made this surprisingly simple. I was genuinely taken aback by how easy it was to put together. Linux has all the building blocks ready to go - audio capture, clipboard management, keyboard simulation - you just need to wire them together.
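
As an illustration of those building blocks, the sketch below wires up three common command-line tools: `arecord` for audio capture, `xclip` for the clipboard, and `xdotool` for keyboard simulation. The choice of tools is an assumption (the post does not name them here), the helper function names are hypothetical, and `xdotool` targets X11 sessions rather than Wayland.

```python
import shutil
import subprocess
from typing import List

SAMPLE_RATE = 16000  # Whisper operates on 16 kHz mono audio


def record_command(wav_path: str) -> List[str]:
    """arecord (ALSA): record 16-bit mono WAV until the process is terminated."""
    return ["arecord", "-q", "-f", "S16_LE", "-r", str(SAMPLE_RATE),
            "-c", "1", "-t", "wav", wav_path]


def clipboard_command() -> List[str]:
    """xclip reads the text to copy from standard input."""
    return ["xclip", "-selection", "clipboard"]


def type_command(text: str) -> List[str]:
    """xdotool types the text into the focused window.

    The "--" stops option parsing so text starting with "-" is typed literally.
    """
    return ["xdotool", "type", "--clearmodifiers", "--", text]


def missing_tools() -> List[str]:
    """Return the names of required tools that are not on PATH."""
    return [t for t in ("arecord", "xclip", "xdotool") if shutil.which(t) is None]


def deliver(text: str) -> None:
    """Copy the text to the clipboard, then type it into the active window."""
    subprocess.run(clipboard_command(), input=text.encode(), check=True)
    subprocess.run(type_command(text), check=True)
```

Recording would start with `subprocess.Popen(record_command(path))` on key press and end by terminating that process on key release. Building the argument lists in small functions keeps them easy to inspect and swap out, for example for a Wayland equivalent.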

Why Python?

I normally reach for Go for almost everything. It's my language of choice and I've written extensively about why. But Python made more sense here for a few reasons.

The faster-whisper bindings are Python-native, so there's no fighting with FFI or CGO bindings. Downloading and managing the Whisper models is trivial with Python's ecosystem. The code stays minimal because you're not dealing with any impedance mismatch between the AI libraries and your application code.
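
For a sense of how little code the faster-whisper side needs, here is a minimal sketch of transcribing a recorded file. The model size (`base.en`), the CPU/int8 settings, and the function names are assumptions chosen for illustration, not necessarily what SoupaWhisper uses.

```python
from typing import Iterable


def join_segments(segments: Iterable) -> str:
    """Join the text of each segment into one string, dropping empty pieces."""
    return " ".join(seg.text.strip() for seg in segments if seg.text.strip())


def transcribe_file(wav_path: str, model_size: str = "base.en") -> str:
    """Transcribe an audio file locally with faster-whisper."""
    # Imported lazily so join_segments works without the package installed.
    from faster_whisper import WhisperModel

    # The model weights are downloaded on first use and cached afterwards.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")

    # transcribe() returns a lazy generator of segments plus metadata;
    # the decoding actually runs while the segments are consumed.
    segments, _info = model.transcribe(wav_path)
    return join_segments(segments)
```

In a long-running dictation tool the `WhisperModel` would more likely be created once at startup and reused, since loading it on every key press adds avoidable latency.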

I'm also doing more Python work on the prenatal project right now, so staying in the same language reduces context switching. When you're bouncing between projects, having one less thing to mentally switch on helps.

Technical implementation

The architecture is dead simple. Here's the entire flow:

This post is for paying subscribers only
