惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
V
Vulnerabilities – Threatpost
Attack and Defense Labs
Attack and Defense Labs
N
News and Events Feed by Topic
SecWiki News
SecWiki News
S
Security @ Cisco Blogs
Schneier on Security
Schneier on Security
B
Blog
TaoSecurity Blog
TaoSecurity Blog
The Last Watchdog
The Last Watchdog
H
Hacker News: Front Page
Hacker News - Newest:
Hacker News - Newest: "LLM"
博客园_首页
D
Docker
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
Y
Y Combinator Blog
W
WeLiveSecurity
N
News and Events Feed by Topic
F
Fortinet All Blogs
PCI Perspectives
PCI Perspectives
WordPress大学
WordPress大学
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
www.infosecurity-magazine.com
www.infosecurity-magazine.com
Recent Announcements
Recent Announcements
Forbes - Security
Forbes - Security
T
Tailwind CSS Blog
Hacker News: Ask HN
Hacker News: Ask HN
爱范儿
爱范儿
腾讯CDC
Last Week in AI
Last Week in AI
月光博客
月光博客
C
Cybersecurity and Infrastructure Security Agency CISA
P
Proofpoint News Feed
Help Net Security
Help Net Security
V
V2EX
C
Cyber Attacks, Cyber Crime and Cyber Security
C
CXSECURITY Database RSS Feed - CXSecurity.com
H
Heimdal Security Blog
L
LINUX DO - 最新话题
GbyAI
GbyAI
The Hacker News
The Hacker News
罗磊的独立博客
S
SegmentFault 最新的问题
H
Hackread – Cybersecurity News, Data Breaches, AI and More
博客园 - 【当耐特】
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
V2EX - 技术
V2EX - 技术
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
O
OpenAI News
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python) The Hidden Cost of AI Systems Nobody Talks About. undefined vs undeclared, and how typeof behaves Switching from file-based jobs to NATS/Kafka in Rust without changing code io_uring Adventures: Rust Servers That Love Syscalls Why Agentic AI is Killing the Traditional Database The POUR principles of web accessibility for developers and designers Quantum Neural Network 3D — A Deep Dive into Interactive WebGL Visualization How To Install Caveman In Codex On macOS And Windows Automation Pipeline Reliability: Why Your Workflow Breaks When Nobody Is Watching I Built an 'Open World' AI Coding Agent — It Works From ANY Folder From Freelancing to Product: A Tech Service Company's SaaS Transformation China's AI Giants: Adding Tencent Hunyuan & ByteDance Doubao to AI University (74 Providers) On the Vibe Coders and Their Lies clerk: Auto-Summarize Your Claude Code Sessions AI Weekly — 2026/04/10–04/17 | The Model Lockdown Is Here, but the Toolchain Is the Real Battleground AI 週報 — 2026/04/10–2026/04/17 模型封鎖潮來了,但工具鏈才是真戰場 Maybe this is how Open-Source apps are born... 🚀 Fine-Tune LLMs with LoRA and QLoRA: 2026 Guide tRPC v11 + Next.js App Router: End-to-End Type Safety Without the Boilerplate ShadCN UI in 2026: Why I Stopped Installing Component Libraries and Started Owning My Components SaaS Billing in React Server Components: Stripe + Supabase Without a Single `useEffect` Join our DEV Weekend Challenge — $1,000 in Prizes Across TEN winners! Submissions Due April 20 at 6:59 AM UTC. Implementing FSRS Spaced Repetition in Flutter + Supabase — Adding Memory Science to an AI Learning App "I Texted My Localhost From the Train — Claude Code Fixed the Bug Before I Got Home" I Built a Sales Prep AI and It Went Deeper Than Expected Design to Code #2: One JSON, Eleven Outputs Solving the 100M-Row Problem: A Summary Table Pattern for High-Volume Push Notification Logs Flutter Web With Wasm: What Actually Changes For Developers I Built 50 Royalty-Free Soundtracks for My Side Project in a Weekend Using AI Music Generation The Vibe Coding Security Checklist: 7 Things to Check Before You Ship Stop Letting Googlebot Guess Fix Your React App's SEO Right Desconstruindo o Streaming do LinkedIn: Como Criar um Engine de Extração de Vídeo de Alta Performance com HLS e FFmpeg (EDA Part-1) EDA (Exploratory Data Analysis) Explained With Real Life — Why Looking at Your Data Is the Most Important Step in Machine Learning Brand Relationship Management at Scale: Our 4-Touch Outreach System for 200+ Brands Why String.fromEnvironment() Might Return an Empty String in Dart JGuardrails 1.0.0 — Hardening Java LLM Apps Against Jailbreaks, Toxicity, and Prompt Injection Plan and Schedule a Full Week of Threads Content From One Claude Conversation Coding Cat Oran Ep3, Five Tables Changed Everything Updated: BFF Pattern I'm done watching freelancers get buried by 200 proposals. So I'm building the alternative. This is my first post BFS Algorithm in Java Step by Step Tutorial with Examples Tracking LLM Pricing Monthly: An Open Dataset for 22 AI Models How We Measure Content ROI on a Comparison Site: Revenue Attribution Without Perfect Data Introducing Nova AI Ops: The AI-Native Operating System for SRE Teams I built a free desktop video downloader for Windows — Grabbit How Talkie OCR Helps Vision-Impaired & Dyslexic Users Read the World Around Them VRCFaceTracking安装和iPhone面捕配置教程,有bug Even CrowdStrike Can't See Your Agents The Automation Gold Rush: What n8n Workflows and Claude Are Opening Up for Developers Right Now
I Went Looking for the Basis of 'N Characters Per Minute Is Fast' — There Wasn't One. Setting Read-Aloud Thresholds Honestly
Uya · 2026-06-18 · via DEV Community

Uya

Uya

Posted on • Originally published at zenn.dev

📝 Originally published in Japanese on Zenn. This is the English version.
Canonical: https://zenn.dev/uya0526_design/articles/satellite3_metrics-rationale

📚 This is satellite article #3 in my "Read-Aloud Speed Meter dev log" series. For the whole picture, see the main article.

Where This Sits

The read-aloud speed meter converts speaking speed into an evaluation label like "slightly fast," and stagnation rate into one like "few." Those labels ultimately become the foundation for Claude Haiku's feedback.

So — on what basis did I draw the thresholds (the dividing lines)? This article digs into that "basis."

The short answer from my research:

I couldn't find a paper that defines an academic threshold for "N characters/min = fast/slow."

This is a record of how I drew the lines honestly once I'd learned there was no firm basis. More than the metric numbers themselves, I believe being transparent about why I chose those numbers is what makes an evaluation app trustworthy.

💡 I'm an ex-Java engineer learning TypeScript in public. This one is mostly about design decisions.


Why Obsess Over the "Basis"?

An evaluation app passes judgment on the user: "your reading is slightly fast." Once you're passing judgment, if you can't explain "why we can say that," it's just guesswork.

This app in particular passes the labels straight to Claude Haiku to generate coaching. If the foundational label has an unclear basis, the feedback built on top of it is a castle on sand. So I decided to nail down the basis for the thresholds first.

Two things to research:

  • The judgment basis for speaking speed (characters/min)
  • The judgment basis for stagnation rate (the proportion of silence)

As it turned out, these two had completely different kinds of basis.


Speed Thresholds: No Academic Threshold → Draw From General Rules of Thumb

What I found

For speaking speed, I first looked for academic backing. Here's what I found:

  • Speaking speed has traditionally been measured against mora count, but prior research from the listener's perspective is itself scarce.
  • Japanese is considered a "fast language" with many syllables per second, yet there's research suggesting that, correlated with information density, the overall information-transfer rate is similar across languages.

But none of this serves as a basis for judging whether an individual's reading is fast or slow. I couldn't find an academic threshold that says "N characters/min is fast."

The basis I adopted: the announcer standard

So I adopted a rule-of-thumb basis. The premise: the speed expected of an announcer is 300 characters/min.

  • The NHK announcer standard is "300 chars/min." It's shared knowledge across program staff and serves as a guide for writing scripts.
  • The trend has been getting faster recently — said to be around 380 chars/min at NHK and 400 at commercial broadcasters.
  • In settings that demand fast delivery, like sports play-by-play, over 400 chars/min is common.

Announcers are in the realm of language professionals. So if an ordinary reader reaches this speed, I positioned that as already on the "slightly fast" side.

Thresholds (adopted version)

Speed label Chars/min
Slow – 149
Slightly slow 150 – 199
Standard 200 – 299
Slightly fast 300 – 350
Fast 351 –

This is where an app-specific design decision comes in. General tables usually put "the NHK standard, ~300 = standard," but this app intentionally shifts that, placing the professional standard of 300 as the start of "slightly fast" for an ordinary person.

The reason is that this app's users are ordinary readers. If I put the professional level at "standard," almost everyone gets a "slow" verdict, and the coaching pushes in a discouraging direction. "If an ordinary person reaches the professional level, that's already on the fast side" is a more natural framing for an encouragement tool.

Mapping into code: These thresholds live as constants in thresholds.ts. The explanation of the numbers' basis (300 = pro standard, etc.) isn't in code comments but in this article and the LEARNING_LOG. In Java terms, it's the same as "extract config values into constants + keep the intent in the design docs."

Writing about it honestly in the article and UI

What I was careful about when presenting the thresholds was not creating the misunderstanding that "Claude is judging on strict academic grounds."

  • Since there's no academic threshold, write honestly that I adopted a rule-of-thumb basis (announcer standard, 300 chars/min).
  • Spell out the intent of the custom placement — putting the pro standard of 300 as the ordinary person's "slightly fast" start.
  • Note that, more rigorously, chars/min is a simplified metric and mora/min is the usual measure.

Rather than dressing it up as "strict science," I figured showing the process — "I drew the lines this way, on this basis" — earns more trust for an evaluation app.


Stagnation-Rate Thresholds: There Isn't Even a "Ruler"

Even less basis than speed

For stagnation rate (the proportion of silence across the whole utterance), the conclusion was even tougher than for speed:

The basis for a numeric threshold of "stagnation under X% is good" is even harder to find than for speed. It's a domain where the "numeric ruler" itself, the kind speed has, simply doesn't exist.

It's also thorny as a general principle. A speaker not taking pauses is not itself regarded as evidence of fluent speech. In fact, research holds that pauses serve important functions:

  • Speech rate (including pauses) and articulation rate (the speaking portion only) are distinct concepts, and the one including pauses correlates more strongly with auditory impression.
  • Pauses don't just mark word/phrase/sentence boundaries — they serve the important function of giving the listener a chance to digest information.

In other words, "fewer pauses = better" doesn't simply hold.

Accepting it as an "in-house metric"

Here I made a design decision. For stagnation rate, I decided to claim no academic or general backing and accept it frankly as an in-house heuristic for this app.

Stagnation label Stagnation rate (% after labelMetrics)
Few – 0.9%
Somewhat few 1.0 – 5.9%
Normal 6.0 – 9.9%
Somewhat many 10.0 – 19.9%
Many 20.0% –

The basis for putting "0% on the good side" is cultural context

Let me get ahead of the question a reader will naturally have: "Why put 0% (speaking without breaks) on the good side?"

The basis isn't academic — it's the cultural context of Japanese. Japanese has an expression, "tate-ita ni mizu" (literally "water down a standing board" — speaking fluently and without interruption), reflecting a disposition to see uninterrupted, flowing speech as "fluent." I drew the line leaning toward that connotation.

That said, this isn't "the correct answer." As noted above, there's research finding that "pauses serve a function of letting the listener digest information," so 0% isn't always good. I judged this to be a domain that isn't academically settled and where people's perceptions differ, and — within a deadline-bound Phase 1 — I didn't chase it deeper; as the designer I adopted the connotation on the "tate-ita ni mizu" side.

⚠️ This metric's limits: As it stands, the stagnation rate does not distinguish Japan's meaningful pauses (ma). A 0.5-second stop that lands appropriately at a punctuation point and a 0.5-second stumble where the reader got stuck are both counted as the same "silence" stagnation. It really ought to be looked at qualitatively, paired with "appropriateness of the pause's position," and this is a research item not yet listed in the README's Phase 2. The app's footer makes this limit explicit too.


The Two Metrics Had Different Kinds of Basis

To summarize, even for the same "evaluation-metric threshold," how I grounded them was completely different:

Metric Academic threshold Basis adopted How I handle it in the article
Speed None Rule of thumb (announcer 300 chars/min) Present an industry-custom value as basis; spell out the custom placement
Stagnation rate None (no ruler at all) App-specific heuristic + cultural context Claim no academic backing; honestly label it an in-house metric

This is the point I most want to land in this article. When you can't find a basis, don't force a "this is scientific" facade. Disclosing the process — "there's no academic threshold, so I drew from a rule of thumb," "I accepted this as an in-house metric" — is, I think, actually stronger for an evaluation app.


What I Decided Myself / What I Asked AI For

Area Detail
My decisions The final thresholds, the custom placement of the announcer standard, the call to treat stagnation as an in-house metric, adopting the "tate-ita ni mizu" context, stating the MVP limits
Asked AI for Presenting tentative boundary tables, helping survey related research and industry-custom values
Verified myself Checking sources, the final judgment on the validity of the basis (not taking AI's research at face value)

The threshold numbers and their basis were all decided by me. AI helped with the survey and tentative drafts, but "which basis to adopt and how to place it" is the designer's judgment.


Wrapping Up

This was a summary of "on what basis" I drew the read-aloud evaluation thresholds. Three takeaways:

  1. There was no academic threshold for speed, so I grounded it in a rule of thumb (announcer 300 chars/min) and custom-placed the pro standard as the ordinary person's "slightly fast" start.
  2. Stagnation rate has no ruler at all, so I accepted it as an app-specific heuristic; the basis for putting 0% on the good side rests on the cultural context of "tate-ita ni mizu."
  3. When there's no basis, don't dress it up as scientific — disclose the process. That, I believe, is what makes an evaluation app honest.

An evaluation tool's value isn't decided by the precision of the numbers alone. It's whether you can be transparent with the user about how you justified those numbers. I recorded even the dead ends of the basis hunt because that was the thing I most wanted to convey.

The detailed development log is in the repository's LEARNING_LOG.

Main references

Sources I consulted while organizing the basis for this article.

  • Ayumi Marushima, "A Study on the Cognition of Speech Rate," Studies in Linguistics online inaugural issue (2008) [in Japanese]
  • Ayumi Marushima, "A Study on the Tempo of Spoken Language," Studies in Linguistics online no. 2 (2009) [in Japanese]
  • Ayumi Marushima, "An Experimental-Phonetics Study of Speech Rate from the Speaker's and Listener's Perspectives," doctoral dissertation, University of Tsukuba (2015) [in Japanese]
  • Reporting and shared industry knowledge on NHK / commercial-broadcaster announcer speech rates (Diamond Online, etc.)

💡 As this article argues, I believe an academic "speed threshold" does not exist. The above aren't sources for the thresholds themselves, but background knowledge for "how to think about speech rate and pauses." I've checked each against the original.

Next time is the finale — I look back at the "AI-collaborative development" that underpinned all these design decisions → satellite #4, "Reflections on AI-collaborative development & a collection of stumbles" (https://dev.to/uya0526design/not-did-you-use-ai-but-are-you-the-one-driving-reflections-on-building-a-real-product-through-464m).


This article is part of my public learning journey using AI tools (Claude / Cursor). The thresholds and the judgment of their basis are mine. I collaborate with AI on the article's structure, outline, and draft prose, and I review and revise every line before publishing.