Value for Money Is All You Need
BEKOUTI · 2026-06-22 · via HN's home page


A reflection on the future of token consumption in artificial intelligence

Token consumption now sits at the center of the growing use of artificial intelligence by businesses and individuals alike.

The "TokenMaxxing" trap

In the early days, the trend was to maximize token consumption from proprietary LLMs regardless of cost. Heavy usage was treated as a marker of performance for the user, the employee, or the company. This phenomenon, known as "TokenMaxxing," reportedly led Uber to burn through its entire annual budget before the year was out.

Faced with the enormous financial cost of TokenMaxxing, many companies and individuals turned to lower-cost LLMs to preserve their budgets. This fueled the rise of Chinese open-source LLMs such as DeepSeek, in line with Harvard professor Clayton Christensen's theory that disruptive innovation can conquer a market through low prices.

Users thus found themselves facing a dilemma: choose a highly capable but token-expensive proprietary model, or a less capable but more budget-friendly open-source model.

The temptation of dumping

To resolve this dilemma, Sam Altman, CEO of OpenAI, promised to lower the cost of OpenAI's tokens. The aim is to stand out from the competition, gain ground in the AI space, and make OpenAI's highly capable models more affordable per token.

While commendable, this initiative exposes OpenAI to two major risks:

A considerable financial risk: this token dumping could negatively impact OpenAI's profitability, making the strategy difficult to sustain over time.

A market risk: dumping in no way guarantees an increase in OpenAI's market share against a competitor like Anthropic, since users remain willing to pay a high price if they can afford it and if the expensive tokens they purchase generate returns far superior to those of cheaper tokens.
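The market risk can be made concrete with a little arithmetic. The sketch below is only an illustration: the function name `net_return`, the prices, the token counts, and the value figures are all hypothetical and do not come from any provider's price list.

```python
def net_return(value_generated: float, tokens: int, price_per_million: float) -> float:
    """Value produced by a task minus the cost of the tokens it consumed."""
    token_cost = tokens / 1_000_000 * price_per_million
    return value_generated - token_cost


# Hypothetical figures, chosen only to illustrate the argument.
premium = net_return(value_generated=500.0, tokens=2_000_000, price_per_million=30.0)
budget = net_return(value_generated=300.0, tokens=2_000_000, price_per_million=3.0)

print(f"premium model net return: {premium:.2f}")
print(f"budget model net return:  {budget:.2f}")
```

With these assumed numbers the expensive tokens cost ten times more yet still leave the larger net return, which is why a price cut alone may not move users who already see that kind of payoff.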

Current initiatives around token utilization

To resolve this cost-versus-quality trade-off faced by users, a new philosophy is now emerging: that of cost efficiency. Several interesting initiatives reflect this shift:

OpenRouter merges models in an attempt to reduce costs while still providing access to the most powerful models available, but the operation of its AI agents generates considerable hidden costs.

Chinese open-source models such as GLM 5.2 are highly capable and cheaper than proprietary models from OpenAI or Anthropic, while still being notably more expensive than other open-source models.

Ponytail strips away everything superfluous in code to preserve only the essential, thereby reducing token cost while preserving quality regardless of the LLM used. However, it risks being too minimalist and insufficiently flexible to understand the context in which a user introduced lines of code that are essential to them but that Ponytail might judge superfluous.

Headroom promises, through compression, to cut token costs by 95%, but the hidden costs tied to running its AI agent risk undermining this commendable goal.
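The point about hidden costs comes down to a break-even calculation. The following is a minimal sketch under stated assumptions: `net_saving` is a hypothetical helper, the monthly figures are invented, and only the 95% reduction is taken from the text. It does not describe how any of the tools above actually bill.

```python
def net_saving(baseline_cost: float, reduction: float, overhead_cost: float) -> float:
    """Saving that remains after paying to run the cost-reduction tool itself."""
    if not 0.0 <= reduction <= 1.0:
        raise ValueError("reduction must be a fraction between 0 and 1")
    return baseline_cost * reduction - overhead_cost


# Hypothetical monthly figures; only the 95% reduction comes from the text.
baseline = 1000.0
light_overhead = net_saving(baseline, reduction=0.95, overhead_cost=200.0)
heavy_overhead = net_saving(baseline, reduction=0.95, overhead_cost=1200.0)

print(f"light overhead: {light_overhead:.2f}")
print(f"heavy overhead: {heavy_overhead:.2f}")
```

The tool pays for itself only while its overhead stays below `baseline_cost * reduction`. Past that point the advertised reduction turns into a net loss, as in the second case.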

Ultimately, all of these projects are commendable and worth encouraging, as they help address a problem that still stands in the way of the broader adoption of artificial intelligence.

The real challenge: Value For Money

In my view, the real challenge lies neither in price, nor in quality, nor in a performance-cost trade-off, nor even in cost efficiency. The real challenge lies in Value For Money.

Value For Money rests on three cumulative criteria:

Cost
Quality
Protection against risk(s)

Together, these three criteria deliver the best quality, at the lowest cost, with the least risk.
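One way to picture how the three criteria combine is a single score per option. The formula below is an assumption made for illustration, not the author's definition: it treats Value For Money as risk-discounted quality per unit of cost. The `Option` class, the option names, and every number are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    name: str
    cost: float     # cost per task, in any currency unit
    quality: float  # 0.0 to 1.0, higher is better
    risk: float     # 0.0 to 1.0, estimated chance of a costly failure


def value_for_money(option: Option) -> float:
    """Hypothetical score: quality discounted by risk, per unit of cost."""
    if option.cost <= 0:
        raise ValueError("cost must be positive")
    return option.quality * (1.0 - option.risk) / option.cost


# Invented options, for illustration only.
options = [
    Option("premium", cost=6.0, quality=0.9, risk=0.05),
    Option("budget", cost=1.0, quality=0.5, risk=0.50),
    Option("mid", cost=2.0, quality=0.8, risk=0.10),
]

best = max(options, key=value_for_money)
print(best.name)
```

Under these assumed numbers the winner is neither the cheapest option nor the highest-quality one, which is the sense in which the three criteria are cumulative rather than interchangeable.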

A new philosophy

Value For Money is the new paradigm that should guide AI labs and companies in how they approach token usage. That is why I am currently working on a project — soon to be available — to help remove, together with anyone willing to join me on this journey, the obstacle that token consumption represents