China’s AI Price War Escalates as Xiaomi Slashes API Costs by 99 Percent
Jakob Steins · 2026-05-27 · via Trending Topics

Within just a few days, two Chinese AI providers have permanently and drastically cut their API prices per token, pushing the global price war over large language models into a new phase. Xiaomi is rolling out price cuts of up to 99 percent for its MiMo-V2.5 series starting today, May 27, 2026. DeepSeek, in parallel, is making permanent the discount campaign introduced the previous month for its flagship V4-Pro, keeping usage costs at one quarter of the original level.

Both moves target the same market segment: paying enterprise customers and developers who process billions of tokens daily and for whom the price per million tokens has become the most closely watched operating metric.

Xiaomi: Price Reduction of Up to 99 Percent and Reset of All Token Packages

With the MiMo-V2.5 series, Xiaomi is positioning itself unmistakably as a price disruptor. The Chinese conglomerate, which under CEO Lei Jun intends to invest at least $8.7 billion in AI by 2029, announced several measures in a single package:

  • The API prices for the entire V2.5 series are being cut by up to 99 percent compared to the previous rate, according to the official announcement. The previous differentiation by input length is being dropped — there is now only a single uniform price per million tokens.
  • Existing customers with an active “Token Plan” will receive five to eight times the amount of usable credits for the same price.
  • Credits already consumed in active packages will be fully reset as of the cutoff date, a classic lock-in move to prevent churn to competitors.
  • The “100 Trillion Token Creator Incentive” program launched at the end of April was ended early on May 26, because the entire token volume had already been distributed.
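The Token Plan change can be read as an effective price cut. A minimal sketch, assuming the plan price stays the same and only the credit volume grows as described; the function name `effective_discount` is hypothetical and used only for illustration:

```python
def effective_discount(credit_multiplier: float) -> float:
    """Fraction by which the per-credit price falls when the same payment
    buys `credit_multiplier` times as many credits."""
    if credit_multiplier <= 0:
        raise ValueError("credit_multiplier must be positive")
    return 1.0 - 1.0 / credit_multiplier


# The article cites five to eight times the credits for the same price.
for multiplier in (5, 8):
    discount = effective_discount(multiplier)
    print(f"{multiplier}x credits -> {discount:.1%} lower effective price")
```

Under this assumption, existing Token Plan customers see an effective reduction of 80 to 87.5 percent, which is smaller than the headline figure of up to 99 percent quoted for the API list prices.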

On third-party platforms such as OpenRouter, MiMo-V2.5-Pro is currently listed at $0.435 per million input tokens and $0.87 per million output tokens. The model features a context window of one million tokens and, according to Xiaomi, positions itself against the Western top tier on benchmarks such as SWE-bench Pro and ClawEval.
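To put the listed rates into perspective, here is a rough cost estimate at the OpenRouter prices quoted above. The workload (one billion input tokens and 200 million output tokens per day) is a hypothetical assumption, not a figure from Xiaomi or OpenRouter:

```python
# Listed OpenRouter prices for MiMo-V2.5-Pro, in USD per million tokens.
INPUT_USD_PER_M = 0.435
OUTPUT_USD_PER_M = 0.87


def request_cost_usd(input_tokens: int, output_tokens: int,
                     input_price: float = INPUT_USD_PER_M,
                     output_price: float = OUTPUT_USD_PER_M) -> float:
    """Cost in USD for a given number of input and output tokens."""
    return (input_tokens / 1_000_000 * input_price
            + output_tokens / 1_000_000 * output_price)


# Hypothetical daily workload, chosen only for illustration.
daily_cost = request_cost_usd(1_000_000_000, 200_000_000)
print(f"Estimated daily cost: ${daily_cost:,.2f}")
```

The same function can be called with other per-million prices to compare providers on an identical workload.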

DeepSeek: From Temporary Discount to Permanent Price

DeepSeek, for its part, is making its time-limited discount campaign permanent. API prices for V4-Pro remain in a range of 0.025 to 6 yuan per million tokens (approximately $0.0035 to $0.83), one quarter of the earlier range of 0.1 to 24 yuan. The exact amount depends on whether tokens are billed as input (text the model reads) or as output (the significantly more compute-intensive text generation).
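The figures for V4-Pro can be cross-checked with a few lines of arithmetic. This is a sketch: the exchange rate of 7.2 yuan per US dollar is an assumption chosen for illustration, since the article does not state the rate it used.

```python
CNY_PER_USD = 7.2  # assumed exchange rate, not stated in the article

OLD_RANGE_CNY = (0.1, 24.0)   # yuan per million tokens before the cut
NEW_RANGE_CNY = (0.025, 6.0)  # yuan per million tokens after the cut


def to_usd(amount_cny: float, rate: float = CNY_PER_USD) -> float:
    """Convert a yuan amount to US dollars at the assumed rate."""
    return amount_cny / rate


# Ratio of new to old price at both ends of the range.
ratios = [new / old for old, new in zip(OLD_RANGE_CNY, NEW_RANGE_CNY)]
print("new/old price ratios:", [round(r, 4) for r in ratios])

low_usd, high_usd = (to_usd(p) for p in NEW_RANGE_CNY)
print(f"new range in USD: ${low_usd:.4f} to ${high_usd:.2f}")
```

Both ends of the range come out at one quarter of the previous level, matching the 75 percent reduction, and the dollar values agree with the article's approximations at the assumed rate.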

Also notable is the parallel capital raise: the lab founded by hedge fund billionaire Liang Wenfeng is opening its cap table to external investors for the first time. According to reports from the Financial Times, Bloomberg, and the South China Morning Post, it is seeking a round of three to four billion US dollars at a valuation of up to $50 billion, led by the state-owned Chinese semiconductor fund “Big Fund III,” with participation from Tencent, Alibaba, and Hillhouse. It would be the first known investment by the Big Fund in a Chinese LLM provider, and a political signal that Beijing is positioning DeepSeek as a national champion.

How Are Such Prices Even Sustainable?

The key question for Western providers and investors is: how can Chinese providers offer prices that are a fraction of what OpenAI or Anthropic charge, without structurally running at a loss? The answer lies in a combination of three layers — hardware, software, and political economy.

1. Inference Optimization at the Software Level. In its announcement, Xiaomi is surprisingly open about where the leverage lies. The conglomerate’s inference team has fundamentally rebuilt the KV cache architecture, the memory mechanism that stores the attention keys and values of tokens already processed so that they do not have to be recomputed for every new token. SGLang HiCache is used in combination with Sliding Window Attention (SWA). HiCache arranges the KV cache like a CPU cache hierarchy, across three levels: GPU memory as L1, host memory as L2, and distributed storage as L3. According to Xiaomi, this cuts the volume of data transferred between memory levels to approximately one-seventh of the previous value, while the number of cacheable tokens increases fivefold. In practical terms, for recurring requests with similar prefixes, such as in coding agents or multi-turn conversations, the model has to recompute far less often.
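The three-level idea can be illustrated with a toy cache. This is a simplified sketch and not SGLang HiCache's actual implementation or API: the class names, the capacities, the LRU eviction policy, and the exact-match prefix keys are all assumptions made for illustration. Real serving systems match on token-level prefixes and move tensors, not strings.

```python
from collections import OrderedDict


class Tier:
    """One cache level with LRU eviction; capacity counts cached prefixes."""

    def __init__(self, name: str, capacity: int):
        self.name = name
        self.capacity = capacity
        self.entries = OrderedDict()

    def take(self, key):
        """Remove and return the entry for `key`, or None if absent."""
        return self.entries.pop(key, None)

    def put(self, key, value):
        """Insert as most recent; return the evicted (key, value) or None."""
        self.entries[key] = value
        self.entries.move_to_end(key)
        if len(self.entries) > self.capacity:
            return self.entries.popitem(last=False)
        return None


class TieredPrefixCache:
    """Toy hierarchy: evictions cascade from L1 down to L3, hits move to L1."""

    def __init__(self, capacities=(2, 4, 8)):
        names = ("L1 GPU memory", "L2 host memory", "L3 distributed storage")
        self.tiers = [Tier(n, c) for n, c in zip(names, capacities)]

    def _insert(self, key, value):
        for tier in self.tiers:
            evicted = tier.put(key, value)
            if evicted is None:
                return
            key, value = evicted  # push the evicted entry one level down
        # Anything evicted from the last tier is dropped.

    def lookup(self, prefix, compute):
        """Return (cached_state, source); recompute only on a full miss."""
        for tier in self.tiers:
            value = tier.take(prefix)
            if value is not None:
                self._insert(prefix, value)  # promote back to L1
                return value, tier.name
        value = compute(prefix)
        self._insert(prefix, value)
        return value, "recomputed"


def fake_prefill(prefix):
    """Stand-in for the expensive prefill computation."""
    return f"kv-for-{prefix}"


cache = TieredPrefixCache(capacities=(1, 1, 1))
for prompt in ["system-prompt-A", "system-prompt-B", "system-prompt-A"]:
    _, source = cache.lookup(prompt, fake_prefill)
    print(prompt, "->", source)
```

With one slot per level, the second request pushes the first prefix from L1 into L2, so the third request is served from host memory instead of being recomputed. Larger lower tiers are what allow more tokens to stay cacheable than GPU memory alone could hold.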

2. An Independent Hardware Strategy. For V4, DeepSeek relies consistently on Huawei Ascend 950 chips instead of Nvidia GPUs, which are difficult for Chinese customers to obtain due to US export controls. The company has indicated that infrastructure costs will continue to fall once the so-called supernodes of the Ascend series are deployed more broadly in the second half of 2026. The combination of DeepSeek and Huawei is regarded strategically as the core of an independent Chinese AI stack. What began as a workaround against export restrictions is developing into structural cost arbitrage: Ascend chips are cheaper to procure in China and are billed without US margin markups.

3. Political Economy. With the entry of “Big Fund III” (should it be confirmed), DeepSeek would effectively become a state co-financed champion. This fundamentally changes the business logic: a company that does not primarily need to be profitable in the short term, but is instead meant to capture strategic market share in a geopolitically contested sector, can offer prices that are barely economically viable for purely privately financed competitors. Xiaomi, too, funds its AI division from the cash flow of a profitable consumer electronics conglomerate, which cross-subsidizes the announced $8.7 billion in investments.

What This Means for the Market

For Western providers such as OpenAI and Anthropic, the situation is becoming increasingly uncomfortable. For their top models, both companies typically charge several times the per-million-token price that DeepSeek and Xiaomi now offer. For pure commodity workloads (classification, translation, simple extraction), the switching barrier will continue to fall. The picture is different for complex reasoning, agent, and coding workloads, where model quality, security tooling, and enterprise integration remain the differentiating factors.

For startups and developers, the short-term consequence is above all this: capable reasoning models are increasingly priced like infrastructure, not like premium software. Anyone validating an idea with an AI backend today can do so at unit costs that would have been unthinkable two years ago.

In the medium to long term, the more strategic question arises: if inference costs are pushed further toward marginal cost, value creation shifts away from the pure model and toward data integration, tooling, security, and vertical expertise. That is precisely where Western providers want to defend their pricing power. Whether they succeed will also depend on how aggressively Chinese providers extend their cost advantage into Western markets — and how quickly open-source alternatives continue to close the gap on the model side.
