惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

B
Blog
Attack and Defense Labs
Attack and Defense Labs
大猫的无限游戏
大猫的无限游戏
爱范儿
爱范儿
MongoDB | Blog
MongoDB | Blog
Last Week in AI
Last Week in AI
Engineering at Meta
Engineering at Meta
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
月光博客
月光博客
IT之家
IT之家
D
Docker
L
LangChain Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
S
SegmentFault 最新的问题
Martin Fowler
Martin Fowler
Recorded Future
Recorded Future
C
CERT Recently Published Vulnerability Notes
H
Hackread – Cybersecurity News, Data Breaches, AI and More
P
Privacy International News Feed
博客园 - 三生石上(FineUI控件)
博客园 - Franky
Cisco Talos Blog
Cisco Talos Blog
C
Cyber Attacks, Cyber Crime and Cyber Security
A
About on SuperTechFans
Recent Announcements
Recent Announcements
云风的 BLOG
云风的 BLOG
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
博客园 - 聂微东
酷 壳 – CoolShell
酷 壳 – CoolShell
G
GRAHAM CLULEY
P
Proofpoint News Feed
L
Lohrmann on Cybersecurity
T
The Blog of Author Tim Ferriss
T
Threat Research - Cisco Blogs
GbyAI
GbyAI
P
Palo Alto Networks Blog
Cyberwarzone
Cyberwarzone
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
SecWiki News
SecWiki News
Help Net Security
Help Net Security
有赞技术团队
有赞技术团队
Blog — PlanetScale
Blog — PlanetScale
Cloudbric
Cloudbric
C
Cybersecurity and Infrastructure Security Agency CISA
量子位
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
I
Intezer
C
Cisco Blogs
The Cloudflare Blog
S
Securelist

OfficeChai

These Are The 10 Cheapest AI Models In The World [June 2026] 18 Best AI Tools For English Speaking (With Examples) [2026] AI Impact? Vacancy Rates For US Office Properties Are Now Highest Since The 2008 Crisis KPMG Pulls Report Praising AI After It Was Found To Have Fake AI-Generated Citations India's Sarvam Raises $234 Million At $1.5 Billion Valuation After SpaceX Stock Pops 20%, Musk Has Made More Money In The Last 24 Hours Than Warren Buffett Made In His Entire Career OfficeChai Nobody Is Using AI Better Than Meta: NVIDIA CEO Jensen Huang 21 Best AI Tools For Animation (With Examples) [2026] 22 Best AI Tools For Architecture (With Examples) [2026] Datacenter Construction Spending Has Eclipsed Public Transportation Spending In The US China Scraps 12,000 Degree Courses, Mainly In Arts And Humanities, To Prepare For AI Age OfficeChai There Is No Job Loss With AI: David Friedberg Loop Between Human Capital And "Token Capital" Will Be The New IP For Firms, Says Satya Nadella How to Reduce Dependency on Key Employees 8 Google Index Checker Use Cases Beyond New Blog Posts Memory Squeeze? Smartphone Purchases Are Down Globally 21 Best AI Tools For Accounting (With Examples) [2026] AI For Voice Generation: 22 Best Options (With Examples) [2026] These Are The Most Popular Image Generation Models On OpenRouter [June 2026] Search Traffic For Websites Is Down 25% Over The Last Year Because Of AI: a16z Data Agentic Coding Has Led To A 50% Increase In Number Of Apps, But Most Are Finding Very Few Users: SimilarWeb Data OpenRouter Launches Fusion API, Which Uses A Combination Of Models To Achieve Fable-Like Performance At Half The Price Dario Amodei Refused To De-Deploy Or Fix Vulnerabilities In Fable Before US Export Controls, Says David Sacks 23 Best AI Tools For Notes Making (With Examples) [2026] 16 Best AI Tools For Astrology (With Examples) [2026] How Jensen Huang Once Had To Ask SEGA's CEO To Pay NVIDIA For A Technology That Didn't Work ChatGPT Already Has 11% Of The Search Market: OpenAI CFO Sarah Friar SpaceX Has Now Launched More Satellites Than Rest Of Humanity Combined Across History Globalization Is Dead, Time For India To Wake Up Says Sridhar Vembu After US Bans Anthropic Mythos And Fable Models For Foreign Users Elon Musk Becomes World's First Trillionaire After Record SpaceX IPO Anthropic Suspends Access To Mythos And Fable Models Following US Govt Directive Against Foreign Users 27 Best AI Tools For Market Research (With Examples) [2026] Why Jeff Bezos Makes Important Decisions Early in The Morning Education And Healthcare IT Have Been The Hardest Areas To Invest In: Peter Thiel Giving AI Long-Term Goals Could Lead To The Emergence Of Self-Preservation: Geoffrey Hinton Your Startup Doesn't Have a Hardware Problem. It Has an Accountability Problem Cyber Incidents Rarely Start With a Hacker: The Weak Links Businesses Overlook What Makes an App Worth Returning to Every Day? 21 Best AI Tools For Lead Generation (With Examples) [2026] How NBA Player Shaquille O'Neal Became An Early Investor In Ring AI For Kids Learning: 22 Best Options (With Examples) [2026] These Are The Most Popular AI Model Companies On OpenRouter [June 2026] Advanced Fintech and NeoBank Software Development Solutions: Building the Digital Banks of Tomorrow TRON Payments: Integrating AML Checks Into Business Workflows 18 Best AI Tools For Resume (With Examples) [2026] 16 Best AI Tools For UI Design (With Examples) [2026] These Are Top 10 Countries Generating The Most Internet Traffic How to Choose the Best Magento Agency for Your Store These Are The Best AI Models For Creative Writing [June 2026] AI For Managers: 28 Best Tools (With Examples) [2026] 17 AI Tools For Trading (With Examples) [2026] AI Has Led To An Explosion Of New Apps, But Nearly None Have Managed To Garner Significant Usage Cloudflare CEO Matthew Prince Says Vinod Khosla Asked Him To Fire His Co-founders For Him To Invest In His Company Australia’s AirTrunk To Invest $30 Billion To Develop Datacenters In India Anthropic Says That Their Employees Are Using AI To Write 8x More Code Compared To 18 Months Ago Anthropic Is Extremely Expensive, Many Are Urgently Looking For Alternatives: Microsoft AI CEO Mustafa Suleyman Sergey Tokarev on creating DIY “Beehives” and a free guidebook AI Crypto Price Prediction: How Accurate Are Machine Learning Models? Why Anthropic Could Find It Hard To Maintain Its $965 Billion Valuation Startup CEO Says They're Saving "Millions Of Dollars" By Replacing Anthropic Models With DeepSeek Ola Cabs' Valuation Falls 99% From Peak, Now Valued At Just $70 Million By Vanguard After TCS Case, Former Wipro Employee Alleges Attempt At Religious Conversion By Coworkers Bot Traffic Has Surpassed Human Traffic On The Internet For The First Time In History, Clouflare Says ChatGPT's Free Users Do 7 Queries Per Day, Those On $20 Plan Do 3x More: CFO Sarah Friar How Keith Rabois Had Been "Highly Skeptical" In 2023 That Anthropic Would Be Worth More Than $5 Billion In 10 Years How to Install AdGuard Home with Docker Step by Step We're Running Out Of Training Data, But Not Too Worried Because There Are Alternate Approaches: Google's Jeff Dean JioHotstar Is Hiring For 75 AI Roles Amid AI Content Push NVIDIA's Nemotron 3 Becomes Most Intelligent Open Weights Model From The US Hackers Allegedly Fooled Meta's AI To Take Over Accounts By Simply Asking It To Change User Emails Manchester Super Giants' AI Promotional Video Gets Panned As "Slop" For Glaring Cricketing Errors AI Reducing Jobs Is "Complete Nonsense": NVIDIA CEO Jensen Huang MiniMax Releases MiniMax M3, Is Competitive With Frontier Models On Many Benchmarks IIT Delhi-Incubated BotLab Dynamics Lights Up Skies With Lord Shiva Themed Drone Show During IPL Final NVIDIA Introduces RTX Spark, A New Chip Optimized For AI Agents For Windows Laptops And PCs NVIDIA Introduces Vera, A New CPU Chip For AI Agents That Is 80% Faster Than x86 CPUs OpenAI's Codex Reaches 5 Million Users, Resets Rate Limits For Users Key Factors That Influence Personal Loan Approval in India AI Is Allowing Me To Experiment And Try Crazier Things: Mathematician Terrance Tao Efficiency Of Human Learning Is Still A Thousand Times Better Than LLM Learning, Need Algorithmic Advances To Improve It: Jeff Dean San Francisco Home's Zillow Listing Says It'll Accept OpenAI Or Anthropic Stock As Payment Open-Source Models Currently Lag Proprietary Models By Just 4 Months: Epoch AI Self-Improvement Possible In AI Models Within A Year, Say Google's Top AI Leaders Digital Minds: Preparing for a Moral Challenge Before It Arrives Nearly 30% Of US-Based Y-Combinator Founders Are Of Indian Origin: SF Chronicle Data "A New Era Of PC": NVIDIA, Microsoft Windows Tease New Collaboration At Least 146,000 AI Hallucinated Citations In Papers Published In 2025, Finds Paper AI Doesn't Undergo Experiences, Has No Moral Conscience: Pope Leo XIV Claude Opus 4.8 Tops Artificial Analysis Intelligence Index, Edges Out GPT 5.5 With Score Of 61.4 Anthropic Says Its Annual Revenue Run-rate Has Now Touched $47 Billion Anthropic Raises $65 Billion At $965 Billion Valuation, Is Now Worth More Than OpenAI Claude Opus 4.8 Is Better Than Opus 4.7 But Not As Good As Mythos Preview, Says Anthropic Claude Opus 4.8 Beats GPT 5.5 On GDPval-AA Benchmark For Real World Tasks Anthropic Releases Claude Opus 4.8, Beats Opus 4.7, GPT-5.5 On Many Benchmarks GTM for Tech Startups Explained How to Use an AI Picture Generator to Create Professional Images Anthropic Is Now Generating 35% More Revenue Than OpenAI: The Information SK Hynix, Micron Join $1 Trillion Club Following AI-Led Memory Shortages
These Are The Most Popular AI Models On OpenRouter [June 2026]
OfficeChai Team · 2026-06-17 · via OfficeChai

OpenRouter’s monthly leaderboard is one of the cleaner signals in AI — it tracks token consumption across thousands of developers and apps, which means it reflects actual usage, not just benchmark hype. The June 2026 rankings tell a clear story: Chinese open-source models have taken over the top of the chart, Anthropic’s Claude family holds firm in the middle tier, and the rest of the field is scrambling for scraps.

Here’s a breakdown of every model in the top 10.

1. DeepSeek V4 Flash — 10.9T tokens (+995%)

The runaway leader. DeepSeek V4 Flash is an MoE model built around DeepSeek Sparse Attention (DSA) and token-wise compression, making 1M-context inference practical at scale — and DeepSeek offers it as the default. The nearly 10x growth in token consumption reflects a model that hit production pipelines hard and stayed there.

It does come with a caveat: V4 Flash hallucinates 96% of the time when it doesn’t know an answer, preferring a confident wrong response over abstention. For production deployments where accuracy matters more than throughput, that’s a meaningful risk. But for developers optimizing for speed and volume, the price-to-performance ratio is hard to beat.


2. Hy3 Preview — 10.7T tokens (+>999%)

Tencent’s Hy3 Preview is the biggest surprise on the leaderboard. Released in late April 2026 — less than three months after Tencent rebuilt its pre-training infrastructure from scratch — it went from zero to near-parity with DeepSeek V4 Flash in a single month.

Hy3 is a 295B-parameter MoE model with only 21B active parameters per inference pass. It’s optimized for agentic workflows, long-context understanding, and instruction following. On BrowseComp, a benchmark for complex web research, it reached 67.1%, a dramatic jump from Hy2’s 28.7%. Its pricing — $0.063 per million input tokens — makes it one of the most accessible models on the market. The >999% growth figure tells you everything: this model essentially didn’t exist on OpenRouter last month.


3. Claude Opus 4.7 — 7.48T tokens (+197%)

Claude Opus 4.7 is Anthropic’s flagship publicly-available model, and it’s the highest-ranked closed-source model on the leaderboard. The 197% growth isn’t viral surprise — it’s steady production adoption.

Opus 4.7 leads GPT-5.4 and Gemini 3.1 Pro on most key agentic benchmarks, including SWE-bench Pro (64.3%) and SWE-bench Verified (87.6%). Anthropic’s data shows it tops the Artificial Analysis GDPval-AA benchmark for general agentic performance across 44 occupations. Claude Code now accounts for roughly 4% of all public GitHub commits — and Opus 4.7 is what’s powering most of those workflows. For enterprise developers who need reliability alongside raw capability, Opus 4.7 is the benchmark.


4. Claude Sonnet 4.6 — 7.45T tokens (+34%)

Claude Sonnet 4.6 is the workhorse of the Claude lineup — fast, cost-efficient, and capable enough for most production use cases that don’t require Opus-level reasoning. The more modest 34% growth reflects a model that’s already deeply embedded in workflows rather than one riding a launch spike. Sitting just 30 billion tokens behind Opus 4.7 suggests many teams are running both, routing simpler tasks to Sonnet and harder ones to Opus.


5. Owl Alpha — 5.03T tokens (+>999%)

OpenRouter’s own model, built specifically for the platform. The >999% growth suggests it’s functioning as a default fallback or routing layer for traffic that doesn’t specify a model. Without external benchmarks to evaluate it against, it’s difficult to assess on capability alone — but its position here says something about the value of platform-native distribution.


6. Gemini 3 Flash Preview — 4.6T tokens (+3%)

Google’s Gemini 3 Flash is designed for high-frequency, latency-sensitive workflows — the kind of production pipelines where you’re calling a model thousands of times a day and raw intelligence is less important than speed and cost. The near-flat 3% growth puts it in a different category from the models above it: this is a mature, stable choice for teams that have already built around it. Gemini 3 Pro dominates the intelligence benchmarks in Google’s lineup, but Flash is where the volume lives.


7. DeepSeek V4 Pro — 4.54T tokens (+739%)

The more powerful sibling to V4 Flash. DeepSeek V4 Pro runs 1.6 trillion total parameters with 49 billion active — more than double V3’s architecture — and scores 52 on the Artificial Analysis Intelligence Index, making it the #2 open-weights model behind Kimi K2.6. The 739% growth is high, but V4 Flash’s 995% suggests developers are preferring speed over peak capability for most workloads. V4 Pro carries the same hallucination caveat as Flash, at a rate of 94%.


8. DeepSeek V3.2 — 4.31T tokens (-14%)

The only model in the top 10 with declining usage. DeepSeek V3.2 is being cannibalized by its successors — both V4 Flash and V4 Pro offer better performance at competitive prices, and the 14% drop is a predictable consequence of the V4 launch. V3.2 remains a capable reasoning-first model built for agentic tasks, and it will likely retain a long tail of users who’ve built stable pipelines around it. But the trajectory is clear.


9. Kimi K2.6 — 3.72T tokens (+1%)

Moonshot AI’s Kimi K2.6 is the top-ranked open-weights model on the Artificial Analysis Intelligence Index at 54 — just three points behind the closed-source trio of Claude Opus 4.7, GPT-5.4, and Gemini 3.1 Pro, all tied at 57. It’s a 1-trillion-parameter MoE model with 32B active parameters and native support for image and video input.

The nearly flat growth (+1%) despite strong benchmark performance is interesting. Kimi K2.6 may be a developers’ model — widely respected, heavily evaluated, but not yet embedded in the kind of high-volume pipelines that drive token counts into the trillions. The Chinese open-source surge is real, but token leadership still correlates with price and infrastructure availability, not just benchmark rank.


10. Nemotron 3 Super (free) — 2.65T tokens (+3%)

Nvidia’s entry into the open-weights race. Nemotron 3 Super is a 120B-parameter hybrid Mamba-Transformer MoE model that scores 48 on the Artificial Analysis Intelligence Index — the highest of any US open-weights model, though still behind the Chinese-led frontier. Nvidia offers it free on OpenRouter, which explains its position here. The 3% growth is modest but steady. Its real advantage isn’t intelligence — it’s inference speed, serving over 300 tokens per second compared to 50–100 for comparable Chinese models. For latency-sensitive workloads, that matters.


The bigger picture

The June 2026 leaderboard makes the structural shift in AI hard to ignore. Chinese open-source models occupy six of the top ten spots, and the two at the very top grew by close to 1,000% in a single month. Chinese models have already displaced US open models as the developer community’s default choice — not because of benchmark games, but because they’re fast, cheap, and available.

Anthropic is the exception among Western labs, holding positions three and four through genuine production utility. The Claude family’s staying power comes from agentic performance and enterprise trust — qualities that take longer to build but are harder to displace.

The decline of DeepSeek V3.2 is also worth noting as a structural signal: in this market, even strong models become obsolete within a few months. The labs releasing new architectures fastest are winning the leaderboard — and right now, that race is being run largely out of China.