惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
J
Java Code Geeks
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
H
Hackread – Cybersecurity News, Data Breaches, AI and More
V
Visual Studio Blog
G
Google Developers Blog
V
V2EX
The Register - Security
The Register - Security
博客园 - 三生石上(FineUI控件)
云风的 BLOG
云风的 BLOG
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
博客园_首页
S
SegmentFault 最新的问题
博客园 - Franky
Martin Fowler
Martin Fowler
Stack Overflow Blog
Stack Overflow Blog
A
About on SuperTechFans
人人都是产品经理
人人都是产品经理
aimingoo的专栏
aimingoo的专栏
罗磊的独立博客
C
Check Point Blog
MyScale Blog
MyScale Blog
T
The Blog of Author Tim Ferriss
MongoDB | Blog
MongoDB | Blog
The GitHub Blog
The GitHub Blog
Last Week in AI
Last Week in AI
Microsoft Azure Blog
Microsoft Azure Blog
IT之家
IT之家
F
Fortinet All Blogs
Jina AI
Jina AI
P
Proofpoint News Feed
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
阮一峰的网络日志
阮一峰的网络日志
B
Blog
L
LangChain Blog
月光博客
月光博客
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
宝玉的分享
宝玉的分享
博客园 - 【当耐特】
T
Tailwind CSS Blog
酷 壳 – CoolShell
酷 壳 – CoolShell
Microsoft Security Blog
Microsoft Security Blog
WordPress大学
WordPress大学
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
B
Blog RSS Feed
博客园 - 聂微东
Hugging Face - Blog
Hugging Face - Blog
M
MIT News - Artificial intelligence
GbyAI
GbyAI

The Decoder

The AI industry's platform trap is starting to look a lot like Microsoft's OpenAI buys Ona to push Codex toward long-running, autonomous coding tasks Jeff Bezos' AI startup Prometheus closes $12 billion round at a $41 billion valuation Free Deezer tool lets users on any streaming service check their playlists for AI music OpenAI vs. Anthropic: A price war over API tokens is brewing Dario Amodei's new essay reads like a Cold War playbook for the AI age Claude Fable 5: Anthropic admits "wrong tradeoff" after invisibly throttling rival AI researchers Google's new open model DiffusionGemma generates text from noise instead of word by word OpenAI's IPO slips as Altman tells staff to expect a public offering "within the next year" Anthropic study shows AI needs hours, not weeks, to build exploits from security patches OpenAI wants its biggest data center yet, and Nvidia would back the bill Claude Fable 5: The first Mythos model is powerful, expensive, and heavily filtered Germany's National Security Council greenights an AI Safety Institute modeled after the UK's AISI Google's NotebookLM now runs its own cloud computer with code execution and agent-based research Anthropic releases Claude Fable 5 and Mythos 5 with major gains in coding and science Google's Gemini 3.5 Live Translate delivers real-time voice translation across 70+ languages SpaceX wants to put data centers in orbit, and Musk says it's no big deal Landmark German ruling declares Google's AI Overviews are Google's own words and makes it liable for false answers Beijing's $295 billion AI buildout would require 80 percent domestic chips, locking out US suppliers Apple Intelligence gets a second shot with help from Google and Nvidia OpenAI now says "entirely automating everything is not the future we want" OpenAI says going public is "a complicated set of tradeoffs" and is unsure about the timing Microsoft Research's Lens proves detailed captions matter more than raw scale for training efficient image generators Intel gets a second life as Google and Nvidia explore it as a TSMC backup for AI chips Most companies are flying blind on AI spending Frontier Radar #3: How agentic AI is turning tokens into a business metric Instagram AI chatbot breach may have affected over to 20,000 accounts, Meta discloses Microsoft tightens rules for conflict zones after investigation into Israel's military use of Azure Moonshot AI targets a $30 billion valuation, more than six times its late-2025 worth Deepseek topped Ramp's trending software vendors in June 2026 as US companies chase cheaper AI OpenAI says "chat is dead" and plans to rebuild ChatGPT as a full-blown agent app Perplexity's "Search as Code" lets AI models write their own search pipelines instead of calling fixed APIs ChatGPT's new Lockdown Mode lets you disable web access and more to protect sensitive data from prompt injection Anthropic poaches OpenAI's second-ever chip engineer as both companies race toward IPOs Researchers pinpoint why larger language models pick up skills that small ones miss Sakana AI bets AI that improves itself can break the compute arms race of frontier labs Meta's Hatch AI agent could cost up to $200 a month and marks its first paid AI product Elon Musk's xAI reportedly trained its coding models on Claude outputs for months before getting cut off New open-source voice model listens nonstop and decides every 0.4 seconds whether to speak or stay silent SpaceX signs $920 million per month deal with Google for 110,000 Nvidia AI chips ahead of IPO OpenAI and the Trump administration are negotiating a government stake in the AI startup Qwen3.7-Plus is Alibaba's bid to turn multimodal AI into a full-blown autonomous agent Florida's lawsuit against OpenAI and CEO Altman treats ChatGPT as a defective product and public nuisance Satya Nadella publicly torches a VP's plan to make Microsoft's AI agent deliberately addictive Microsoft trained its MAI models on unlicensed web data despite promising "enterprise grade, clean and commercially licensed data" Anthropic's Mythos model is reportedly powering NSA offensive cyber ops against China and Iran Anthropic says Claude now writes over 90% of its code and wants the world to have an AI pause button Cloudflare CEO says the web's future is "pay to crawl" as bots overtake human traffic ChatGPT now saves narrative dossiers about you sorted by work, hobbies, and travel preferences Bain study finds companies miss AI savings targets because humans keep getting in the way OpenAI CEO Sam Altman sees "proactive AI" as the next big phase after chatbots and agents AI can now coach amateur virologists, and top tech leaders want Congress to act on DNA security xAI updates Grok Imagine to 1.5 with image-to-video generation at 720p resolution Google Deepmind's Gemma 4 12B squeezes multimodal AI onto a laptop with just 16 GB of RAM Google lets sites opt out of AI search results, knowing most have nowhere else to go Ideogram 4.0 drops as an open-weight model with native 2K resolution and improved text rendering Trump's new executive order wants AI companies to voluntarily submit models for government safety reviews Perplexity announces hybrid AI system that decides what runs locally or in the cloud AI music startup Suno doubles its valuation to $5.4 billion while fighting major record labels in court Nous Research releases Hermes Desktop, an open-source AI agent for every platform Build 2026: Microsoft tops Google in image generation while playing catch-up on reasoning OpenAI expands Codex with role-specific plugins to build a general-purpose app for non-developers Anthropic scales Project Glasswing to 150 partners across 15 countries to hunt critical software flaws Hackers hijacked high-profile Instagram accounts by simply asking Meta's AI chatbot to change the email OpenAI turns ChatGPT into a career platform with job search and CV editor Warren Buffett's Berkshire Hathaway bets $10 billion on Alphabet's AI infrastructure buildout OpenAI models now available on Amazon Web Services Claude maker Anthropic files for IPO with the SEC Turing Award winner Richard Sutton says pure generative AI can't do real science MiniMax M3: Open-weight model with a million-token context challenges proprietary leaders Nvidia's Nemotron 3 Ultra becomes the smartest open US model, but China still leads Nvidia bets big on physical AI at GTC Taipei with a new world model, driving brain, and open humanoid robot Nvidia pitches RTX Spark as the chip that finally makes local AI agents practical on Windows devices OpenAI starts with infrastructure robots but aims for "everyone having a personal robot doing anything they need" Ask AI what goes with chicken and the answer depends on whether it learned from recipes or molecules Anthropic bans AI tools during job interviews to see how candidates actually think Anthropic study finds men use AI coding agents more than twice as often as women in social science research SoftBank plans 75 billion euro AI data center buildout in France AI search agents often confirm what they already know instead of actually researching the web Microsoft and Nvidia reportedly team up on AI PCs that run actual agents instead of Copilot Making AI chatbots helpful weakens their ability to simulate human behavior, large-scale study finds Terence Tao argues AI could bring division of labor to math for the first time in history Attackers abuse shared ChatGPT and Claude chats to spread malware OpenAI's Codex can now operate your Windows PC autonomously, hunting bugs and testing apps on its own Salesforce claims AI agents cut a 231-day migration to 13 days with fewer incidents Meta's leaked memo reveals AI pendant, supersensing glasses, and enterprise wearables strategy OpenAI gives GPT-5.5 Instant a readability upgrade while phasing out two older models Google fixes several bugs in Gemini usage limits that burned through quotas too fast One company reportedly spent $500 million on Claude in one month after failing to cap AI usage OpenAI is giving away its life sciences AI model to help governments prepare for the next pandemic New review paper argues code is how AI agents think and act, not just what they produce Amazon kills internal AI leaderboard after employees gamed it with pointless tasks Claude company Anthropic nears a trillion-dollar valuation after raising $65 billion in Series H Anthropic ships Claude Opus 4.8 as a "modest but tangible improvement" that tops GPT-5.5 in most benchmarks Google Cloud responds to AI-accelerated cyberattacks with a platform that aims to close security gaps in minutes Google launches a tiny board that runs Gemma 3 locally Mistral rebrands LeChat as Vibe, betting its chatbot's future is as a full-blown work agent Meta One: Zuckerberg finally puts a price tag on all that AI spending Amazon builds its own AI production platform and greenlights three AI animated series for Prime Video ElevenLabs Music v2 promises opera-to-metal transitions without losing musical coherence
OpenAI's GPT-5.6 Sol launches to rival Claude Mythos under government access rules it calls unsustainable
Matthias Bastian · 2026-06-27 · via The Decoder

OpenAI's new flagship GPT-5.6 Sol claims a lead over Anthropic's Claude Mythos in agentic coding and goes toe to toe with it in cybersecurity. Access stays limited to a handful of partners for now.

OpenAI has unveiled GPT-5.6 Sol, a new generation of models built to compete with Claude's Mythos class. The limited preview is only open to select partners through the API and Codex, at the explicit direction of the US government. The same government previously yanked Anthropic's Mythos-class model Fable 5 off the market.

OpenAI isn't subtle about its frustration. "We don’t believe this kind of government access process should become the long-term default. It keeps the best tools from users, developers, enterprises, cyber defenders, and global partners who need them."

GPT-5.6 also brings a new layered naming scheme that looks a lot like Claude's. The number (x.6) marks the generation, while Sol, Terra, and Luna are permanent performance tiers that can evolve on their own. Sol is the flagship. Terra matches GPT-5.5 at half the cost. Luna is the budget option. On top of that, there's a "max" mode for deeper reasoning and an "ultra" mode that farms out complex tasks to sub-agents running in parallel.

Sol edges past Claude Mythos in agentic coding

OpenAI's benchmark numbers put Sol ahead of Anthropic's Claude Mythos 5 in agentic coding. On Terminal-Bench 2.1, Sol scores 88.8 percent. Sol Ultra hits 91.9, Claude Mythos 5 lands at 88 percent, and Fable 5 trails at 84.3.

GPT-5.6 Sol Ultra tops the Terminal-Bench 2.1 coding benchmark at 91.9 percent. Claude Mythos 5 scores 88.0 percent. Google's Gemini 3.1 Pro Preview brings up the rear at 70.7 percent. | Image: OpenAI

Sol also shows gains in biology. On GeneBench v1, a benchmark for genomics and quantitative biology, it beats GPT-5.5 (30 percent vs. 22 percent best case) while burning fewer tokens.

On ExploitBench, which tests how well AI agents can find and exploit real security flaws in Google's V8 JavaScript engine all the way to full code execution, Sol matches Mythos Preview's performance while using roughly a third of the output tokens, OpenAI says.

On ExploitBench, GPT-5.6 Sol matches Anthropic's Mythos Preview at around 150,000 output tokens. Mythos 5 still leads at around 80 percent, but without comparable efficiency data. | Image: OpenAI

On ExploitGym, a benchmark built by UC Berkeley researchers with OpenAI and other labs, all three GPT-5.6 models get better as reasoning effort goes up. That points to room for scaling with more compute. Claude numbers for this benchmark aren't available yet.

OpenAI calls Sol its most capable cybersecurity model yet but frames it as a defender, not an attacker. The model is better at spotting and fixing flaws than at running full end-to-end attacks on its own, the company says. Mythos pulled that off in a different benchmark.

In tests with Chromium and Firefox, Sol found bugs and exploitation primitives but never produced an autonomous full-chain exploit. OpenAI says GPT-5.6 Sol is still below the "Cyber Critical" threshold in its Preparedness Framework.

Pricing, availability, and a Cerebras launch in July

Per million tokens, OpenAI charges $5 input and $30 output for Sol, $2.50 and $15 for Terra, and $1 and $6 for Luna. The company has also revamped its prompt caching system with explicit cache breakpoints and a guaranteed minimum lifetime of 30 minutes. Cache writes cost 1.25x the regular input price. Cache reads still get a 90 percent discount.

Since Sol uses fewer tokens to match or beat competitors across several benchmarks, the effective cost per task could end up lower than previous generations. That would push back against the trend of AI models getting pricier with each release, a frequent criticism lately, and a competitive weak spot against cheaper Chinese models.

In July, Sol is set to go live on Cerebras at up to 750 tokens per second.

AI News Without the Hype – Curated by Humans

Subscribe to THE DECODER for ad-free reading, a weekly AI newsletter, our exclusive "AI Radar" frontier report six times a year, full archive access, and access to our comment section.

Subscribe now