Cohere sold sovereign AI to enterprises, now it's targeting developers with its first coding model
Paul Sawers · 2026-06-15 · via The New Stack | DevOps, Open Source, and Cloud Native News

Canadian foundation model company Cohere has spent the past few years selling a specific idea to banks, governments, and healthcare providers: that AI should run on their infrastructure, under their control, with their data never leaving the perimeter.

Cohere’s pitch went down well in regulated industries. Now the company is taking it to a different audience, with the launch of North Mini Code — its first coding model, released under an Apache 2.0 license from the get-go.

Model access as infrastructure

The sovereignty argument Cohere has long made to enterprise customers is, at its root, about ownership. Regulated industries have hard requirements: data can’t leave certain boundaries, and the intelligence layer running on sensitive infrastructure needs to be something the organization controls. That requirement shaped how Cohere built its products — deployable anywhere, runnable on private infrastructure.

What’s changed, according to Cohere co-founder Nick Frosst, is who is asking those same questions.

“We’re now hearing similar concerns from developers,” Frosst tells The New Stack. “They’re starting to think of model access as infrastructure, and infrastructure should be something you own and control. That is an extension of sovereignty.”

North Mini Code is a direct response to that demand. It’s a 30-billion-parameter Mixture of Experts (MoE) model with just 3 billion active parameters and is designed for agentic coding tasks: the kind of multi-step, tool-using work that coding agents like Claude Code and Cursor are built around.

Cohere says it runs on a single Nvidia H100 GPU, making self-hosting practical without a larger multi-GPU deployment. Developers who would rather not manage their own infrastructure can access it via API instead.
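The single-GPU claim can be sanity-checked with back-of-the-envelope arithmetic. The sketch below is illustrative only: the article does not say what numeric precision Cohere serves the model at, and the 80 GB memory figure is an assumption based on the common H100 configuration. It also assumes that, as is usual for Mixture of Experts models, all 30 billion parameters must sit in GPU memory even though only 3 billion are active per token. It counts weights only and ignores the KV cache, activations, and runtime overhead.

```python
# Rough weight-memory estimate for a 30B-parameter MoE model on one GPU.
# All constants other than the two parameter counts are assumptions.

TOTAL_PARAMS = 30e9    # from the article: 30 billion total parameters
ACTIVE_PARAMS = 3e9    # from the article: 3 billion active per token
GPU_MEMORY_GB = 80     # assumption: 80 GB H100 variant

# Assumption: common serving precisions; the article does not name one.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory needed to hold the weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


def estimate(total_params: float = TOTAL_PARAMS,
             gpu_memory_gb: float = GPU_MEMORY_GB) -> dict:
    """Weights footprint and leftover memory for each assumed precision."""
    rows = {}
    for name, nbytes in BYTES_PER_PARAM.items():
        weights = weight_memory_gb(total_params, nbytes)
        rows[name] = {
            "weights_gb": weights,
            "headroom_gb": gpu_memory_gb - weights,
        }
    return rows


# Share of the weights that does work on any given token.
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS

if __name__ == "__main__":
    for name, row in estimate().items():
        print(f"{name}: weights {row['weights_gb']:.0f} GB, "
              f"headroom {row['headroom_gb']:.0f} GB")
    print(f"active fraction per token: {active_fraction:.0%}")
```

Under these assumptions, 16-bit weights would take about 60 GB, which leaves roughly 20 GB on an 80 GB card for everything else. The small active fraction is what helps speed: memory use scales with the total parameter count, while per-token compute scales with the active one.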

“We want to give developers a capable, fast, open-weight model they can run locally on their own terms, and that fits in their compute environments,” Frosst says.

Cohere claims North Mini Code outperforms comparable open-weight models, including Alibaba’s Qwen3 and Google’s Gemma 4, on the Artificial Analysis Coding Index, where it scores 33.4. The company also says the model delivers up to 2.8x higher output throughput than Mistral’s Devstral Small 2 on identical hardware.

Cohere’s own benchmark testing shows North Mini Code leading on terminal and code generation tasks, but results are mixed across the full evaluation suite: Qwen 3.6 is ahead on SWE-Bench Verified and LiveCodeBench v6, as Cohere’s chart illustrates. Those comparisons are based on Cohere’s own testing and should be taken as indicative rather than independently verified.

North Mini Code’s performance in agentic software engineering and terminal tasks, along with complex code generation benchmarks, compared to leading open-source models of a similar size. (Credit: Cohere)

A growing club

Cohere’s timing puts it alongside a growing group of international companies that have made open-weight coding models a deliberate product choice. Mistral, the Paris-based AI company, launched Devstral in May 2025 — its first dedicated agentic coding model, also under Apache 2.0 — and followed it with Devstral 2 in December. JetBrains, the Czech developer tools company, recently open-sourced Mellum2, its second-generation coding model.

The emphasis differs. Mistral has explicitly linked open weights to AI sovereignty and the ability to deploy models on private infrastructure, while JetBrains focuses on latency, cost and deployment flexibility. In practice, both approaches give developers and enterprises more control over where models run and how they are operated.

Owning the infrastructure

The appetite for open-weight alternatives to frontier models is clearly there. AI agent platform Lindy recently announced it had moved 100% of its inference traffic from Anthropic to China’s DeepSeek, saying the switch would save the company millions while actually improving performance on its core use cases. Lindy’s CEO Flo Crivello addressed the obvious question about routing through a Chinese-developed model: the company uses Atlas Cloud, a US-based inference provider that hosts DeepSeek on American soil. The open-weight nature of DeepSeek made that possible — the model can be hosted by any provider, in any jurisdiction.

That’s precisely the dynamic Frosst is pointing to. Open weights give developers optionality that a proprietary API does not: the ability to choose where the model runs, who operates it, and under what terms. For companies whose inference bill has grown to exceed payroll — as Crivello noted is the case at Lindy — those are decisions with real commercial consequences.

Cohere’s Command family — its flagship line of enterprise models built for agentic, multilingual, and multimodal tasks — had previously shipped as open-weight models under more restrictive licenses. With Command A+, the company moved to Apache 2.0 in May, making the legal terms around use and redistribution significantly more permissive.

Frosst draws a direct line between the enterprise sovereignty argument Cohere has made for years and the thinking behind North Mini Code. The open-source coding model, he says, is a response to the same concentration problem Cohere saw in enterprise AI — only now playing out at the developer layer.

“Open-source development was concentrated in a small number of jurisdictions, and organizations running critical infrastructure had no reliable alternative,” Frosst says. “North Mini Code extends that thinking to the developer layer. As coding agents become the infrastructure software engineering runs on, whoever controls those systems controls how they work, how they evolve, and what they’re optimized for. We think that developers and enterprises should be in control.”
