惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

L
Lohrmann on Cybersecurity
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
Recorded Future
Recorded Future
S
Schneier on Security
I
Intezer
Latest news
Latest news
N
News and Events Feed by Topic
Scott Helme
Scott Helme
T
Threat Research - Cisco Blogs
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
U
Unit 42
量子位
博客园 - 【当耐特】
S
Security @ Cisco Blogs
Google Online Security Blog
Google Online Security Blog
博客园 - 叶小钗
酷 壳 – CoolShell
酷 壳 – CoolShell
NISL@THU
NISL@THU
The Cloudflare Blog
李成银的技术随笔
T
ThreatConnect
L
LINUX DO - 最新话题
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
有赞技术团队
有赞技术团队
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Jina AI
Jina AI
T
Tor Project blog
The Hacker News
The Hacker News
人人都是产品经理
人人都是产品经理
小众软件
小众软件
S
Security Archives - TechRepublic
美团技术团队
博客园 - Franky
Security Latest
Security Latest
J
Java Code Geeks
P
Proofpoint News Feed
V
V2EX
The GitHub Blog
The GitHub Blog
WordPress大学
WordPress大学
Application and Cybersecurity Blog
Application and Cybersecurity Blog
H
Help Net Security
PCI Perspectives
PCI Perspectives
Cyberwarzone
Cyberwarzone
Hugging Face - Blog
Hugging Face - Blog
N
Netflix TechBlog - Medium
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
SecWiki News
SecWiki News
腾讯CDC
爱范儿
爱范儿
D
Docker

Swift for Visual Studio Code comes to Open VSX Registry | InfoWorld

As AI speeds coding, CVE Lite CLI keeps security deliberately AI-free The role of MCP in context engineering AI still needs humans The sovereign cloud illusion Google folds CodeMender into agent ecosystem amid push for AI-led AppSec Salesforce extends its headless push into enterprise data via Informatica Microsoft releases open-source tools to operationalize AI agent safety AI at scale: What engineering teams are confronting Angular Signal Forms: From event pipelines to signal-driven state Google to unify AI coding tools under Antigravity GitHub admits major source code leak after 3,800 internal repositories breached Google launches Gemini 3.5 Flash to push AI agents deeper into enterprise workflows 9 application security startups combating AI risks Why I trust Claude Code Learning to trust Claude Code First look: Mojo 1.0 mixes Python and Rust AntV data visualization tool the latest to be hit by ongoing npm supply chain attacks GitHub scales back bug bounties, reminds users security is their responsibility too Anthropic acquires Stainless to strengthen Claude’s developer tooling Context graphs and decision traces to the rescue An AI data center in your home? What can you do with quantum computing today? Contexts graphs, AI memory, and enterprise knowledge: Are decision traces enough? Informatica and Salesforce move data platforms into the decision layer The new AI lock-in AWS boosts CloudWatch Logs query limits by 10x to ease debugging for developers, SREs 21 LLMs tuned for special domains AWS adds Advanced Prompt Optimization tool to Bedrock Capacity markets could reshape cloud computing Four cutting-edge tools for spec-driven development 4 cutting-edge tools for spec-driven development Anthropic puts Claude agents on a meter across its subscriptions Notion courts developers with a platform for AI agents and workflow automation Using continuous purple teaming to protect fast-paced enterprise environments A better way to work with SQL Server Evidence-driven workflows: Rethinking enterprise process design AWS debuts Graviton-powered Redshift RG instances to cut analytics costs SAP’s AI promises last year? Most are still rolling out First look: Lemonade serves up local AI with limitations AI is ready to take over Python programming, but not much else Mistral AI SDK, TanStack Router hit in npm software supply chain attack GitLab CEO sees developer tool bill increasing 100-fold Red Hat adds support for agentic AI development What’s new and exciting in JDK 26 Kill the loading spinner with local-first data and reactive SQL A networking revolution at AWS Tokenmaxxing is super dumb Hands-on with React, Supabase, and PowerSync Malicious Hugging Face model masquerading as OpenAI release hits 244K downloads How to add AI to an existing product (without annoying users) Your AI doesn’t need another database What happens when engineering teams reorganize around AI agents Python isn’t always easy When cloud giants meddle in markets 12 model-level deep cuts to slash AI training costs 13 new critical holes in JavaScript sandbox allow execution of arbitrary code The best new features in Python 3.15 Teradata launches platform for enterprise AI agents moving beyond pilots Three skills that matter when AI handles the coding MongoDB targets AI’s retrieval problem Building AI apps and agents with Microsoft Foundry Designing front-end systems for cloud failure No, AI won’t destroy software development jobs Supply-chain attacks take aim at your AI coding agents Oracle will patch more often to counter AI cybersecurity threat AI finds 20-year-old bugs in PostgreSQL and MariaDB Diskless databases: What happens when storage isn’t the bottleneck Vibe coding or spec-driven development? The agentic AI distraction Vibe coding or spec-driven development? How to choose Cloud providers are blinded by agentic AI SAP to acquire data lakehouse vendor Dremio Small language models: Rethinking enterprise AI architecture Making AI work through eval hygiene Improving AI agents through better evaluations Spotlight report: Transforming software development with AI AI in the cloud is easy but expensive Running AI in the cloud is easy – and expensive Are we ready to give AI agents the keys to the cloud? Cloudflare thinks so SAP npm package attack highlights risks in developer tools and CI/CD pipelines Making AI work for databases Harness teams of agentic coders with Squad Harness teams of coding agents with Squad Critical GitHub RCE bug exposed millions of repositories Oracle NetSuite announces AI coding skills for SuiteCloud developers Why it’s so hard to create stand-alone Python apps A new challenge for software product managers The hidden cost of front-end complexity More fake extensions linked to GlassWorm found in Open VSX code marketplace GitHub shifts Copilot to usage-based billing, signaling a new cost model for enterprise AI tools Xiaomi releases MIT‑licensed MiMo models for long‑running AI agents OpenAI’s Symphony spec pushes coding agents from prompts to orchestration The front-end architecture trilemma: Reactivity vs. hypermedia vs. local-first apps Enterprise AI is missing the business core The best JavaScript certifications for getting hired Google begins putting the guardrails on agentic AI Meta's compute grab continues with agreement to deploy tens of millions of AWS Graviton cores Germany's sovereign AI hope changes hands Former OpenAI research scientist launches new AI model for Tencent Why world models are AI’s next frontier
DeepSeek’s steep V4-Pro price cut escalates AI pricing war
2026-05-25 · via Swift for Visual Studio Code comes to Open VSX Registry | InfoWorld

Chinese AI startup DeepSeek has announced a steep price cut for its recently launched flagship AI model, V4-Pro. The company has reduced pricing for the model by 75%, just a month after unveiling the V4 generation, which includes V4 Pro and V4 Flash.

Earlier, usage costs ranged from $0.0145 for one million tokens (cache hit) to $3.48 for one million output tokens. Following the revision, the V4 Pro will now cost starting at $0.003625 per million tokens and going up to $0.87 per million tokens, respectively. The Deepseek V4 Pro model API pricing will be officially adjusted to 1/4 of the original price after the 75% discount promotion ends on 2026/05/31 15:59 UTC, said the company.

“V4-Pro was engineered to cut the cost of long-context inference, reportedly running at roughly a quarter of the single-token compute and a tenth of the memory footprint of its predecessor at very long context. This is why the price cut is permanent rather than promotional. It is not a discount. It is an efficiency gain being passed through,” said Sanchit Vir Gogia, chief analyst and CEO at Greyhound Research.

DeepSeek narrows gap with Western AI rivals

Almost a year after introducing its R1 reasoning model offering performance and cost efficiency, DeepSeek released the preview of V4 LLM. Similar to the earlier models, even V4 is open source, which allows developers to download the code to run it locally and even modify it. The new models were optimized for use with popular agent tools such as Anthropic’s Claude Code and OpenClaw.

“From a pure capabilities perspective, DeepSeek V4-Pro has effectively closed the performance gap on critical tasks like complex math and reasoning, while aggressively leading the market on openness and inference costs. Its specialized reasoning modes and architectural enhancements make it a formidable alternative to Western frontier models,” said Neil Shah, vice president at Counterpoint Research. However, its primary limitations aren’t found in its raw intelligence; rather, it lags behind Western rivals on broader ecosystem adoption, global support structures, clear IP provenance, and the deep and secure hyperscaler integrations natively offered by AWS, Microsoft, and Google, he added.

Lower costs, better ROI

As inference costs remain one of the biggest barriers to scaling pilots into organization-wide deployments, DeepSeek’s aggressive discounts could translate into substantial savings for enterprises, say experts.

The first wave of enterprise AI was full of impressive demonstrations and uncomfortable invoices. CIOs learnt quickly that the cost of AI was never just the model call but included retrieval, orchestration, and more, added Gogia.

However, the 75% cut is meaningful only if CIOs can actually access it at scale.

“For most enterprises, the relevant comparison is not DeepSeek’s direct API but the cost of running a local deployment versus using any external inference provider. If a CIO can host DeepSeek V4-Pro on their own infrastructure, inference costs drop dramatically, and many projects that were previously uneconomical at scale become viable. That includes always-on copilots, bulk document review, code generation, L1 support, and multi-agent workflows,” explained Amit Jaju, senior managing director at Ankura Consulting. He added that if the model is consumed through third-party providers, the effective rate may be higher and the ROI benefit smaller.

AI pricing pressure to intensify

DeepSeek’s discounted pricing strategy is likely to intensify pressure on major AI vendors whose models often command premium enterprise pricing. This could lead vendors such as OpenAI, Anthropic, and Google to respond with better packages.

Shah noted high-margin, high-consumption token pricing models from Anthropic and OpenAI are becoming harder to justify for many enterprise workloads and workflows. The presence of a viable open-weights alternative gives enterprise buyers decent leverage. This will likely prompt these premium flagship western AI labs to gradually shift from basic consumption-based pricing toward more defensible, outcome-oriented or value-based monetization models.

Consequently, CIOs will also adopt a multi-model AI strategy, similar to migration to multi-cloud architectures. “This will result in an AI portfolio architecture where premium models will be for high-stakes work, domain models for specialist tasks, smaller models for repeatable execution, and an orchestration layer to route, log, govern, and monitor the whole estate,” added Gogia.

CIOs must proceed cautiously

Despite the cost advantages DeepSeek offers, CIOs should remain cautious when evaluating Chinese-origin AI models and carefully assess risks around sensitive data exposure, regulatory compliance, and geopolitical dependency.

Jaju added that the primary risk is data sovereignty and cross-border exposure. If CIOs rely on external APIs hosted in China, prompts, documents, embeddings, logs, and telemetry can leave the enterprise perimeter and traverse jurisdictions with different legal regimes.

Another big risk is IP leakage as developers may paste source code, product designs, legal drafts, M&A material, or incident data into model workflows. If the model is external, that data can be stored, used for training, or exposed through logs or plugins, he added.

Jaju highlighted that the third risk is regulatory defensibility. CIOs need clarity on where data is processed, what is retained, who can access it, what contractual protections exist, whether the model can be self-hosted, and how outputs can be audited.

Experts warn that the safest way will be to host DeepSeek locally or in a sovereign cloud under enterprise control, with encryption, access controls, and audit trails.