惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
C
CXSECURITY Database RSS Feed - CXSecurity.com
博客园_首页
H
Hackread – Cybersecurity News, Data Breaches, AI and More
T
ThreatConnect
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
博客园 - 聂微东
H
Help Net Security
T
Threat Research - Cisco Blogs
Blog — PlanetScale
Blog — PlanetScale
A
Arctic Wolf
G
Google Developers Blog
量子位
U
Unit 42
I
InfoQ
V
V2EX
F
Fox-IT International blog
P
Privacy & Cybersecurity Law Blog
V
Visual Studio Blog
J
Java Code Geeks
大猫的无限游戏
大猫的无限游戏
C
CERT Recently Published Vulnerability Notes
博客园 - 三生石上(FineUI控件)
T
The Exploit Database - CXSecurity.com
T
Tailwind CSS Blog
SecWiki News
SecWiki News
Know Your Adversary
Know Your Adversary
MyScale Blog
MyScale Blog
宝玉的分享
宝玉的分享
The Hacker News
The Hacker News
Project Zero
Project Zero
Application and Cybersecurity Blog
Application and Cybersecurity Blog
月光博客
月光博客
Recent Commits to openclaw:main
Recent Commits to openclaw:main
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
G
GRAHAM CLULEY
C
Cisco Blogs
I
Intezer
Simon Willison's Weblog
Simon Willison's Weblog
O
OpenAI News
Recorded Future
Recorded Future
T
Tenable Blog
W
WeLiveSecurity
腾讯CDC
Stack Overflow Blog
Stack Overflow Blog
T
The Blog of Author Tim Ferriss
www.infosecurity-magazine.com
www.infosecurity-magazine.com
D
Docker
C
Cybersecurity and Infrastructure Security Agency CISA
PCI Perspectives
PCI Perspectives

... eeNews Europe

Toshiba SiC MOSFET samples support 800V AI data centres NASA picks Astrolab for Artemis lunar rover mission Marvell posts record Q1 revenue on AI data centre demand AMD unveils compact Versal Prime Gen 2 devices with high scalar compute Dual-band GNSS antenna from Taoglas for L1/L5 Infineon trims PCB space for CoolGaN power switching TDK launches stray-field immune Hall sensor for EV motors SEMI and NSF expand US microelectronics talent initiative EU invests €400m in electrified industrial heat systems System Check: Which bench instruments do you use? The Disappearing Battery? How Ultra-Thin Supercapacitors Are Reshaping Wireless IoT And Sensors Graphical design of signal switching and cabling in test systems eases signal path development, cuts costs and boosts accuracy Schneider Electric urges faster EU electrification drive ST CEO conference webcast set for June 2 TDK expands micro POL lineup for AI edge and optical modules AI boosts the cable market Huawei 1.4 nm target leans on Tau Scaling Law Infineon launches Moore4Power to boost Europe’s power electronics ambitions IBM expands AI security push with Project Glasswing Keysight reports record revenue with orders topping $2bn AI weather forecasting boosts energy trading at Spire IBM targets US quantum manufacturing with $1bn foundry plan Teledyne CCD370 sensors launch on SMILE mission EU backs €1.3bn hydrogen push in Germany EU approves €288m aid for German chip supply chain projects Z-Wave Alliance adds Semtech to board HighTec and SiFive strengthen RISC-V safety tools European Energy launches Italy’s largest agrivoltaic project in Sicily Nvidia revenue jumps 85% on AI demand Splunk report: Global downtime costs surge to $600bn a year Spintronic memory switches in 40 ps The “Elektor Industry/eeNews Europe” YouTube channel surpasses 90,000 subscribers SEMI FlexTech calls for proposals on flexible hybrid electronics Ericsson and Net Feasa target smart shipping with 5G and agentic AI ROHM configurable PMICs support automotive SoC power design Diodes launches PCIe 7.0 clock generator Microchip timing module supports AI data centre and 5G synchronization AI drives MLCC shortage MIT App Inventor for AI and IoT POET lands $400m to ramp AI photonics production System Check: Share your opinion on version control and Git Webinar: Spurious Emission Testing for Amateur Radio Transmitters AI energy hub planned by Mitsubishi and Tallgrass Siemens expands rail technology portfolio with MERMEC acquisition ASML and Tata Electronics partner on India semiconductor fab Mobile DRAM prices squeeze smartphone production SEMI and semiconductor companies call for extended investment credit MaxLinear targets AI driven 5G backhaul with Trinity platform Infineon adds 2300 V SiC modules for energy systems Elektor Lab Talk #47: Max Imagination on Drones, Robots, and Outdoor Electronics The risks to Europe’s space push IREN targets European AI growth with Spain deal NVIDIA appoints Suzanne Nora Johnson to board amid AI growth Advantech EdgeView adds low-code SCADA visualization Physical AI BoF from MIPI invites robotics companies Nebius starts work on Missouri AI factory campus Incedo targets AI native telecom growth in Canada Infineon challenge puts humanoid robotics in focus Red Hat AI and OpenShift services land on IBM Cloud Ericsson and KDDI advance autonomous networks with AI uplink trial TSMC and Applied Materials team up on AI chip manufacturing R&D Ouster adds native color LiDAR to Rev8 OS family System Check: Which wireless protocol do you prefer? Molex Teramount deal targets co-packaged optics NVIDIA and ServiceNow extend AI governance from desktops to data centres Faraday Future launches Physical AI robotics institute with BIBS SiTime targets AI data centers with sub-nanosecond timing chip Semiconductor industry heads for $1tn in 2026 GENE-26.5 brings dexterous AI robotics into focus System Check: Should engineers learn analog? Decoupled by Design: How Gateworks and NXP are rethinking edge AI architecture NVIDIA and Corning partner on AI photonics expansion GCT taps satellite partner to speed 5G rollout Sodankylä supersite to support ESA Earth observation SiTime posts 88% revenue growth on AI infrastructure demand Infrared LEDs support in-cabin sensing for vehicle safety Anthropic compute deal taps SpaceXAI Colossus 1 Quantum Brilliance CEO Mark Luo on deployable quantum systems and the future of diamond-based computing SEMI Summit spotlights Europe’s chiplet and packaging push AI data center infrastructure drives Pennsylvania energy expansion NXP CoreRide gains Vector software support for SDV platforms Elektor Lab Talk covers Red Pitaya and reconfigurable test gear SEMI names Julie Rogers to lead ESD Alliance ASML CEO backs joint call for Europe tech competitiveness push FlexIC RFID inlays bring NFC to paper packaging ROHM targets smart rings with ultra-compact NFC wireless power chipset China silicon wafers push boosts Eswin capacity ESD Alliance outlook spotlights agentic AI in chip design AI robotics sales growth rises as Faraday Future expands into education Microchip expands dsPIC33A controllers for AI data center power sensiBel MEMS microphone heads to Silex production SEMI: Global silicon wafer shipments jump 13% on AI demand AI drives photonics innovation Advantech adds Intel Core Series 3 to edge AI systems NI CHESS enables software-driven RF channel emulation into aerospace testing Forsee Power battery system powers new electric fire pump Advantech brings agentic AI to Jetson Thor edge platforms Rohde & Schwarz adds Pulsar signal simulation for LEO navigation UL Solutions builds new testing lab in Germany IonQ and Florida LambdaRail roll out US quantum-safe network initiative
AI token costs force rethink at Uber and Microsoft
2026-05-29 · via ... eeNews Europe

AI token costs force rethink at Uber and Microsoft

Business news |

By Brian Tristam Williams




AI token costs are becoming harder to treat as a rounding error, as agentic coding tools and enterprise AI workflows push usage from simple prompts into long, multi-step inference jobs.

Goldman Sachs Research says agentic AI could drive a 24-fold increase in token consumption by 2030, reaching 120 quadrillion tokens per month as consumer and enterprise adoption grows. The bank’s analysis, published earlier this month, argues that the same trend could improve the economics of hyperscalers and model providers if inference costs keep falling faster than demand rises.

AI token costs move from hype to budget line

The problem for customers is that lower unit costs do not automatically mean lower bills. Agentic tools can call models repeatedly, review context, generate code, run checks, and revise their own output. That turns a single developer request into a chain of token-consuming actions. This is why token-based billing is becoming a practical issue for engineering organisations rather than a narrow cloud-infrastructure concern.

Uber has become one of the more visible examples. The company is reassessing parts of its AI spending after reports that its 2026 AI budget had been exhausted within the first few months of the year. Uber president and COO Andrew Macdonald has said the company does not yet see a clear link between higher token consumption and more useful consumer-facing features. That does not mean the tools are useless, but it does make the cost-benefit argument less automatic.

Microsoft is facing a related issue inside its own engineering operations. The company is reportedly winding down most internal Claude Code licences for parts of its Experiences + Devices group and steering developers towards GitHub Copilot CLI by the end of June. Separately, GitHub has announced that Copilot plans will move to usage-based billing from 1 June 2026, with GitHub AI Credits consumed according to token usage across input, output, and cached tokens.

AI token costs expose the hardware gap

The Goldman Sachs view is not simply bearish. It expects semiconductor providers to cut inference cost per token by 60% to 70% per year through chip and architecture improvements. It also expects chip supply to remain constrained for the next 12 to 18 months as production capacity catches up with the pace of new AI use cases.

That makes the story relevant well beyond software procurement. If agents become a default interface for coding, customer service, search, and enterprise workflow automation, the load shifts back into datacentre silicon, networking, memory, storage, and power infrastructure. As previously reported by eeNews Europe when ARM set out its datacentre CPU plans, agentic AI is already being used to justify new processor strategies for AI datacentres.

The immediate lesson is more prosaic. Businesses are being pushed to measure AI against shipped features, resolved support cases, reduced engineering time, or revenue impact, not against token volume. AI token costs may fall at the hardware level, but agentic workflows can easily spend the savings before finance teams see them.

If you enjoyed this article, you will like the following ones: don't miss them by subscribing to :

   eeNews on Google News


Linked Articles