惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

L
LangChain Blog
Martin Fowler
Martin Fowler
P
Palo Alto Networks Blog
MongoDB | Blog
MongoDB | Blog
A
About on SuperTechFans
Google DeepMind News
Google DeepMind News
博客园_首页
量子位
小众软件
小众软件
F
Full Disclosure
Vercel News
Vercel News
爱范儿
爱范儿
Engineering at Meta
Engineering at Meta
F
Fortinet All Blogs
博客园 - 聂微东
V
V2EX
Blog — PlanetScale
Blog — PlanetScale
罗磊的独立博客
WordPress大学
WordPress大学
D
Darknet – Hacking Tools, Hacker News & Cyber Security
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
T
Tor Project blog
Google DeepMind News
Google DeepMind News
M
MIT News - Artificial intelligence
L
Lohrmann on Cybersecurity
H
Hacker News: Front Page
Spread Privacy
Spread Privacy
AI
AI
C
Cyber Attacks, Cyber Crime and Cyber Security
C
CERT Recently Published Vulnerability Notes
D
Docker
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Recorded Future
Recorded Future
L
LINUX DO - 热门话题
Microsoft Azure Blog
Microsoft Azure Blog
Recent Commits to openclaw:main
Recent Commits to openclaw:main
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Latest news
Latest news
W
WeLiveSecurity
Application and Cybersecurity Blog
Application and Cybersecurity Blog
博客园 - 司徒正美
博客园 - 叶小钗
T
Threat Research - Cisco Blogs
P
Privacy International News Feed
O
OpenAI News
Help Net Security
Help Net Security
aimingoo的专栏
aimingoo的专栏
宝玉的分享
宝玉的分享
博客园 - Franky

The Register

Shadow IT has given way to shadow AI. Enter AI-BOMs Zed team releases version 1.0 of Rust-built editor: Traditional editor and AI tool Microsoft boss tells investors the company is working to 'win back fans' What type of 'C2 on a sleep cycle' do they leave behind? Novel Chinese spy group found in critical networks in Poland, Asia NASA boss: Make Pluto A Planet Again GitHub says sorry and vows to do better as uptime slips and devs complain Age checks could turn internet into an ID checkpoint, complains Proton CEO Microsoft gives your Word documents an AI co-author you didn’t ask for Datadog digs down into GPU efficiency as AI costs soar If malware via monitor cables is a matter of national security, this might be the gadget for you Thunderbird in hand worth 2 Outlooks as fresh FOSS fave and Firefox arrive Grafana offers AI assistant for free, warns users not to go mad Right to repair champ Framework punts modular 13in laptop with Core Ultra Series 3 France's 'Secure' ID agency probes breach as crooks claim 19M records Scotland Yard can keep using live facial recognition on Londoners, say judges UK tribunal sends £2B claim accusing Microsoft of overcharging for licensing to trial Nation-states want to cause harm, not just steal cash - stop handing your cyber defenses to the cheapest contractor Murder, she wrote: Ex-FBI chief wants some ransomware crims charged with homicide Phone-to-satellite use goes into orbit, growing 25% in 8 months macOS ClickFix attacks deliver AppleScript stealers to snarf credentials, wallets Anthropic bakes memory fixes into Bun 1.1.13 as developers complain of leaks The spaghettified DBMS chart that shows Oracle's crown is slowly slipping Yet another ex-ransomware negotiator admits turning rogue after payoff from crimelords FAA grounds Blue Origin's New Glenn as it probes missed satellite delivery 'mishap' AMD's Ryzen 9 9950X3D2 Dual Edition tested: Gratuitous overkill with a price to match AI-assisted intruders pwned Vercel via OAuth abuse and a pilfered employee account Crook claims to leak 'video surveillance footage' of companies Met police trials snoop tech platform in push to cuff more London shoplifters England's school phone ban gets teeth, just in time to bite no one Adaptavist Group breach spawns imposter emails as ransomware crew claims mega-haul Panasonic creates device-locked QR codes to speed facial biometric capture Iran claims US used backdoors to knock out networking equipment during war NASA Inspector fears new spacesuits won’t be ready for Moon landing Vibe coding upstart Lovable denies data leak, cites 'intentional behavior,' then throws HackerOne under the bus Trump-branded datacenter project fails to make itself great, again World's blandest man steps down from CEO job to spend more time in tastefully appointed home Chase got a spiff of $77 million to create one job with New York datacenter Scot becomes second Scattered Spider-linked crook to plead guilty in US You too can build a nuclear battery from junk you have lying around the house Schmoozebots: study finds flattery will get AI everywhere One of Europe's sovereign cloud picks may not be so-sovereign after all New Android development tool designed for robots, not humans AI is reshaping Britain's datacenter map away from London HP's remote desktop push retreats as Anyware heads for end of life 'Invisible mouse' made a mess of PC rebuild NASA working on ‘Big Bang’ upgrade to keep the Voyagers alive for longer Indonesia’s game rating system paused amid claims it leaked developer creds and glimpses of major new titles Just like phishing for gullible humans, prompt injecting AIs is here to stay Atlassian’s new data collection policy protects rich customers while AI eats the rest Intel eases reliance on TSMC with 'Merica-made Core Series 3 processors NASA gets the ball rolling on its part in Europe's jinxed Mars rover mission Attention data hoarders: Alexa loses its Plex appeal as voice feature gets canned Locked-out iPhone user tells The Reg that Apple is scrambling to fix character flaw passcode bug Would you like fries with that terminal? Capita won disastrous UK pensions gig after acing performance checks NodeWeaver says its perpetual licensing beats VMware’s perpetual price hikes Maine to pause big bit barns as local opposition spreads If you want into Anthropic's Claude club, you may have to show ID DuckDB uses RDBMS to tackle lakehouse 'small changes' issue Iran has something America can only dream of: cheap broadband Brussels tells Google to hand rivals its search crown jewels as privacy row brews Visual Studio 18.5 lands with AI debugging at a price Git identity spoof fools Claude into giving bad code the nod McGraw Hill linked to 13.5M-record data leak Microsoft announces product it doesn't want anyone to buy Obsolete Google nag drowns out vital bar information at Swedish concert hall Cops hand Motorola £25M to keep 2000-era radios alive Server-room lock was nothing but a crock QUIC will soon be as important as TCP – but it's vastly different Nobody knows how many CVEs Anthropic's Project Glasswing has actually found Allbirds shoe company moving to AI infra is the top 20-year-old Enlightenment E16 bug finally gets patched Bad teacher bots can leave hidden marks on model students Autovista blames ransomware for service disruption Networks not ready for the challenges of AI traffic Windows takes a crash dump after one McDonald's too many French cops free mother and son after crypto kidnapping US states can't account for datacenter tax breaks. Literally Salesforce debuts Headless 360 agentic platform Fission impossible: Uncle Sam wants nuclear power in space UK told its Big Tech habit is now a national security risk UKAEA lays out roadmap to take Britain closer to fusion Waymo's self-driving cars face their toughest test yet: London The only technology that died more times than VR is AI, and that seems to have worked out Boeing soars past Airbus for the first time in years Commvault has a Ctrl+Z for rogue AI agents Nvidia slaps forehead: AI, that's what quantum needs! Oracle taps Bloom for fuel cells to support datacenter binge GitHub recalls Phabricator with preview of Stacked PRs Physicist proposes two-button calculator Amazon pays $11.5B to satisfy satellite-envy while cowering in Musk's shadow No honor among thieves as 0APT threatens rival ransomware gang Krybit NASA insiders oddly relaxed about latest budget threats Microsoft raises UK Surface prices as RAM crisis reaches the checkout OpenAI CEO Sam Altman home attack suspect charged Microsoft kills off Outlook Lite as memory costs skyrocket UK state bank considers lengthening disastrous IT program Japan going back to the future by reviving its chip industry Windows Update: Torture chamber for seldom-used PCs Japanese rocket came unglued, causing mission fail
Inside the cloud
Arm · 2026-06-16 · via The Register

When Spotify evaluated its cloud compute options, it needed more than incremental improvements. Its recommendation engine delivers real-time suggestions to millions of users around the clock, placing heavy demands on compute infrastructure while requiring tight control over energy use and costs. During its evaluation of next-generation cloud processors, Spotify found that workloads running on Google Cloud Axion processors built on Arm architecture delivered roughly 250 percent better performance.

Axion is just a part of a broader shift toward Arm-based compute built on the Neoverse architecture, which has been adopted across all major hyperscale cloud platforms. AWS reports that its Arm-based Graviton processors have accounted for over half of new CPU capacity deployed over the past three years. Microsoft and Google have followed with their own Arm-based designs, including Azure Cobalt and Axion, while NVIDIA’s Grace and Vera signal that it sees Arm as central to the future of AI infrastructure. Now about half of the compute shipped to top hyperscalers are Arm-based platforms.

Purpose-built for customers

Hyperscalers are not only deploying Arm processors but also designing silicon and infrastructure together to reflect real usage patterns. Ninety-eight percent of top 1,000 Amazon EC2 customers running production workloads on Graviton and benefit from Graviton’s price–performance advantages compared to x86. The new Cobalt 200 processor, built on Arm Neoverse technology, was engineered using telemetry from real Azure workloads and an internal suite of benchmark variants to reflect production behavior. Google is pursuing its own strategy with Axion processors, with C4A instances delivering up to 65 percent better price-performance and up to 60 percent greater energy efficiency than comparable x86 systems.

At the core of this shift is Arm’s Neoverse platform, a datacenter–focused architecture designed to enable high-performance, energy-efficient compute at hyperscale. Neoverse marks Arm’s evolution from a mobile-first architecture to a platform purpose-built for cloud and AI infrastructure. It provides the common foundation hyperscalers use to design custom silicon optimized for their own workloads, allowing providers to tailor performance, power, and system behavior to meet specific application demands.

While this momentum is driven by hyperscaler adoption, it is rooted in a broader change in how compute infrastructure must operate to support AI workloads. Traditional enterprise workloads emphasized predictable CPU utilization and storage throughput. AI changes that equation. Modern workloads require simultaneous optimization across training, inference, networking, and storage performance while minimizing energy consumption and latency. Even minor inefficiencies can become costly at scale.

Power consumption now represents a significant portion of datacenter operating costs, which means performance per watt has become a primary design metric. According to an IDC report AI-ready datacenters are seeing rapid increases in power density, with rack requirements rising from typical levels of 5–10 kW to 30 kW or more, and in some cases exceeding 100 kW per rack. These constraints are forcing organizations to rethink how compute, networking, storage, and cooling systems are designed and integrated at the rack-level

These pressures are also collapsing traditional boundaries between compute, networking, storage, and acceleration, creating tightly integrated systems optimized for end-to-end performance. This is driving cloud providers to adopt purpose-built silicon and architectures designed specifically for modern workloads.

Real-world efficiency gains drive adoption

These design choices are translating into measurable improvements in production environments. Organizations migrating workloads to Arm-based infrastructure are reporting gains across performance, efficiency, and cost:

Databricks is using Azure Cobalt 100 virtual machines, built on Microsoft’s Arm-based CPU architecture, which are designed to optimize data-intensive and AI workloads. and deliver up to 50 percent better price-performance compared to previous generations, along with improvements in query speed and latency for analytics applications. For organizations running large-scale data pipelines to power machine learning and business intelligence workloads, these gains translate directly into faster processing and lower infrastructure costs.

Pinterest provides a clear example of how Arm adoption can improve both cost efficiency and sustainability at scale. As a platform serving more than half a billion monthly active users and running AI-driven discovery workloads, Pinterest relies heavily on large-scale cloud infrastructure. By migrating workloads to AWS Graviton–based instances, the company achieved 38 percent savings on compute resources and 47 percent cost savings for key workloads, while also reducing carbon emissions by 62 percent. These improvements support both performance and sustainability goals, showing how infrastructure decisions can directly impact operational efficiency and environmental footprint.

Uber’s transition to a multi-architecture environment highlights the operational realities of adopting Arm at scale. The company migrated more than 2,800 services and shifted nearly 20 percent of its infrastructure capacity from x86 to Arm-based processors, requiring updates to codebases, dependencies, and deployment pipelines. Through phased rollout, benchmarking, and continuous monitoring, Uber demonstrated that Arm can coexist with other architectures while improving price-performance and supporting a more flexible, efficient infrastructure model.

Atlassian’s migration of Jira and Confluence to AWS Graviton highlights how Arm adoption can improve performance and efficiency at enterprise scale. The company moved more than 3,000 instances to Graviton-based infrastructure, achieving the transition with minimal impact on users. In production, instance counts dropped by around 30 percent, while throughput improved by up to 30 percent and latency decreased across key metrics. These gains demonstrate how optimizing infrastructure for performance per watt can enhance both user experience and cost efficiency at scale.

These improvements span media streaming, data platforms, and large-scale consumer services, where gains in latency, throughput, and compute efficiency translate directly into lower infrastructure costs and improved user experience. They are particularly significant for AI inference, real-time personalization, and continuously running workloads.

The converged AI datacenter

The rise of agentic AI is transforming the datacenter into an integrated system in which CPUs, accelerators, networking, and storage operate as a unified platform. In these environments, CPUs serve as the control plane, coordinating scheduling, data movement, memory access, and system services, while accelerators handle compute-intensive training and inference tasks.

In this model, efficiency is measured across the entire rack and datacenter footprint. AI workloads demand higher compute density while operating within fixed power and cooling limits, making the ability to maximize compute output per unit of space increasingly important. Coordinating CPUs, accelerators, memory, and networking as a unified system reduces bottlenecks and minimizes wasted energy from unnecessary data movement.

Arm’s architecture spans these layers, enabling providers to optimize the full stack while maintaining software compatibility and ecosystem consistency. This cohesion is driving the emergence of the converged AI datacenter, where CPUs and accelerators are central to the trend.

NVIDIA’s Grace Blackwell and Vera Rubin platforms combine Arm CPUs with high-performance GPU accelerators in rack-level solutions reflecting a broader industry move toward tightly integrated AI systems. In an other example, AWS with Trainium3 UltraServers, pairs Arm-based Graviton CPUs with Trainium accelerators and Nitro networking components to support large-scale AI workloads. Similarly, Google’s latest TPU 8t and TPU 8i training and inference superpods are powered by Arm-based Axion CPUs, extending this trend toward purpose-built AI infrastructure optimized for scale, performance, and efficiency.

In these architectures, Arm-based CPUs serve as the control layer, orchestrating data flow between accelerators, memory, and networking while simplifying development and driving optimization across software stacks and developer tooling.

Migration realities: less friction than before

Migration complexity has historically slowed adoption of new architectures. Today, improved tooling and ecosystem maturity are lowering that barrier. The Arm MCP Server integrates migration tools, compatibility checks, and performance analysis directly into AI-assisted workflows, helping developers analyze codebases, validate dependencies, and build multi-architecture environments.

Programs such as the Arm Cloud Migration Program are also helping organizations accelerate this transition by providing guidance, validation, and tooling for production workloads.

Arm adoption is supported by expanding software compatibility and platform support. Arm-based environments now support major Linux distributions, container platforms, and modern development frameworks. The ecosystem has matured significantly, enabling developers to focus less on compatibility and more on performance optimization. Arm’s ecosystem now spans more than 22 million developers worldwide.

For developers, this shift means building and optimizing applications for multi-architecture environments, with greater emphasis on efficiency, concurrency, and performance tuning.

Where cloud compute is heading

Purpose-built compute is becoming the default model for AI era infrastructure. As performance improvements outpace increases in power consumption and cost, the economics of cloud computing are shifting toward efficiency-driven architectures.

Looking ahead, this evolution is also extending to enterprise environments. Arm’s recently introduced Arm AGI CPU is designed specifically for the next generation of AI-driven workloads, combining high single-thread performance with scalable throughput, compute density and rack level efficiency. Built on the Neoverse platform, it reflects the shift toward Arm CPUs that are not only optimized for general-purpose compute, but also engineered to orchestrate increasingly complex, agentic AI systems across the datacenter.

Enterprises are increasingly evaluating infrastructure based on cost per workload, energy consumption, and the ability to scale within power and cooling constraints. This is driving demand for architectures that deliver predictable performance and efficiency across diverse workloads.

Arm Neoverse’s growing momentum across hyperscalers, silicon vendors, and ecosystem partners reflects a broader realignment around efficiency, scalability, and system-level optimization. As AI workloads expand, infrastructure decisions will be shaped less by raw compute capacity and more by how efficiently systems can deliver performance at scale.

The organizations redesigning cloud infrastructure today are not simply choosing new processors; they are adopting a compute foundation built for the demands of the AI era.

Sponsored by Arm.