惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Fox-IT International blog
Recent Announcements
Recent Announcements
D
Docker
IT之家
IT之家
B
Blog
Jina AI
Jina AI
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 【当耐特】
Google DeepMind News
Google DeepMind News
F
Fortinet All Blogs
量子位
C
Check Point Blog
Microsoft Azure Blog
Microsoft Azure Blog
罗磊的独立博客
博客园 - 司徒正美
李成银的技术随笔
美团技术团队
Blog — PlanetScale
Blog — PlanetScale
雷峰网
雷峰网
The GitHub Blog
The GitHub Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
J
Java Code Geeks
T
The Blog of Author Tim Ferriss
酷 壳 – CoolShell
酷 壳 – CoolShell
MongoDB | Blog
MongoDB | Blog
P
Proofpoint News Feed
L
LangChain Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Y
Y Combinator Blog
大猫的无限游戏
大猫的无限游戏
有赞技术团队
有赞技术团队
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
V
Visual Studio Blog
T
Tailwind CSS Blog
H
Help Net Security
Engineering at Meta
Engineering at Meta
小众软件
小众软件
B
Blog RSS Feed
Stack Overflow Blog
Stack Overflow Blog
月光博客
月光博客
M
Microsoft Research Blog - Microsoft Research
宝玉的分享
宝玉的分享
人人都是产品经理
人人都是产品经理
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
GbyAI
GbyAI
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Last Week in AI
Last Week in AI
Martin Fowler
Martin Fowler
Stack Overflow Blog
Stack Overflow Blog

The Register - Special Features: SC25

Copackaged optics have officially found their killer app - of course it's AI Nvidia's green500 dominance continues as France's Kairos super takes efficiency title Scientific computing is about to get a massive injection of AI Europe joins US as exascale superpower after Jupiter clinches Top500 run Need AI? Dell backs up the truck and tips out servers, storage, blueprints Nvidia-backed photonics startup Ayar Labs eyes hyperscale customers with GUC design collab Europe’s Jupiter supercomputer hits exascale threshold at inauguration Swiss boffins just trained a 'fully open' LLM on the Alps supercomputer Japan's sequel to Fugaku supercomputer will be Arm'd to the teeth UK reheats Edinburgh supercomputer plan sans exascale chops SiPearl ships reference node design for Rhea1 Arm chip
SC25 gets heavy with mega power and cooling solutions
2025-11-21 · via The Register - Special Features: SC25

SC25 Hydrogen-fueled gas turbines, backup generators, and air handlers probably aren't the kinds of equipment you'd expect on the show floor of a supercomputing conference. But your expectations would be wrong.

At SC25, datacenter physical infrastructure took center stage with sprawling dioramas of evaporative cooling towers and coolant distribution units (CDUs) filling massive booths rivaling those of major chip vendors and OEMs like Nvidia, AMD, and HPE Cray.

Among the largest of these displays were those from Mitsubishi Heavy Industries and Danfoss, which are best known for building power plants and industrial-scale air handling and facility cooling equipment.

REG AD

Located directly across from Nvidia's booth was datacenter physical infrastructure Vertiv vendor which had mocked up a datacenter aisle to show how their power and cooling tech could support the highest density deployments

Located directly across from Nvidia's booth was datacenter physical infrastructure Vertiv vendor, which had mocked up a datacenter aisle to show how their power and cooling tech could support the highest density deployments

These displays weren't tucked away at the back of the show floor, either. Nestled just a few feet from Nvidia's booth was Vertiv's, which had built out a full-scale mockup of a data hall to demonstrate how its liquid cooling and power delivery tech could support the deployment of the GPU giant's latest-gen rack systems.

REG AD

HPC is becoming big business

The reason for this invasion of datacenter physical infrastructure and heavy industrial equipment is simple. The construction of AI datacenters — and yes, AI is an HPC workload whether you like it or not — has become a big business, with power and cooling infrastructure becoming a key bottleneck for the buildout of new high-density datacenters.

HPC practitioners are no strangers to dense liquid-cooled systems. HPE's Cray EX4000 systems can be specced with up to 512 AMD MI300A APUs, totaling more than 293 kW per cabinet. Lawrence Livermore National Laboratory's El Capitan, the number one ranked system on the Top500 ranking of publicly known supercomputers, features 87 of these cabinets.

But while El Capitan's 44,544 APUs or Aurora's 63,744 GPUs are certainly among the largest scientific instruments ever built, they pale in comparison to the AI superclusters being built for OpenAI in Abilene, Texas or Meta in Richland Parish, Louisiana. 

When complete, OpenAI's first Stargate datacenter will exceed 400,000 Nvidia GPUs consuming 1.2 gigawatts of power. Meta's Hyperion datacenter will be even larger. Assuming of course the multi-year project is actually completed, it's expected to swell to more than 5 gigawatts, roughly 150x the design capacity of El Capitan.

These deployments aren't just larger; they're designed to support some of the densest machines ever built. Today, Nvidia's NVL72 racks are rated for anywhere between 120 kW and 140 kW each, putting them roughly on par with HPE Cray's EX cabinets, which while higher in capacity, are twice the size of a typical rack.

Needless to say, at this density, liquid cooling is a foregone conclusion, and that means, for every eight NVL72 racks, what Nvidia calls a "Superpod" datacenter, operators need a coolant distribution unit (CDU) with at least a megawatt of cooling capacity. While some datacenters have the facility water systems to support these deployments, many don't. 

At SC25, numerous thermal management vendors, including Vertiv, Nidec, nVent, and others showed off CDUs capable of dissipating anywhere from a few hundred kilowatts to more than two megawatts of power.

REG AD

If you're not familiar, CDUs are responsible for pumping coolant to and from connected systems, and exchanging the captured heat by way of either a liquid-to-air or liquid-to-liquid heat exchanger. Once captured, that heat has to be rejected to the atmosphere, something that's usually accomplished using cooling towers from the likes of Danfoss or Fourier.

Nidec was one of several thermal management vendors showing off high-density coolant distribution units (CDUs) with upwards of 2 MW of capacity, enough for about 16 GB200 NVL72 racks today or three of Nvidia's 600 kW Kyber racks

Nidec was one of several thermal management vendors showing off high-density coolant distribution units (CDUs) with upwards of 2 MW of capacity, enough for about 16 GB200 NVL72 racks today or three of Nvidia's 600 kW Kyber racks

These datacenters are only going to get denser as time goes on. Within the next two years, Nvidia plans to bring racks to market with as many as 576 GPU dies and 600 kW of system power with its Kyber reference designs. At this density, a CDU that could handle an entire superpod worth of NVL72 can barely manage 1.5 600 kW Kyber racks.

Further complicating matters is that, unlike a traditional enterprise datacenter, these systems don't use AC power directly. Instead, these racks run entirely on DC delivered via bus bars at the back of the racks.

Datacenter cooling towers aren't exactly the kind of things you can easily fit into an exhibition booth so Danfoss has gone back to tried and true dioramas

Datacenter cooling towers aren't exactly the kind of things you can easily fit into an exhibition booth so Danfoss has gone back to tried and true dioramas

Today, systems like Nvidia's GB200 or GB300 NVL72 run on 54 volt DC power but with the move to 600 kW racks, Nvidia is now adopting an 800 volt architecture.

At SC25, both power systems vendors Eaton and Vertiv showed off 800 V sidecar power racks, specifically designed for high-density deployments.

By 2027, Nvidia CEO Jensen Huang expects racks to surge to 600 kW with the debut of the Rubin Ultra NVL576

By 2027, Nvidia CEO Jensen Huang expects racks to surge to 600 kW with the debut of the Rubin Ultra NVL576

In addition to all of the electronics to convert from AC to DC, each of these sidecars is packed to the gills with batteries and capacitors. Eaton tells us that the engineering behind these systems was heavily influenced by the companies working on electric vehicles. These batteries effectively function as a UPS, providing up to 90 seconds' worth of clean power in the case of a brownout.

Even at 800 volts, these sidecars are only enough to support a single 600 kW Kyber rack. Because of this, companies like Eaton are already looking at liquid-cooled sidecars that boost the DC voltages to 1,500 volts.

Sidestepping the power crunch

REG AD

For these higher voltage sidecars to matter, you first need adequate utility to support them, which is by no means guaranteed. Hyperscalers like Meta have been forced to finance the construction of large-scale gas generator plants by local utilities to fuel their AI ambitions. 

At SC25, Eaton showed off its latest power sidecars, which increase voltages to 800 V in preparation for Nvidia's next-gen 600 kW rack platform

At SC25, Eaton showed off its latest power sidecars, which increase voltages to 800 V in preparation for Nvidia's next-gen 600 kW rack platform

Building out these plants takes time, which can be particularly problematic if your GPU servers don't have adequate power.

To get around this, some bit barn builders have taken to using mobile natural gas plants. xAI is using these plants as a stopgap to power its 200,000 GPU Colossus supercomputer in Memphis, Tennessee. At one point, the most unhinged chatbot on the internet was running on more than 35 of these portable generators.

Even after a second substation was completed, xAI said it would keep 15 of the turbines onsite for backup power.

Datacenter power has become such a bottleneck that Mitsubishi Power was showing modular power plants — or at least miniatures of them —  and even turbines designed specifically to run on hydrogen gas.

Given the scale of datacenter buildouts today, we wouldn't be surprised to see small modular reactor (SMR) startups like X-Energy or Kairos Power at next year's SC. ®

Hydrogen-powered turbine power plants probably aren't the kinds of things you'd expect to see on the show floor of a supercomputing conference

Hydrogen-powered turbine power plants probably aren't the kinds of things you'd expect to see on the show floor of a supercomputing conference