Power: The answer to and source of all your DC dilemmas
2025-11-15 · via The Register - Special Features: Supercomputing Month

Supercomputing Month

Power: The answer to and source of all your AI datacenter problems

Digital Realty CTO Chris Sharp weighs the impact of densification on the datacenter and the rise of the AI factory

INTERVIEW In the datacenter biz, power is the product. You either have it or you don't, Chris Sharp tells El Reg.

The CTO of colocation provider Digital Realty explains that without power, there are no servers, no storage, no GPUs, and none of those AI tokens that have Wall Street in a frenzy. But power isn't only the limiting factor in the US and much of the world; it has also upended the way datacenters are designed and built.

Over the past few years, GPU servers have transitioned from air-cooled machines that didn't require much if any additional work to deploy in a typical datacenter to something more reminiscent of the bespoke HPC clusters built by the likes of Cray, Eviden, or Lenovo.

This change didn't take place overnight. Nvidia's Ampere generation of GPUs introduced in 2020 didn't really require a fundamental shift in the company's approach to cooling or thermal management, Sharp told us.

But in the intervening years the winds were changing. GPU servers were not only growing more power hungry but were now in high demand, and deploying them was no longer as simple as racking and stacking servers.

The chief constraints: power and density. In the past five years we've gone from air-cooled systems that might pull 6 or 7 kilowatts under load to liquid-cooled rack-scale behemoths with more than 120 kW of compute on board.

"More times than not, customers are like, 'Okay, I broke through, and I'm free of the supply constraint, I have my chips,' and I have to say: slow down; there's a lot of other things you're going to need," Sharp said.

For X amount of GPUs you now need so many switches, storage servers, power delivery units, and coolant distribution units. In the case of Nvidia's densest systems, existing datacenters may not even be able to support the physical load.

"Silicon innovation is going to be hampered by the permanence of concrete in the datacenter," Sharp said. "There's a potential to have bought your infrastructure, and you do not have a place where it goes."

There are a lot of colocation providers that can't handle this level of densification, or, if they can, that doesn't mean they're prepared to support the next generation of compute platforms, he adds.

This is a problem for hardware vendors like Nvidia and AMD, who believe that as Moore's Law slows and advancements in silicon density and energy efficiency become fewer and further between, the best path forward is packing larger and larger chips closer together.

Today, Nvidia's rack systems are hovering around 140kW in compute capacity. But we've yet to reach a limit. By 2027, Nvidia plans to launch 600kW racks that pack 576 GPU dies into the space once occupied by just 32.
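To put those figures side by side, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers quoted in this article; the variable names are ours, and the per-die figure naively assumes the whole rack budget is split evenly across GPU dies, which ignores CPUs, networking, and cooling overhead.

```python
# Approximate figures quoted in the article; names are illustrative only.
AIR_COOLED_KW = 7       # upper end of the "6 or 7 kilowatts" air-cooled range
CURRENT_RACK_KW = 140   # "hovering around 140kW" today
KYBER_RACK_KW = 600     # racks Nvidia plans to launch by 2027
KYBER_GPU_DIES = 576    # GPU dies per planned 600kW rack
PRIOR_GPU_DIES = 32     # dies that once occupied the same space


def growth(new: float, old: float) -> float:
    """Return how many times larger `new` is than `old`."""
    return new / old


# Rack power: air-cooled era to today, and today to the planned racks.
air_to_current = growth(CURRENT_RACK_KW, AIR_COOLED_KW)
power_growth = growth(KYBER_RACK_KW, CURRENT_RACK_KW)

# GPU dies packed into the same space.
die_growth = growth(KYBER_GPU_DIES, PRIOR_GPU_DIES)

# Naive assumption: the full rack budget divided evenly across GPU dies.
kw_per_die = KYBER_RACK_KW / KYBER_GPU_DIES

print(f"Air-cooled to current rack power: {air_to_current:.0f}x")
print(f"Current to planned rack power:    {power_growth:.1f}x")
print(f"GPU dies in the same space:       {die_growth:.0f}x")
print(f"Naive kW per GPU die:             {kw_per_die:.2f}")
```

On these numbers, die count grows far faster than rack power, which is one way to read why facilities, rather than silicon, become the constraint.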

Nvidia CEO Jensen Huang announced its Vera Rubin Ultra platform and Kyber racks at GTC in March partly to move the market forward: the infrastructure required to support large-scale deployments of these racks simply didn't exist yet, and needed to be built.

Sharp is fairly confident that Digital Realty's existing facilities will be able to accommodate small deployments of Nvidia's Kyber racks, which he refers to as tuck-ins. Where things get complicated is with larger deployments — those, he says, will require a different kind of datacenter.

To get ahead of this trend toward denser AI deployments, Digital Realty announced a research center in collaboration with Nvidia in October.

The facility, located in Manassas, Virginia, aims to develop a new kind of datacenter, which Nvidia CEO Jensen Huang has taken to calling an AI factory, that consumes power and churns out tokens in return.

The facility will be among the first to feature Nvidia's Vera Rubin family of GPUs, which are slated to make their debut sometime next year, and will provide Digital Realty the infrastructure necessary to test, validate, and refine new datacenter architectures for power delivery, thermal management, and networking. 

To support this, Digital Realty is also working with Nvidia and its partners on a new software platform called Omniverse DSX, which uses digital twins to simulate gigawatt-scale datacenters and allow for rapid prototyping of modular systems, thermal management systems, networking and power delivery.

While datacenters depend on a steady supply of power, the spikier nature of AI infrastructure also poses a challenge to grid operators. To manage this, the colocation provider is working with Nvidia and startup Emerald AI to develop a grid-flexible power management system which will enable datacenter operators to proactively respond to grid conditions.

"What we're trying to do is certify designs and really create a blueprint for the broader community," Sharp said. ®