惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Fox-IT International blog
Recent Announcements
Recent Announcements
D
Docker
IT之家
IT之家
B
Blog
Jina AI
Jina AI
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 【当耐特】
Google DeepMind News
Google DeepMind News
F
Fortinet All Blogs
量子位
C
Check Point Blog
Microsoft Azure Blog
Microsoft Azure Blog
罗磊的独立博客
博客园 - 司徒正美
李成银的技术随笔
美团技术团队
Blog — PlanetScale
Blog — PlanetScale
雷峰网
雷峰网
The GitHub Blog
The GitHub Blog
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
J
Java Code Geeks
T
The Blog of Author Tim Ferriss
酷 壳 – CoolShell
酷 壳 – CoolShell
MongoDB | Blog
MongoDB | Blog
P
Proofpoint News Feed
L
LangChain Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Y
Y Combinator Blog
大猫的无限游戏
大猫的无限游戏
有赞技术团队
有赞技术团队
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
V
Visual Studio Blog
T
Tailwind CSS Blog
H
Help Net Security
Engineering at Meta
Engineering at Meta
小众软件
小众软件
B
Blog RSS Feed
Stack Overflow Blog
Stack Overflow Blog
月光博客
月光博客
M
Microsoft Research Blog - Microsoft Research
宝玉的分享
宝玉的分享
人人都是产品经理
人人都是产品经理
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
GbyAI
GbyAI
H
Hackread – Cybersecurity News, Data Breaches, AI and More
Last Week in AI
Last Week in AI
Martin Fowler
Martin Fowler
Stack Overflow Blog
Stack Overflow Blog

The Register - On-Prem: Systems

Moving to mainframe can be cheaper than sticking with VMware: Gartner Inference is giving AI chip startups a second chance to make their mark Qualcomm teases ‘dedicated CPU for agentic experiences’ and ‘agentic smartphones’ Qualcomm teases ‘dedicated CPU for agentic experiences’ and ‘agentic smartphones’ Fujitsu confirms mainframe biz to die in 2035, in time for quantum AI supercomputers to take over Fujitsu confirms mainframe biz to die in 2035, in time for quantum AI supercomputers to take over Microsoft levels up Azure Local to make it fit for large-scale sovereign clouds ZTE partners with China's National Clinical Research Center for Interventional Medicine to build a new paradigm of smart interventional medicine ZTE partners with China's National Clinical Research Center for Interventional Medicine to build a new paradigm of smart interventional medicine Tenstorrent’s Galaxy Blackhole AI servers escape the event horizon The crypto-to-AI bandwagon jumpers' club just landed another member: Core Scientific Core Scientific accelerates crypto-to-AI pivot Meta Arms itself to the teeth by signing for 'tens of millions' of AWS Graviton cores Meta Arms itself to the teeth by signing for 'tens of millions' of AWS Graviton cores AI now gobbling up power and management chips for servers AI now gobbling up power and management chips for servers Musk bets Tesla's AI future on Intel node that isn't finished yet Tesla stakes AI dreams on Intel's unfinished AI chip SK Hynix’s aspirations for ’Merica-made HBM inch closer to reality SK Hynix breaks ground on Indiana advanced packaging plant Datacenter boom keeps dirty coal plants alive in the US Datacenter boom keeps dirty coal plants alive in the US Forget one chip to rule them all: With TPU 8, Google has an AI arms race to win Oil crisis? What oil crisis? IT spending de-coupled from wider war shock AMD's Ryzen 9 9950X3D2 Dual Edition tested World's blandest man steps down from CEO job to spend more time in tastefully appointed home World's blandest man steps down from CEO job Growing AI power slurpage prompts MPs to examine low-energy computing Intel eases reliance on TSMC with 'Merica-made Core Series 3 processors Intel eases reliance on TSMC with Core Series 3 CPUs A beginner's guide to GPU virtualization: passthrough, vGPU, and MIG Guide to GPU virtualization: passthrough, vGPU, and MIG Orbital datacenter startup admits launch economics don't fly Orbital datacenter startup admits launch economics don't fly AI-powered mainframe exits are a bubble set to pop AI-powered mainframe exits are a bubble set to pop Cloud-smart strategy helps Interactive meet GenAI demands Cloud-smart strategy helps Interactive meet GenAI demands Oracle taps Bloom for 2.8 GW of fuel cells to keep datacenter binge going Oracle taps Bloom for fuel cells to support datacenter binge Britain gives Rolls-Royce the nod to sketch out its mini reactor future Britain gives Rolls-Royce the nod to sketch out its mini reactor future When the IBM PC and shoulder pads were big, Japan led the chip industry. It's trying to get back there now Japan going back to the future by reviving its chip industry AWS ponders selling its home-grown chips by the rack-load, has almost sold out AI capacity AWS ponders selling its home-grown chips by the rack-load, has almost sold out AI capacity Supply chain challenges risk delaying Nvidia's Rubin GPUs Nvidia's Rubin GPU is likely to be late thanks to memory shortage and technical challenges Supermicro investigating alleged China chip smuggling Intel gets trapped in Elon’s reality distortion field as it joins in megafab delusions Intel gets trapped in Elon’s reality distortion field as it joins in megafab delusions No-Nvidia interconnect club delivers 2.0 spec before v1.0 silicon ships OpenInfra General Manager talks sovereignty, governments deploying tech 'kill switches' Anthropic reveals $30bn run rate, plan to use new Google TPU Anthropic reveals $30bn run rate, plan to use new Google TPU How Nvidia learned to embrace the light in its quest for scale Nvidia embraces optical scale-up as copper reaches limits IBM wants Arm software on its mainframes to better support AI IBM wants Arm software on its mainframes to better support AI AI datacenters create heat islands around them, paper finds AI datacenters create heat islands around them, paper finds Arm says agentic AI needs a new kind of CPU. Intel's DC chief isn't buying it Arm says AI agents need a new CPU. Intel doesn't buy it Memory-makers' shares are down. Some RAM prices have eased. Blaming Google is not a good idea Memory-makers' shares are down. Don't blame Google US PC shipments to fall 13% as memory and storage crunch hits budget systems US PC shipments to fall 13% as memory costs surge ZTE showcases end-to-end intelligent computing at CloudFest 2026 in Germany, empowering the digital future South Korean AI chip startup Rebellions eyes new shores for rack-scale invasion Enterprise infrastructure is entering an economic reset Enterprise infrastructure is entering an economic reset AMD's new desktop CPU oozes cache out of all 16 cores Apple signs meaningless deal to make some less-important parts in America Three more charged over alleged Nvidia GPU smuggling scheme to China Three more charged with trying to smuggle GPUs to China Dell slims down business laptops, fattens up cooling and battery life Alibaba delivers RISC-V server chip optimized to run China’s top AI models Alibaba delivers RISC-V server chip optimized to run China’s top AI models AI-pilled Arm CEO teases mystery products that will turn it into a money machine AI-pilled Arm CEO teases mystery products that will turn it into a money machine Arm rolls its own 136-core AGI CPU to chase AI hype train Arm rolls its own 136-core AGI CPU to chase AI hype train SoftBank builds AI mega-datacenter on nuke site SoftBank to build massive AI datacenter on former US nuclear weapons site US chip testing firm shrugged off ransomware hit as minor – then came the data leak Explainer: AI-ready servers Explainer: AI-ready servers Elon Musk wants to build 50 times more chips than the world currently produces, using 'new physics' Elon Musk wants to build 50 times more chips than the world currently produces, using 'new physics' Australia to datacenter operators: BYO energy, pay your way, build green, or stay home Australia to datacenter operators: BYO energy or stay home Supermicro co-founder arrested, charged over $2.5B Nvidia GPU sales to China Supermicro co-founder charged over $2.5B GPU sales to China Jeff Bezos' rocket company Blue Origin applies to launch 51,000 datacenter satellites Blue Origin applies to launch 51,000 datacenter satellites Alibaba has made 470,000 AI chips, admits they’re inferior Alibaba has made 470,000 AI chips, admits they’re inferior Decoding Nvidia's Groq-powered LPX and the rest of its new rack systems Your next car might need 300 GB of RAM, and so will autonomous robots Your next car might need 300 GB of RAM, and so will robots
Rebellions eyes global expansion with rack-scale AI platform
2026-03-30 · via The Register - On-Prem: Systems

GPU-makers like Nvidia and AMD may dominate the AI infrastructure market, but there are still more than a few AI chip startups knocking around.

One of them is Rebellions, which after establishing a foothold on its home turf in South Korea, aims to bring its tech to the rest of the world, beginning with a new rack-scale compute platform that won't require enterprises to adopt liquid cooling or ultra-power dense racks.

Founded in late 2020, the startup produces AI accelerators that have been deployed in numerous applications in the South Korean domestic market.

REG AD

Initially, "we focused a great deal on telcos, service providers, and enterprise-end users within the Korean market," Rebellions chief business officer Marshall Choy told El Reg. "We built up use cases around everything from call centers and customer service to CCTV surveillance for the national highway system."

REG AD

"We're in a very strong position to take those learnings, capabilities, and improvements we've done over the years and bring that out to other regions, outside of Korea, as less of a fresh start, but more of a rinse and repeat type of motion," he added.

Following the introduction of its Rebel Quad accelerators, since rebranded as the Rebel100, the company has turned its attention to the rest of the world. Over the past few months, Rebellions has opened offices in Japan, Saudi Arabia, Taiwan, and the US, where it hopes to win over enterprises with its new RebelRack and RebelPods.

Before looking at the racks, let's talk about the chips themselves. Our sibling site The Next Platform dug into the Rebel100 last winter, but at a high level, the chip looks quite similar to Nvidia's H200 accelerators from late 2023.

According to Rebellions, the processor is capable of a petaFLOP of dense 16-bit floating point math or double that at FP8. However, unlike the H200, which used a monolithic compute die fabbed at TSMC, Rebellions' latest processor uses a chiplet architecture with four compute dies manufactured and packaged by Samsung.

That processor is fed by four HBM3e stacks totaling 144 GB of capacity and 4.8 TB/s of aggregate bandwidth.

While the smaller compute dies and reliance on Samsung should not only help with yields and avoid competing for TSMC's limited fab and packaging capacity, it still needs to source HBM from somewhere. Memory is already in short supply and HBM is among the scarcest.

This is where being a South Korean company with close ties to both the SK chaebol and Samsung comes in handy. SK Hynix and Samsung are the largest suppliers of HBM in the world. Last we heard, Rebellions was sourcing its HBM from Samsung, but in a pinch it shouldn't have to fight that hard to get SK Hynix to kick in some capacity.

The chip itself is currently being packaged as a PCIe card with a 600 watt TDP, rather than the OAM or SXM modules we've become accustomed to.

REG AD

Rebellions' reference design calls for eight of these cards to be crammed into a single air-cooled node.

High-efficiency, standard form factors such as 19-inch chassis and air cooling were key design points for Rebellions as it meant the system could be deployed into existing enterprise datacenters, something that can't be said of Nvidia's latest generation of liquid-cooled Rubin GPUs.

The RebelRack will feature four of these nodes, each connected via quad-400 Gbps networking, for a total of 32 accelerators and 64 petaFLOPS of FP8 compute, 4.6 TB of HBM3e, and 153.6 TB/s of aggregate memory bandwidth.

For larger deployments, Rebellions is also developing what it calls the RebelPod, which can scale from eight to 128 nodes, each with eight Rebel100 accelerators interconnected using 800 Gbps Ethernet.

"Right now, people think of rack level. I think we're going to be thinking, in a few days from now, about row level and datacenter level," Choy said.

Compared to GPU systems, this isn't a lot of networking. Most HGX systems now feature at least one 800 Gbps NIC per GPU. Choy tells us that going forward, the network fabric is going to be a major focus for the company.

As we've seen with other rack-scale systems from AMD and Nvidia, compute and networking are only two pieces of the puzzle; you also need software that can stitch everything together cohesively.

Rebellions' software stack is nothing exotic. We're told the platform runs on open source frameworks like vLLM, PyTorch, and Triton. For disaggregated inference, it's using llm-d, another open source framework that enables compute-heavy prefill operations on one set of accelerators and memory bandwidth-heavy decode operations on another.

REG AD

"Everything's open source, from vLLM compiler all the way up to the very highest level of stack, Red Hat, OpenShift, and everything in between," Choy said. "If you've used any of these technologies in any other context, you already know how to use Rebellions."

We've heard similar claims from chipmakers before that haven't ended up being quite so easy to use. However, Rebellions is a member of the PyTorch Foundation, something that can't be said of many AI chip startups.

Of course, none of this is cheap, but Rebellions isn't hurting for cash. On Monday the startup raised $400 million in a pre-IPO funding round led by Mirae Asset Financial Group and the Korea National Growth Fund to both support its expansion westward and further the development of more capable of and efficient AI accelerators and systems.

According to recent reports, the company could file for an IPO as soon as this year or early next year. ®