惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

S
Secure Thoughts
V
Visual Studio Blog
C
Check Point Blog
S
SegmentFault 最新的问题
GbyAI
GbyAI
WordPress大学
WordPress大学
Microsoft Security Blog
Microsoft Security Blog
S
Schneier on Security
The Cloudflare Blog
Microsoft Azure Blog
Microsoft Azure Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
博客园_首页
Know Your Adversary
Know Your Adversary
The Hacker News
The Hacker News
Engineering at Meta
Engineering at Meta
Project Zero
Project Zero
U
Unit 42
小众软件
小众软件
Simon Willison's Weblog
Simon Willison's Weblog
Stack Overflow Blog
Stack Overflow Blog
P
Palo Alto Networks Blog
云风的 BLOG
云风的 BLOG
B
Blog
人人都是产品经理
人人都是产品经理
P
Proofpoint News Feed
A
About on SuperTechFans
Scott Helme
Scott Helme
C
Cyber Attacks, Cyber Crime and Cyber Security
宝玉的分享
宝玉的分享
E
Exploit-DB.com RSS Feed
L
Lohrmann on Cybersecurity
S
Security @ Cisco Blogs
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
IT之家
IT之家
S
Securelist
Hacker News: Ask HN
Hacker News: Ask HN
博客园 - 叶小钗
MyScale Blog
MyScale Blog
博客园 - 聂微东
罗磊的独立博客
H
Heimdal Security Blog
T
Tor Project blog
Security Latest
Security Latest
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
G
GRAHAM CLULEY
O
OpenAI News
博客园 - Franky
T
Threat Research - Cisco Blogs
C
Cybersecurity and Infrastructure Security Agency CISA

IEEE Spectrum

We Are Crowd-Sourcing the Panopticon Magnetic Maps Promise GPS Free Navigation Indoors And Underwater Developers: Get Your Medical Mobile App Verified By IEEE Can Your Phone’s Lidar Sensors See Around Corners? AI Rings Turn Sign Language Into Text In Real Time How NeXT Turned Steve Jobs From Brash Founder to Disciplined Builder Flint Paper Battery Could Power Gadgets With Cellulose How AI Is Changing Cybersecurity Smart Earbuds Bring Visual Intelligence to Your Ears Mems Photonics Chip Shrinks Quantum Computer Control Limits Apple's 50th Anniversary and Its Forgotten Tech Can Analog Computing Transform Wearable AI? Caught by Your Own Devices: The Rise of Sensorveillance A Guide to Selecting Adhesives for Medical Device Applications - Wiley Science and Engineering Content Hub Is Bluetooth LE Audio the Future of Wireless Audio? Designing for Precision: CAD Tips for Micro-Scale 3D Printing - Wiley Science and Engineering Content Hub How Is Low-Cost Computing Handling Memory Price Hikes? NanoLEDs Push the Boundaries of Display Technology How Will FDA's New Rules Impact Your Health Gadgets? Are AI Companions Helping or Hurting Our Well-Being? AI Companions Are Growing more Popular Researchers Achieve Sub-Zero Cooling Without Harmful Refrigerants How Will AI Transform Figure Skating Judging at the Olympics? Breaking Boundaries in Wireless Communication: Simulating Animated, On-Body RF Propagation - Wiley Science and Engineering Content Hub 3D-Printed Batteries Put Power in Every Nook and Cranny IEEE Considers Safety Guidelines for Neurotech Gadgets Assistive Technology's DIY Approach Gains Traction Project G Stereo: A 60s Design Icon Stretchable OLEDs: Stable Light in Flexible Form
These AI Workstations Look Like PCs, but Pack a Stronger Punch
https://www.facebook.com/48576411181 · 2026-03-24 · via IEEE Spectrum

The rise of generative AI has spurred demand for AI workstations that can run or train models on local hardware. Yet modern PCs have proven inadequate for this task. A typical laptop has only enough memory to load a large language model (LLM) with 8 billion to 13 billion parameters—much smaller, and much less intelligent, than frontier models that are presumed to have over a trillion parameters. Even the most capable workstation PCs struggle to serve LLMs with more than 70 billion parameters.

Tenstorrent’s QuietBox 2 is an attempt to fill that gap. Though it looks like a PC workstation, the QuietBox 2 contains four of the company’s custom Blackhole AI accelerators, 128 gigabytes of GDDR6 memory—specialized memory used in GPUs—and 256 GB of DDR5 system memory (for a total of 384 GB). This configuration provides enough memory to load OpenAI’s GPT-OSS-120B and can run midsize models like Meta’s Llama 3.1 70B at speeds of nearly 500 tokens per second. For reference, that’s several times quicker than an average response from OpenAI’s GPT-5.2 or Anthropic’s Claude 4.6. The QuietBox 2 carries an expected retail price of US $9,999 and is slated to launch in the second quarter of 2026.

“The 128 gigabytes of GDDR that we have with our AI accelerators really defines how big of a model you can run at a reasonable speed,” says Milos Trajkovic, cofounder and systems engineer at Tenstorrent. “Our 128 gigabytes of GDDR6 RAM would require four Nvidia RTX 5090 graphics cards. That couldn’t fit in today’s 1,600-watt form factor, and the cost for four RTX 5090 GPUs is huge.”

An AI workstation built at the home office

Wattage, it turns out, is critical. Nvidia recommends a system power of 1,000 W for a single RTX 5090, so even a dual-GPU setup exceeds the continuous power draw for a typical 15-ampere, 120-volt power circuit. A system with four RTX 5090s could require 4,000 W or more at load.

The QuietBox 2, on the other hand, draws only 1,400 W at full load. It won’t trip the breaker, so it can be used anywhere a typical desktop PC might be plugged in, including a home office.

That’s not the only way the QuietBox 2 poses as a run-of-the-mill PC. The machine’s custom case is built to support the micro-ATX motherboard form factor, and the motherboard itself is an AMD chipset hosting an AMD CPU. The hardware is kept cool by closed-loop liquid cooling similar to that used by PC workstations and gaming computers. It even has customizable RGB LED lighting and a large semitransparent window that shows off the hardware.

“A lot of even our internal developers have requested a QuietBox because they’re just so easy to deploy,” says Chris Goulet, a thermal-mechanical engineer and team lead at Tenstorrent. “You just ship them the unit, they slap it on their desk, power it up, and they’re going.”

Where the QuietBox 2 differs from desktop PCs, though, is its AI accelerators. It’s equipped with four of Tenstorrent’s Blackhole application-specific ICs, a RISC-V chip designed specifically for AI workloads. Blackhole is packaged on an add-in card; each card has 120 Tensix AI accelerators and 32 GB of GDDR6 memory, for a total of 480 Tensix AI accelerators and 128 GB of GDDR6. Blackhole also has a large amount of on-chip SRAM at 180 megabytes per accelerator.

Two visions of desktop AI

Tenstorrent is not alone in its approach. Nvidia’s DGX Spark, released last year, packed Nvidia’s GB10 chip into a machine the size of a lunch box. Orders for the Spark’s big brother, the DGX Station with Nvidia’s GB300, were opened on 16 March 2026.

The DGX Station looks like a desktop PC workstation, and variants will be built by well-known PC brands like Asus and Dell. Nvidia’s offering has more memory than QuietBox 2, at up to 748 GB, but system power is quoted at 1,600 W—rather close to the maximum a 15-A, 120-V breaker will handle. This reflects differing visions for how their machines will be used. And, of course, the Nvidia DGX Station’s extra memory doesn’t come cheap. While most DGX Station system builders have not yet announced pricing, one retailer has listed a DGX Station from PC maker MSI for $85,000.

When I spoke to Allyn Bourgoyne, director of product marketing at Nvidia, after the announcement of DGX Spark and Station in 2025, he said the company expects most DGX owners will use the devices as remotely accessed workstations. “A common thing you might see is that I’ve got my Windows laptop, and I’m going to use my DGX Spark over the network. I’m going to send jobs over to it.” He added that companies could deploy DGX Spark and Station systems to serve multiple people at once.

The Tenstorrent QuietBox 2 can be used in this way, but the company also wants to target a good experience for people going one-on-one with the computer. “You don’t have to remote SSH into the box. You connect your monitor through HDMI, and it’s just like your PC at home. It has the Ubuntu desktop and utilities,” says Trajkovic.

Nvidia’s DGX systems also run a variant of Ubuntu (DGX OS) and include a desktop environment, but the devil is in the details. DGX systems use Nvidia CPUs based on ARM architectures and custom chipsets. The QuietBox 2 uses an AMD x86 CPU and compatible chipset, and is configured more like a traditional PC. That should be a boon for the QuietBox 2’s software compatibility.

Tenstorrent leans into that with a focus on open source software. The QuietBox 2’s entire software stack, from TT-Forge (the company’s AI compiler) to TT-Metalium (a low-level software development kit that provides kernel-level hardware control), is open source and available on GitHub. Tenstorrent has also published the instruction set architecture for its Tensix cores, so developers can see exactly how their workloads execute on the hardware. Nvidia, by contrast, is focused on its proprietary CUDA ecosystem, and DGX OS is not open source.

“A lot of our software stack is completely open, and we felt that from a hardware perspective, we kind of wanted to take a similar path,” says Goulet.