惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

V
Vulnerabilities – Threatpost
U
Unit 42
F
Fortinet All Blogs
aimingoo的专栏
aimingoo的专栏
P
Proofpoint News Feed
F
Full Disclosure
月光博客
月光博客
Engineering at Meta
Engineering at Meta
博客园_首页
The Register - Security
The Register - Security
G
Google Developers Blog
The Cloudflare Blog
博客园 - Franky
K
Kaspersky official blog
A
Arctic Wolf
Scott Helme
Scott Helme
C
Cisco Blogs
Hugging Face - Blog
Hugging Face - Blog
C
Check Point Blog
NISL@THU
NISL@THU
AI
AI
D
DataBreaches.Net
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Stack Overflow Blog
Stack Overflow Blog
Project Zero
Project Zero
The GitHub Blog
The GitHub Blog
H
Hackread – Cybersecurity News, Data Breaches, AI and More
量子位
Vercel News
Vercel News
T
Tor Project blog
P
Privacy International News Feed
D
Docker
I
Intezer
L
LangChain Blog
P
Proofpoint News Feed
Security Latest
Security Latest
C
CXSECURITY Database RSS Feed - CXSecurity.com
T
Threatpost
博客园 - 聂微东
AWS News Blog
AWS News Blog
Martin Fowler
Martin Fowler
P
Privacy & Cybersecurity Law Blog
V
V2EX
Last Week in AI
Last Week in AI
C
Cybersecurity and Infrastructure Security Agency CISA
The Hacker News
The Hacker News
T
Tenable Blog
Blog — PlanetScale
Blog — PlanetScale
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Tailwind CSS Blog

The Register - Special Features: AWS Re:invent

DJ Garman drops the ball instead of the bass in AWS re:Invent keynote Amazon keeps the pressure on Intel, AMD with 192-core Graviton5 CPU Amazon is forging a walled garden for enterprise AI AWS admits AI coding tools cause problems, reckons its three new agents fix 'em AWS joins Microsoft, Google in the security AI agent race Amazon primed to fuse Nvidia's NVLink into 4th-gen Trainium accelerators AWS: How do you do, fellow kids? Please watch our keynotes in Fortnite AWS, Google roll out multi-cloud fix they said wasn't needed AWS under pressure as big three battle to eat the cloud market Countries use cyber targeting to plan strikes: Amazon CSO EU eyes AWS, Azure for gatekeeper tag in cloud clampdown Geopolitics push European CIOs to think local on cloud Atlassian moves Jira, Confluence instances to AWS Graviton
AWS offers AI-in-a-box for enterprise datacenters
2025-12-03 · via The Register - Special Features: AWS Re:invent

re:invent Many businesses and government agencies require that all sensitive data stay on-premises for legal or security reasons. If those orgs want to work with AI, they can't rely on regular public clouds, but now they can let AWS build and manage AI hardware and software in their datacenters.

Announced Tuesday at the company's re:Invent conference, AWS AI Factories is a fully managed solution where enterprises provide the datacenter and power while the house of Bezos installs its hardware and software under their roof. They operate like a private AWS Region, using customers' existing datacenter space, power, and network links while AWS brings in and manages its own racks of infrastructure, including compute, storage, database, and AI services, all running locally. Customers will not need to worry about acquiring hardware, installing it, or buying or building software platforms for their AI models. All they need is the physical space in a datacenter and enough power capacity to juice all those GPUs. That will save orgs a lot of time and expertise.

Customers can use AWS tools like the Amazon Bedrock foundation model builder or SageMaker machine-learning platform, as well as some high-end hardware, such as the company's Trainium3 AI accelerators and Nvidia GPUs like the current-gen B200 and GB200, or next-gen GB300 and B300. It will use a petabit-scale, non-blocking network to connect the GPUs and offer Amazon FSx for Lustre and Amazon S3 Express One Zone storage technology. AWS does not yet support NVLink Fusion, which is a high-speed, chip-to-chip interconnect, but says support will arrive in its future Trainium4 chips.

Amazon didn't disclose how much this will cost, so we don't yet know whether it will be more expensive than installing one's own hardware and software from scratch.

"The AI factories operate exclusively for each customer and it helps them with that separation, maintaining the security and reliability you get from AWS while also meeting stringent compliance and sovereignty requirements," AWS CEO Matt Garman said in his keynote.

Garman said the work was inspired by the service's efforts to spin up private, secure AI capabilities for the Kingdom of Saudi Arabia’s AI endeavors, working with the ironically named AI platform company Humain. The company is helping build an "AI Zone" in the country, which will have up to 150,000 AI chips and dedicated AWS infrastructure.

According to Garman, the AWS partnership provides Saudi Arabia with high performing infrastructure, models, and AI services like SageMaker and Bedrock, while meeting the kingdom’s security, privacy, and responsible AI standards.

“This type of work has sparked some interest in others: large government organizations in the public sector who are interested in a similar concept,” Garman said. “We sat back and asked ourselves. Could we deliver this type of AI zone to a broader set of customers. Maybe even something that could leverage customers' existing datacenters?”

In building its own AI factories, AWS faces some stiff competition. Dell’s AI Factory with Nvidia was introduced in early 2024, and its promises of an edge to datacenter solution have captured billions in sales. In May, the company boasted 3,000 customers for its AI Factories, and last week, Dell said it had shipped $15.6 billion in AI servers year to date [PDF].

HPE’s own private AI cloud product, which is also backed by Nvidia, won adoption by more than 300 new logos during the quarter ended July 31, it announced during its last earnings call [PDF]. Lenovo has also announced [PDF] a massive upswing in the sale of its infrastructure solutions, up 24 percent year over year in the quarter ended September 30, including "high double digit growth" of AI servers.

Amazon's timing may be a bit of a mismatch, as polling from analysts at Forrester and Gartner indicates that the purse strings are tightening on AI spending as customers want a solid track to a return on their investment with these systems, which can cost millions of dollars to stand up, not to mention the price to power, maintain, and operate them.

Naveen Chhabra, principal analyst at Forrester, speculated that AWS has likely been building this for several quarters, and suggested AWS is playing into the challenges that customers face when deploying AI infrastructure, including cooling, long lead times for products, the piecemeal approach to architecture, and data sovereignty.

According to a November 21 report that Chhabra co-wrote, the revenue achieved as a result of AI spending is lagging and customers are taking notice.

“With free cash flow tightening, interest rates still high, and even OpenAI’s CEO warning of a dot-com-style bubble, the sector faces a reckoning,” Forrester stated. The analyst group also posited that the looming chip shortage is causing limited and uncertain access to memory across devices, while pushing vendors to rethink their architecture.

Chhabra sees many customers moving AI workloads to the cloud to combat these challenges, but that won't work if you are required to keep all of your data on-prem, which is the target market for AWS AI Factories. ®