惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

NISL@THU
NISL@THU
有赞技术团队
有赞技术团队
WordPress大学
WordPress大学
U
Unit 42
腾讯CDC
宝玉的分享
宝玉的分享
Y
Y Combinator Blog
V
Visual Studio Blog
C
Check Point Blog
N
Netflix TechBlog - Medium
云风的 BLOG
云风的 BLOG
博客园 - 聂微东
酷 壳 – CoolShell
酷 壳 – CoolShell
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
P
Privacy & Cybersecurity Law Blog
V
Vulnerabilities – Threatpost
The Hacker News
The Hacker News
人人都是产品经理
人人都是产品经理
Google DeepMind News
Google DeepMind News
Vercel News
Vercel News
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
N
News and Events Feed by Topic
aimingoo的专栏
aimingoo的专栏
S
SegmentFault 最新的问题
Engineering at Meta
Engineering at Meta
Cyberwarzone
Cyberwarzone
The Last Watchdog
The Last Watchdog
S
Secure Thoughts
Recorded Future
Recorded Future
阮一峰的网络日志
阮一峰的网络日志
博客园 - Franky
E
Exploit-DB.com RSS Feed
V
V2EX
S
Security Affairs
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
IT之家
IT之家
爱范儿
爱范儿
小众软件
小众软件
Last Week in AI
Last Week in AI
C
Cybersecurity and Infrastructure Security Agency CISA
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
O
OpenAI News
The Cloudflare Blog
Cloudbric
Cloudbric
L
Lohrmann on Cybersecurity
H
Hacker News: Front Page
C
Cisco Blogs
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
Webroot Blog
Webroot Blog
月光博客
月光博客

TechPowerUp

Microsoft Visual Studio Professional 2026 + 15 Coding Courses Just $50 No Surprise Microsoft Office Hikes—Own it for Life for $50 The Witcher 3: Wild Hunt Reaches 65 Million Copies Ahead of Songs of the Past Launch Valve Steam Deck Sells Out in 24 Hours Despite Price Hike 007 First Light Exceeds 1.5 Million Sales in 24 Hours Rockstar Workers Unionize Ahead of GTA VI Launch Following Dismissal of 31 Workers Unknown Worlds Earns $250M Performance Bonus After Stellar Subnautica 2 Launch Dell Technologies Delivers First Quarter Fiscal 2027 Financial Results Marathon Season 2 To Start With Free Week and Plenty of New Content ASRock iBox Fanless Mini PCs Get Intel Panther Lake Upgrade Acer Broadens Portfolio with Two New Laptops Powered by the Latest Snapdragon Processors Qualcomm Introduces Snapdragon C Entry‑Level Processors for Budget Laptops OneXPlayer 3 Gaming Handheld Emerges With Intel Arc G3 Extreme Intel Arc G3 CPU Family Officially Released for Handheld Gaming PCs Samsung Exynos 2600 SoC Annotated, Showcases 3-tier CPU, AMD RDNA 4 iGPU Acer Expands Gaming Portfolio with Predator Atlas 8 Handheld Powered by Intel Silicon Motion Introduces SM2524XT PCIe Gen 5 DRAMless SSD Controller PXN Launches the Vector X Professional-Grade Sim Racing Pedals TP-Link Introduces Archer 8, Its First Wi-Fi 8 Router Platform Synology Announces Availability of New FlashStation FS200T Philips Announces Evnia 32M2N8900P QD-OLED 4K 240 Hz Gaming Monitor Samsung Display Develops First 4K 360 Hz QD-OLED Panel for Monitors LG Display Begins Mass Production of World's First 240 Hz RGB Stripe OLED ZALMAN Intros ZM-STC11 Silicone-based Thermal Paste Stream Deck Becomes the Action Layer for AI, Starting with NVIDIA G-Assist GIGABYTE Debuts New BRIX Mini PC Powered by Panther Lake to Scale Enterprise AI Sharkoon Announces the S25 Series Cases Scythe Intros the Magoroku Dual Fin-stack Air CPU Cooler First Look at ZOTAC's GeForce RTX 50-series 20th Anniversary Edition Graphics Cards ADATA TRUSTA AI Scaler Extended Memory Solution Breaks GPU Limits Fosi Audio Introduces C3 Gaming Sound Card with StepSense Footstep Radar Xbox "Player Voice" Forum Sparks Outcry for Exclusives and Free Online Multiplayer Lofree Teases Flow 2 Keyboards With Apple MacBook Neo Color Schemes Kubb Fanless Mini PC Gets Intel Panther Lake Update with High Price Leaker Hints at Astronomical Steam Machine Pricing 2D Platformer "Mina the Hollower" Early Reviews Top 2026 Review Ratings HP Inc. Reports Fiscal 2026 Second Quarter Results Gigabyte NVIDIA RTX 5080 Aorus "Infinity Wood" GPU Shows up in Leak Broadcom Delivers Industry's First Integrated Wi-Fi 8 SoCs to Power Next-Gen Mesh and Multi-Gigabit Routers AMD Announces New Versal Prime Series Gen 2 Devices For Only $10, Windows 11 Pro Adds Better Productivity Features and Security Steam Deck OLED Prices Rise by Up to $300 Amid Component Shortages Microsoft is Rolling Out Windows 11 Performance Boosting Update No Man's Sky: The Swarm Update Adds Universe-Level Threat Gigabyte Announces AORUS MASTER 16 Now Available Couch Co-op Hand Drawn Puzzle Adventure Lost in Tandem Announces July Release Date 007 First Light Launches Globally Today SAMA Unveils Its Next Wave of PC Hardware at Computex 2026 Corsair Announces the Novablade Pro Wireless Invincible VS Edition in Collaboration with Skybound Latest NVIDIA 610.47 WHQL Packs DLSS 5 Neural Rendering Profile Settings LG Innotek Showcases Next‑Generation Semiconductor Substrate Technologies at ECTC COLORFUL Intros Limited Edition iGame RTX 5070 Ultra OC x 007 First Light Edition The Witcher 3 Free Next-Gen Upgrade Raises Minimum System Requirements Ahead of DLC Release 007 First Light Ships with Baked-in FSR 3.1, Lacks FSR 4 Support, DLL Mods Don't Work Intel Arc GPU Graphics Drivers 101.8824 Beta Released VIA Labs Announces VL610/VL610D MST Hub Controllers for Multi-Display USB-C Docking QNAP Introduces the QSW 2000 Series 2.5GbE/10GbE L2 Web Managed Switches Meet MSI's PRO MAX Lineup at COMPUTEX 2026: Desktops & Monitors for Aesthetic, Minimalist Workspaces MSI Announces the PRO MAX Series of Displays, Designed for Mac Users Team Group to Showcase Quad-Rank CUDIMM ECC CUDIMM, and More at Computex 2026 Proton-CachyOS Adds NVIDIA Reflex and AMD Anti-Lag 2 Enablement for Non-Native Games Alan Wake and Control Underperformed, Says New Remedy CEO—Studio Will Apply Learnings from FBC: Firebreak to Resonant Rumors Suggest Persona 6 Development Concluded, Internally Delayed To 2027 8BitDo Ultimate 3E Game Controller Now Available for Pre-Order, Deliveries in August Work Louder Framer F1 Mechanical Keyboard Has a Built-In Website Stats Display New Intel USB4Stream Driver and Protocol Enables Low-Latency Device-to-Device File Sharing in Linux Kernel 7.2 Yunzii Releases X98 Solid Milky White QMK/VIA Mechanical Keyboard Enter the Chronosphere Enters Early Access: New Roguelike Blends Turn-Based and Bullet Hell Gameplay With Hand-Drawn Art Style Trust Introduces Zevo Ultra-Fast Rechargeable Multi-Wireless Mouse Silicon Power Debuts ROG Certified XPOWER Cyclone R DDR5 Gaming Memory World of Tanks: HEAT is Live Today on PC and Consoles NVIDIA "Vera" CPU Benchmarked: Beating Intel Xeon and AMD EPYC in Select Workloads Corsair Reveals SHUGO DDR5, a Collectible Memory Series NVIDIA GeForce Graphics Drivers 610.47 WHQL Drops Control Panel Support HyperX Introduces First-Ever Valorant Gaming Laptop Kensington Launches Entry-Level Thunderbolt 5 Docking Station with 80 Gbps Speeds and Triple 4K Support Arctic Announces Freezer 36-S Tower-type Air CPU Cooler Logitech Introduces the Signature Comfort Plus Lineup Computex Best Choice Awards 2026 Reveals Asus ROG Rapture GT-BN98 Pro Wi-Fi 8 Router Formula V Line to Preview Air Power G10 Case with Tilting Front Intake Fans Samsung Stacks Two 450-Layer NAND Chips Into a 900-Layer V-NAND MSI Unveils "AI Jinni": Your Next-Gen AI Hub Tailored for MSI PCs Sennheiser Introduces the Momentum 5 Wireless ANC Headphones Inno3D Announces NVIDIA MGX 4U GPU Server ATK X1 Air Gaming Mouse Features Virtual Sensor Location for Improved Control Next Kingdom Come Game To Launch Before Q2 2028 "If All Goes Well" SK hynix Unveils iHBM Thermal Solution to Boost AI Performance California and Colorado Age Verification Laws Get Open-Source OS Exemptions—SteamOS Enforcement Still Likely Valve Steam Machine Shows up in Vulkan Compliance Database, Launch Date Remains Elusive Finalmouse Reveals 38 g Starlight X Wireless Gaming Mouse: TMR Switches, Exclusive Sensor, and Nordic MCU for $179 Bungie Kept Most Destiny 2 Devs in the Dark About Sunsetting While Moving Resources to Marathon AMD "Zen 7" IP to Use TSMC A14 Node and More Advanced Packaging Combined Revenue of Top Five Global NAND Flash Suppliers Rose by 83.7% QoQ for 1Q26 as Supply Shortages Drove Price Hikes Microsoft Copilot Returns as a Sidebar in Windows 11 AMD's China-Exclusive Radeon RX 9070 GRE May Launch Globally EINAREX Introduces the HALOX Series AIO Liquid CPU Coolers This Week in Gaming (Week 22) Helldivers 2 Gets FSR 4, DLSS 4.5, and XeSS 3.0 Upscaling and Performance Optimizations in Upcoming Update Forza Horizon 6 Nears 5 Million Sales, 42% on Xbox Subnautica 2 Continues Impressive Sales, Crosses 4 Million Units Despite "No Violence" and EULA
Lexar Wants to Offload Local AI Models to SSD Amid the RAMpocalypse
by AleksandarK · 2026-06-16 · via TechPowerUp

Lexar has been experimenting with various technologies to help consumers achieve faster data throughput and more reliable storage. However, the company is now envisioning something entirely different as the PC evolves from a regular personal computer to a local AI-enhanced experience. We had the opportunity to interview Lexar's Chief Technical Officer (CTO), Daniel Guo, about the technology Lexar is developing to help offload some of the DRAM demand to much cheaper NAND Flash. According to Guo, DRAM is about six times more expensive to manufacture than NAND Flash, and there are opportunities for AI SSDs to reduce the DRAM requirements for running AI models on local hardware. This is where the Lexar AI Storage Core SSD comes into play, as the company is creating new storage solutions for consumers to support local AI deployments using much less DRAM by offloading large language models (LLMs) to SSDs. This approach allows larger and more powerful LLMs to fit into a PC build, reducing memory footprint by at least 40%

Based on internal testing, Lexar managed to run the Qwen 3.5 122B AI model on a local PC. Traditionally, users would need to spend about $4,500 on a PC with a decent CPU and 128 GB of DRAM to run this model. Through hardware and software optimization, the Lexar AI suite with the Lexar AI Storage Core SSD can reduce the DRAM requirement to 32 GB and run the model with 35 billion parameters at 15.6 tokens per second, compared to only 5.2 tokens per second using traditional frameworks. When attempting to load the 122B model on 32 GB of DRAM, the traditional Llama.cpp fails to load and crashes, while Lexar's SSD offloading provides about 4.4 tokens per second.

When the system is equipped with a more robust configuration featuring 64 GB of DRAM, running the 122B model with a larger context window is only possible with SSD offloading. With about 4,000 tokens in context, both traditional configurations and the Lexar AI stack run at a slightly higher speed. However, for larger contexts, often needed at 256K tokens, only the Lexar AI suite can launch and manage to produce about 19.3 tokens per second. Of course, this doesn't mean the setup is perfect, and not every model size can be offloaded to the SSD. With larger LLMs, system latency increases significantly, as the time between submitting a prompt and receiving a response grows exponentially.

The time to first token, often called TTFM, has been measured at about two seconds before the first token appears after the prompt is submitted with a 2K context window. When the context is larger at 4K, the delay increases to anywhere between 6 and 8 seconds. Technically, users could offload models that are about 400 billion parameters large, but the tokens per second and TTFM would be very slow. For some, this might be suitable, but for others, buying more DRAM is the better solution. Either way, this is an intriguing concept from Lexar.

Example from Computex 2026.
The company developed a concept for Mini-PCs and desktops featuring an M.2 slot designed for multiple insertions. An M.2 SSD is encased in a metal jacket (not a full enclosure) and is inserted into a 25 mm-wide slot on the front panel of a mini PC, connecting directly to the M.2 slot wired to the processor or chipset. This design eliminates other overheads. The hot-swappable SSD, which offloads AI models onto NAND Flash, reduces dependency on DRAM and aids in running larger models. It is available in both PCIe Gen 5 and Gen 4 versions, with the Gen 5 version offering more bandwidth. This M.2 SSD uses Lexar's custom Storage Processing Unit (SPU) DRAM-less controller for complete control over data movement.