惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
V
Vulnerabilities – Threatpost
Attack and Defense Labs
Attack and Defense Labs
N
News and Events Feed by Topic
SecWiki News
SecWiki News
S
Security @ Cisco Blogs
Schneier on Security
Schneier on Security
B
Blog
TaoSecurity Blog
TaoSecurity Blog
The Last Watchdog
The Last Watchdog
H
Hacker News: Front Page
Hacker News - Newest:
Hacker News - Newest: "LLM"
博客园_首页
D
Docker
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
Y
Y Combinator Blog
W
WeLiveSecurity
N
News and Events Feed by Topic
F
Fortinet All Blogs
PCI Perspectives
PCI Perspectives
WordPress大学
WordPress大学
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
www.infosecurity-magazine.com
www.infosecurity-magazine.com
Recent Announcements
Recent Announcements
Forbes - Security
Forbes - Security
T
Tailwind CSS Blog
Hacker News: Ask HN
Hacker News: Ask HN
爱范儿
爱范儿
腾讯CDC
Last Week in AI
Last Week in AI
月光博客
月光博客
C
Cybersecurity and Infrastructure Security Agency CISA
P
Proofpoint News Feed
Help Net Security
Help Net Security
V
V2EX
C
Cyber Attacks, Cyber Crime and Cyber Security
C
CXSECURITY Database RSS Feed - CXSecurity.com
H
Heimdal Security Blog
L
LINUX DO - 最新话题
GbyAI
GbyAI
The Hacker News
The Hacker News
罗磊的独立博客
S
SegmentFault 最新的问题
H
Hackread – Cybersecurity News, Data Breaches, AI and More
博客园 - 【当耐特】
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
V2EX - 技术
V2EX - 技术
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
O
OpenAI News
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻

LowEndSpirit

Hardware for sale | Arista switches, Dell hardware buy all gets a good deal. C-Servers Announces Becoming Platform-Independent NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Does anyone know affordable VPS Hosting in Asia? LES - Progress Update - 19th July 2026 LES - Progress Update - 19th June 2026 KVM VPS in LA, New Jersey & Frankfurt | 10 Gbps | Free Upgrade Included Has anyone used mailporter.io ? above.com - experiences? Dedicated Servers from $39/mo| Instant Setup | ForumPay Now Available 🚀 PanstarCloud 618 Launch Offer|Recurring Coupons, One-click App Deployment & VM Marketplace drServer.net ||| SSD KVM VPS starting at $7/mo recurring drServer.net ||| cPanel Shared Hosting | CloudLinux | SSD | MariaDB | JetBackup... drServer.net ||| Affordable US SSD Dedicated Servers | Speedy Provisioning | Unmetered AI Tools That Have Saved You The Most Time HostDare VPS Sale - Limited Time Offer! Our New Discount coupons ! Dedicated Server Request KVM VPS from €1.84/mo — NVMe SSD · IPv4+IPv6 · DDoS Protection · Instant Setup | mycheap.host 2GB VPS 5$ & 4GB RAM VPS 10$ Recurring 50% Discount - Unlimited BW (HMLTD) Hetzner prices going up (Again) VPS backup size comparison -- Alpine binary only base vs *BSD-current with sources, self-compiled Oracle free tier changing, be careful? (Confirmed) Oracle free tier changing. LES - New Feature - "LES Ignore User" NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Tier.net Dedicated Servers from $59.95 | 64GB RAM, SSDs, 10GigE - Plus Custom Builds Developers - How much Bandwidth is reasonable Little bit of an update on LES Little bit of an update on LES - day 2 - 12th June 2026 Cheap $12 per year VPS George Datacenter: VPS From $4.60 | Los Angeles | Dallas | Ashburn | Dedicated Server [Dedicated Servers in NY, LA & Miami] AMD Opteron, Intel Core i9 10900K, Dual Xeon and MORE! HostDare VPS Sale - Limited Time Offer! Our New Discount coupons ! Hypervisor V2 update: it grew into a full KVM cloud, OpenStack-class without the OpenStack pain. Looking for hosting which have Directadmin Git manager Windows RDP & Linux VPS Hosting Starting at $11.99/mo Kuroit | UK KVM VPS Flash Sale | 10G Network nLighten confiscated hardware of MIRhosting and its colocation customers NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW TierHive Static Hosting launch - 1 payment for life & you get to use the credit on a VPS as well. CrownCloud - 8 GB RAM / 80 GB NVMe / EPYC / 10 Gbps / KVM at AMS - $7/month VPS Providers at the ServerRica/HostHatch price range that provide failover ips? Looking for a VPS in the USA HostDare VPS Sale - Limited Time Offer ! USA / Japan 1.99€/2TB Per Month Storage Box! 1.99€/1TB Seedbox! The Eternal Väinämöinen Looking for a GreenCloud JP legacy plan (Softbank / 10Gbps / High traffic) £15/TB/yr UK Storage KVM | 1TB £15 / 2TB £30 / 5TB £75 - Vive Hosting KVM VPS in LA, New Jersey & Frankfurt | 10 Gbps | Free Upgrade Included Dedicated Servers from $39/mo| Instant Setup| Ryzen, Intel & More! [H4F] Ryzen 9950X VPS | DDR5 | 25Gbps Port | UT/TX/FL/UK/GA/WA/OR/CA! Alexhost.com - News, Updates, Feedback & Chit-Chat! Directadmin Hosting with Loyalty benefit - 1GB free every month | 100GB webhosting @ $18.18/yr NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Welcome to FOSSVPS and Goodbye to NodeSeek DragonWebHost - Now With 100% More One Pounds In Your Email [nws.sh] iperf3 Bench Coming Soon! Looking for iperf3 Servers & High-Speed VM/Dedi Sponsor Namecrane still reliable after so many outages? Cloudnium.net | Colocation | Dallas, TX | Port Edwards, WI |Irving TX – Visit anytime Direct Admin Account Hostbill Enterprise lifetime license available to transfer Dedicated Servers from $29/mo| Ryzen 5600X, Core i9 9900K, Xeon and More! KVM VPS in LA, New Jersey & Frankfurt | 10 Gbps | Free Upgrade Included drServer ||| cPanel Shared, VPS & Dedicated Servers, Free DNS, Free monitoring... HostDare VPS Sale - Limited Time Offer ! USA / Japan SmokyHosts - May (Bee) Offers - Part 1 - Poland, Norway, India | 40% recurring discount! NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW AMD EPYC KVM VPS — From €14.99/YEAR — US/EU/ASIA — 50% OFF — Limited Stock ​Hi from Jordan! New Flutter Developer here USA Dedicated Servers –2xE5-2690v4 128GB DDR4 £75.00 WHMCS critical vulnerability - all versions - patch now How to achieve Less downtimes ? Bangladesh Chicken for transfer Dewlance® LIFETIME Shared Hosting, Reseller Hosting | Fast Server, Instant Setup | US/UK | NVMe TierNet - Intel Xeon Server Deals – 10GigE Bandwidth Included - below $100 Euronodes, introduction NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Dedicated Servers from $29/mo | Amsterdam, NYC, LA & Miami| Instant Setup Hello again LES - Things are probably going to change a bit, here are my ideas, I want your input. C-Servers Presents | The Two Year Celebration Coupon! TierHive General thread | Discuss, Updates, questions | LATEST: Static Hosting and PRICE UPDATE!!!! TierHive General thread | Discuss, Updates, questions | LATEST: Static Hosting and PRICE UPDATE!!!! TierHive General thread | Discuss, Updates, questions | LATEST: Static Hosting and PRICE UPDATE!!!! TierHive General thread | Discuss, Updates, questions | LATEST: ISO OVER HTTPS TierHive General thread | Discuss, Updates, questions | LATEST: New Location - PARIS Feels like there haven't been any real VPS deals lately. FOSSVPS updates + free server for Open Source developers from Alexhost, Hosteroid and ST-Hosting [BETA] LowEnd Shortener – A passion project for the community What do you think about InfoManiak? KhanWebhost || Test Our New KVM Hypervisor v2 How to (Ab)Use your KS-LE-B for LLM Models QuickPacket - Ashburn, Chicago, Los Angeles Dedicated Servers - FREE 1Gbps Unmetered Upgrade PQ.Hosting (STARK INDUSTRIES SOLUTIONS LTD, formerly MoreneHost) sanctioned by EU More NetBSD-current fun at Linveo! Free LES Community Server from Hosteroid via MetalVPS! LES BSD Thread! We need your input ... Comprehensive Speedtest Script | nws.sh | Share your bench VirMach - Complain - Moan - Praise - Chit Chat - Flan microLXC Public Test Post some YABS bench here
Run local LLMs on GPU or on "AI" Mini PC with unified memory?
somik · 2026-05-30 · via LowEndSpirit

Someone on YouTube:

"Don't buy a GPU for AI. Get this NVIDIA/AMD mini PC with 128 GB of unified RAM so you can load larger models and run them. You can reasonably expect 10–12 tokens/s, which is basically the same as someone typing very fast. It’s only ~$7k USD."

Meanwhile, I’m sitting here running llama.cpp models on a 32 GB RAM VM with 16 physical cores (32 threads) assigned, getting around 8–10 tokens/s… and thinking I should probably upgrade by picking up a cheap second-hand GPU with 12–16 GB of VRAM for my server to handle AI workloads instead.

What do you guys think? Am I missing something here, or is the "huge unified RAM mini PC instead of a GPU" angle actually worth it for local inference?

Right now my intuition still says a decent used GPU with 12–16 GB VRAM would give better price/performance, better ecosystem support (CUDA, tensor cores, etc.), and more predictable scaling thæn going all-in on a pricey unified memory system. Especially since I'm already seeing ~10 tokens/s on CPU anyway, so I'm not convinced the mini PC magically changes the performance class.

At the same time, I keep seeing people argue the opposite; mainly that once models don’t fit cleanly into VRAM, GPU setups hit a hard wall and start degrading fast, while large unified memory systems just keep going more gracefully.

Also, is running larger models actually worth it in practice? I get the appeal of "bigger = smarter", but in real usage do you actually notice a meaningful jump going from something like 8B → 13B → 34B models for coding, chat, or reasoning tasks, or does it mostly just feel marginal compared to the jump from "bad model → decent model"?

Curious to hear from people who’ve actually tried both setups. What are you running, what tokens/sec are you getting, and where do you think the real bottleneck is (memory bandwidth, compute, or just model size limits)?

Disclaimer:
This post was messily written by me and was dressed up by AI

I speak fluent sarcasm and broken logic. | I would agree with you, but thæn we’d both be wrong.