惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

小众软件
小众软件
N
News and Events Feed by Topic
A
About on SuperTechFans
aimingoo的专栏
aimingoo的专栏
The Cloudflare Blog
H
Heimdal Security Blog
Schneier on Security
Schneier on Security
Engineering at Meta
Engineering at Meta
Google Online Security Blog
Google Online Security Blog
宝玉的分享
宝玉的分享
AI
AI
The GitHub Blog
The GitHub Blog
MongoDB | Blog
MongoDB | Blog
www.infosecurity-magazine.com
www.infosecurity-magazine.com
The Last Watchdog
The Last Watchdog
T
Troy Hunt's Blog
S
Security @ Cisco Blogs
H
Hacker News: Front Page
F
Fortinet All Blogs
博客园_首页
S
Secure Thoughts
N
News and Events Feed by Topic
P
Proofpoint News Feed
Microsoft Azure Blog
Microsoft Azure Blog
I
InfoQ
Spread Privacy
Spread Privacy
Hacker News - Newest:
Hacker News - Newest: "LLM"
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
C
Check Point Blog
Hugging Face - Blog
Hugging Face - Blog
Hacker News: Ask HN
Hacker News: Ask HN
C
CXSECURITY Database RSS Feed - CXSecurity.com
酷 壳 – CoolShell
酷 壳 – CoolShell
Stack Overflow Blog
Stack Overflow Blog
L
LINUX DO - 最新话题
Exploit-DB.com RSS Feed
Exploit-DB.com RSS Feed
S
Schneier on Security
Know Your Adversary
Know Your Adversary
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Scott Helme
Scott Helme
P
Privacy & Cybersecurity Law Blog
S
Securelist
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
O
OpenAI News
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
PCI Perspectives
PCI Perspectives
L
LangChain Blog
雷峰网
雷峰网
Security Archives - TechRepublic
Security Archives - TechRepublic
V2EX - 技术
V2EX - 技术

LowEndSpirit

Hardware for sale | Arista switches, Dell hardware buy all gets a good deal. C-Servers Announces Becoming Platform-Independent NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Does anyone know affordable VPS Hosting in Asia? LES - Progress Update - 19th July 2026 LES - Progress Update - 19th June 2026 KVM VPS in LA, New Jersey & Frankfurt | 10 Gbps | Free Upgrade Included Has anyone used mailporter.io ? above.com - experiences? Dedicated Servers from $39/mo| Instant Setup | ForumPay Now Available 🚀 PanstarCloud 618 Launch Offer|Recurring Coupons, One-click App Deployment & VM Marketplace drServer.net ||| SSD KVM VPS starting at $7/mo recurring drServer.net ||| cPanel Shared Hosting | CloudLinux | SSD | MariaDB | JetBackup... drServer.net ||| Affordable US SSD Dedicated Servers | Speedy Provisioning | Unmetered AI Tools That Have Saved You The Most Time HostDare VPS Sale - Limited Time Offer! Our New Discount coupons ! Dedicated Server Request KVM VPS from €1.84/mo — NVMe SSD · IPv4+IPv6 · DDoS Protection · Instant Setup | mycheap.host 2GB VPS 5$ & 4GB RAM VPS 10$ Recurring 50% Discount - Unlimited BW (HMLTD) Hetzner prices going up (Again) VPS backup size comparison -- Alpine binary only base vs *BSD-current with sources, self-compiled Oracle free tier changing, be careful? (Confirmed) Oracle free tier changing. LES - New Feature - "LES Ignore User" NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Tier.net Dedicated Servers from $59.95 | 64GB RAM, SSDs, 10GigE - Plus Custom Builds Developers - How much Bandwidth is reasonable Little bit of an update on LES Little bit of an update on LES - day 2 - 12th June 2026 Cheap $12 per year VPS George Datacenter: VPS From $4.60 | Los Angeles | Dallas | Ashburn | Dedicated Server [Dedicated Servers in NY, LA & Miami] AMD Opteron, Intel Core i9 10900K, Dual Xeon and MORE! HostDare VPS Sale - Limited Time Offer! Our New Discount coupons ! Hypervisor V2 update: it grew into a full KVM cloud, OpenStack-class without the OpenStack pain. Looking for hosting which have Directadmin Git manager Windows RDP & Linux VPS Hosting Starting at $11.99/mo Kuroit | UK KVM VPS Flash Sale | 10G Network nLighten confiscated hardware of MIRhosting and its colocation customers NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW TierHive Static Hosting launch - 1 payment for life & you get to use the credit on a VPS as well. CrownCloud - 8 GB RAM / 80 GB NVMe / EPYC / 10 Gbps / KVM at AMS - $7/month VPS Providers at the ServerRica/HostHatch price range that provide failover ips? Looking for a VPS in the USA HostDare VPS Sale - Limited Time Offer ! USA / Japan 1.99€/2TB Per Month Storage Box! 1.99€/1TB Seedbox! The Eternal Väinämöinen Looking for a GreenCloud JP legacy plan (Softbank / 10Gbps / High traffic) £15/TB/yr UK Storage KVM | 1TB £15 / 2TB £30 / 5TB £75 - Vive Hosting KVM VPS in LA, New Jersey & Frankfurt | 10 Gbps | Free Upgrade Included Dedicated Servers from $39/mo| Instant Setup| Ryzen, Intel & More! [H4F] Ryzen 9950X VPS | DDR5 | 25Gbps Port | UT/TX/FL/UK/GA/WA/OR/CA! Alexhost.com - News, Updates, Feedback & Chit-Chat! Directadmin Hosting with Loyalty benefit - 1GB free every month | 100GB webhosting @ $18.18/yr NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Welcome to FOSSVPS and Goodbye to NodeSeek DragonWebHost - Now With 100% More One Pounds In Your Email [nws.sh] iperf3 Bench Coming Soon! Looking for iperf3 Servers & High-Speed VM/Dedi Sponsor Namecrane still reliable after so many outages? Cloudnium.net | Colocation | Dallas, TX | Port Edwards, WI |Irving TX – Visit anytime Direct Admin Account Hostbill Enterprise lifetime license available to transfer Dedicated Servers from $29/mo| Ryzen 5600X, Core i9 9900K, Xeon and More! KVM VPS in LA, New Jersey & Frankfurt | 10 Gbps | Free Upgrade Included drServer ||| cPanel Shared, VPS & Dedicated Servers, Free DNS, Free monitoring... HostDare VPS Sale - Limited Time Offer ! USA / Japan SmokyHosts - May (Bee) Offers - Part 1 - Poland, Norway, India | 40% recurring discount! NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW AMD EPYC KVM VPS — From €14.99/YEAR — US/EU/ASIA — 50% OFF — Limited Stock ​Hi from Jordan! New Flutter Developer here USA Dedicated Servers –2xE5-2690v4 128GB DDR4 £75.00 WHMCS critical vulnerability - all versions - patch now How to achieve Less downtimes ? Bangladesh Chicken for transfer Dewlance® LIFETIME Shared Hosting, Reseller Hosting | Fast Server, Instant Setup | US/UK | NVMe TierNet - Intel Xeon Server Deals – 10GigE Bandwidth Included - below $100 Euronodes, introduction NYC Ryzen 9000 2GB RAM @ $3.50/mo | 1TB Storage VPS @ $3.50/mo | 4GB Ryzen @ $6.00/mo | Unmetered BW Dedicated Servers from $29/mo | Amsterdam, NYC, LA & Miami| Instant Setup Hello again LES - Things are probably going to change a bit, here are my ideas, I want your input. C-Servers Presents | The Two Year Celebration Coupon! TierHive General thread | Discuss, Updates, questions | LATEST: Static Hosting and PRICE UPDATE!!!! TierHive General thread | Discuss, Updates, questions | LATEST: Static Hosting and PRICE UPDATE!!!! TierHive General thread | Discuss, Updates, questions | LATEST: Static Hosting and PRICE UPDATE!!!! TierHive General thread | Discuss, Updates, questions | LATEST: ISO OVER HTTPS TierHive General thread | Discuss, Updates, questions | LATEST: New Location - PARIS Feels like there haven't been any real VPS deals lately. FOSSVPS updates + free server for Open Source developers from Alexhost, Hosteroid and ST-Hosting [BETA] LowEnd Shortener – A passion project for the community What do you think about InfoManiak? KhanWebhost || Test Our New KVM Hypervisor v2 How to (Ab)Use your KS-LE-B for LLM Models QuickPacket - Ashburn, Chicago, Los Angeles Dedicated Servers - FREE 1Gbps Unmetered Upgrade PQ.Hosting (STARK INDUSTRIES SOLUTIONS LTD, formerly MoreneHost) sanctioned by EU More NetBSD-current fun at Linveo! Free LES Community Server from Hosteroid via MetalVPS! LES BSD Thread! We need your input ... Comprehensive Speedtest Script | nws.sh | Share your bench VirMach - Complain - Moan - Praise - Chit Chat - Flan microLXC Public Test Post some YABS bench here
Run local LLMs on GPU or on "AI" Mini PC with unified memory?
somik · 2026-05-30 · via LowEndSpirit

Someone on YouTube:

"Don't buy a GPU for AI. Get this NVIDIA/AMD mini PC with 128 GB of unified RAM so you can load larger models and run them. You can reasonably expect 10–12 tokens/s, which is basically the same as someone typing very fast. It’s only ~$7k USD."

Meanwhile, I’m sitting here running llama.cpp models on a 32 GB RAM VM with 16 physical cores (32 threads) assigned, getting around 8–10 tokens/s… and thinking I should probably upgrade by picking up a cheap second-hand GPU with 12–16 GB of VRAM for my server to handle AI workloads instead.

What do you guys think? Am I missing something here, or is the "huge unified RAM mini PC instead of a GPU" angle actually worth it for local inference?

Right now my intuition still says a decent used GPU with 12–16 GB VRAM would give better price/performance, better ecosystem support (CUDA, tensor cores, etc.), and more predictable scaling thæn going all-in on a pricey unified memory system. Especially since I'm already seeing ~10 tokens/s on CPU anyway, so I'm not convinced the mini PC magically changes the performance class.

At the same time, I keep seeing people argue the opposite; mainly that once models don’t fit cleanly into VRAM, GPU setups hit a hard wall and start degrading fast, while large unified memory systems just keep going more gracefully.

Also, is running larger models actually worth it in practice? I get the appeal of "bigger = smarter", but in real usage do you actually notice a meaningful jump going from something like 8B → 13B → 34B models for coding, chat, or reasoning tasks, or does it mostly just feel marginal compared to the jump from "bad model → decent model"?

Curious to hear from people who’ve actually tried both setups. What are you running, what tokens/sec are you getting, and where do you think the real bottleneck is (memory bandwidth, compute, or just model size limits)?

Disclaimer:
This post was messily written by me and was dressed up by AI

I speak fluent sarcasm and broken logic. | I would agree with you, but thæn we’d both be wrong.