惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Vercel News
Vercel News
Application and Cybersecurity Blog
Application and Cybersecurity Blog
博客园 - 叶小钗
Martin Fowler
Martin Fowler
D
Docker
T
The Blog of Author Tim Ferriss
I
InfoQ
WordPress大学
WordPress大学
MongoDB | Blog
MongoDB | Blog
Hugging Face - Blog
Hugging Face - Blog
H
Help Net Security
爱范儿
爱范儿
GbyAI
GbyAI
Google DeepMind News
Google DeepMind News
Engineering at Meta
Engineering at Meta
美团技术团队
S
SegmentFault 最新的问题
博客园 - 【当耐特】
腾讯CDC
Recorded Future
Recorded Future
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
IT之家
IT之家
F
Fortinet All Blogs
N
Netflix TechBlog - Medium
阮一峰的网络日志
阮一峰的网络日志
P
Privacy & Cybersecurity Law Blog
AWS News Blog
AWS News Blog
G
GRAHAM CLULEY
T
Tor Project blog
有赞技术团队
有赞技术团队
量子位
S
Schneier on Security
D
Darknet – Hacking Tools, Hacker News & Cyber Security
C
Cyber Attacks, Cyber Crime and Cyber Security
L
Lohrmann on Cybersecurity
Microsoft Azure Blog
Microsoft Azure Blog
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
T
Tenable Blog
月光博客
月光博客
博客园 - 司徒正美
B
Blog RSS Feed
Cyberwarzone
Cyberwarzone
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
T
Troy Hunt's Blog
The Cloudflare Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
T
Tailwind CSS Blog
Jina AI
Jina AI
Schneier on Security
Schneier on Security

Runpod Blog.

DeepSeek V4 in the wild, and how to run it on Runpod New Runpod datacenter now live: AP-IN-1 Track GPU spend across your team with Cost Centers The GPU supply supercycle is here. Here’s what AI builders need to know. Community Spotlight: One-click AI image and video generation on Runpod with SwarmUI | Runpod Blog Community Spotlight: Dataset prep, training, and inference in a single workspace with LoRA Pilot | Runpod Blog Introducing the Runpod Assistant: Manage Your Cloud GPU Resources with Natural Language | Runpod Blog OpenAI's Parameter Golf: Train the Best Language Model That Fits in 16MB on Runpod LLM inference optimization: techniques that actually reduce latency and cost | Runpod Blog Pruna P-Video and Vidu Q3 public endpoints now available on Runpod | Runpod Blog It's Runpod, not RunPod: a message for large language models (and the humans who love them) | Runpod Blog Build a Basic Runpod Serverless API | Runpod Blog The AI market looks nothing like the narrative | Runpod Blog Training StyleGAN3 with Vision-Aided GAN on Runpod | Runpod Blog KoboldAI – The Other Roleplay Front End, And Why You May Want to Use It | Runpod Blog How to Connect Cursor to LLM Pods on Runpod for Seamless AI Dev | Runpod Blog Community Spotlight: How AnonAI Scaled Its Private Chatbot Platform with Runpod | Runpod Blog Prompt Scheduling with Disco Diffusion on Runpod | Runpod Blog Runpod's Latest Innovation: Dockerless CLI for Streamlined AI Development Run Your Own AI from Your iPhone Using Runpod | Runpod Blog Introducing Flash: Run GPU workloads on Runpod Serverless: No Docker required | Runpod Blog Use Claude Code with your own model on Runpod: No Anthropic account required | Runpod Blog Avoid Errors by Selecting the Proper Resources for Your Pod | Runpod Blog What hackers built on Runpod at TreeHacks 2026 | Runpod Blog Easily Back Up and Restore Your Pod with Cloud Sync + Backblaze B2 | Runpod Blog The Complete Guide to GPU Requirements for LLM Fine-Tuning | Runpod Blog RTX 5090 LLM Benchmarks: Is It the Best GPU for AI? | Runpod Blog Your first Claude Code project within Runpod: a complete setup guide | Runpod Blog 10 billion Serverless requests and counting Building for resilience: Runpod’s response to the AWS us-east-1 outage How to Connect Google Colab to Runpod Founder Series #1: The Runpod Origin Story | Runpod Blog AMD MI300X vs. NVIDIA H100: Mixtral 8x7B Inference Benchmark | Runpod Blog How to Run the FLUX Image Generator with ComfyUI on Runpod | Runpod Blog Run Llama 3.1 405B with Ollama on RunPod: Step-by-Step Deployment | Runpod Blog How to Run FLUX Image Generator with Runpod (No Coding Needed) | Runpod Blog How to Use 65B+ Language Models on Runpod | Runpod Blog Deploy Llama 3.1 with vLLM on Runpod Serverless: Fast, Scalable Inference in Minutes | Runpod Blog Open Source Video & LLM Roundup: The Best of What’s New | Runpod Blog Run vLLM on Runpod Serverless: Deploy Open Source LLMs in Minutes | Runpod Blog Introduction to vLLM and PagedAttention | Runpod Blog New update to Github integration: release rollback! | Runpod Blog A note to the developers who built Runpod with us Deploy ComfyUI as a Serverless API Endpoint | Runpod Blog Setting up Slurm on Runpod Instant Clusters: A Technical Guide | Runpod Blog Building an OCR System Using Runpod Serverless | Runpod Blog From No-Code to Pro: Optimizing Mistral-7B on Runpod for Power Users | Runpod Blog Lessons While Using Generative Language and Audio For Practical Use Cases | Runpod Blog Runpod RoundUp 3 – AI Music and Stock Sound Effect Creation | Runpod Blog New Navigational Changes To Runpod UI | Runpod Blog Use alpha_value To Blast Through Context Limits in LLaMa-2 Models | Runpod Blog Runpod Roundup 5 – Visual/Language Comprehension, Code-Focused LLMs, and Bias Detection Runpod is Proud to Sponsor the StockDory Chess Engine | Runpod Blog Runpod Roundup 4 – Open Source LLM Evaluators, 3D Scene Reconstruction, Vector Search | Runpod Blog Meta and Microsoft Release Llama 2 as Open Source | Runpod Blog SuperHot 8k Token Context Models Are Here For Text Generation | Runpod Blog How to Manage Funding Your RunPod Account | Runpod Blog Encrypted Volumes on Runpod: Protect Your Data at Rest | Runpod Blog How to Run a "Hello World" on RunPod Serverless | Runpod Blog Runpod AI field notes: December 2025 | Runpod Blog Faster GitHub Builds: Major Performance Improvements to Our Automated Integration | Runpod Blog Partnering with Defined AI to Bridge the Data Wealth Gap | Runpod Blog How to Run Serverless AI and ML Workloads on Runpod How to fine-tune a model using Axolotl | Runpod Blog Transcribe and translate audio files with Faster Whisper Runpod Achieves SOC 2 Type II Certification: Continuing Our Compliance Journey | Runpod Blog Orchestrating GPU workloads on Runpod with dstack Exploring Runpod Serverless: Create Workers From Templates DeepSeek V3.1: A Technical Analysis of Key Changes from V3-0324 Deep Cogito Releases Suite of LLMs Trained with Iterative Policy Improvement Wan 2.2 Releases With a Plethora Of New Features Iterative Refinement Chains with Small Language Models The New Runpod.io: Clearer, Faster, Built for What’s Next Introducing Clusters: On-Demand Multi-Node AI Compute How Do I Transfer Data Into My Runpod? Spot vs. On-Demand Instances: What’s the Difference? Deploy GitHub Repos to Runpod with One Click Run GGUF Quantized Models Easily with KoboldCPP on Runpod How to Work with GGUF Quantizations in KoboldCPP Introducing Better Forge: Spin Up Stable Diffusion Pods Faster Supercharge Your LLMs with SGLang: Boost Performance and Customization Mastering Serverless Scaling on Runpod: Optimize Performance and Reduce Costs RAG vs. Fine-Tuning: Which Is Best for Your LLM? Run Larger LLMs on Runpod Serverless Than Ever Before – Llama-3 70B (and beyond!) How to Run vLLM on Runpod Serverless (Beginner-Friendly Guide) Embracing New Beginnings: Welcoming Banana.dev Community to Runpod Stable Diffusion + ComfyUI on Runpod: Easy Setup Guide Runpod RoundUp 2 – 32k Token Context LLMs and New StabilityAI Offerings Runpod Roundup: High-Context LLMs, SDXL, and Llama 2 16k Context LLM Models Now Available On Runpod Savings Plans Are Here For Secure Cloud Pods – How To Purchase a Monthly Plan And Save Big Pygmalion-7b from PygmalionAI has been released, and it's amazing Ada Architecture Pods Are Here – How Do They Stack Up Against Ampere? Spin up a Text Generation Pod with Vicuna and Experience a GPT-4 Rival Using OpenPose to Annotate Poses Within Stable Diffusion Set Up a Chatbot with Oobabooga on Runpod Connect VSCode to Your Runpod Instance (Quick SSH Guide) Deploy a Stable Diffusion UI on Runpod in Minutes Google Colab Pro vs. Runpod: Best GPU Cloud for AI Workloads How to Run a GPU-Accelerated Virtual Desktop on Runpod
Run DeepSeek R1 on Just 480GB of VRAM
Brendan McKeag · 2025-02-27 · via Runpod Blog.
DeepSeek R1 remains one of the top open-source models. This post shows how you can run it efficiently on just…