惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

L
LINUX DO - 热门话题
T
The Blog of Author Tim Ferriss
WordPress大学
WordPress大学
酷 壳 – CoolShell
酷 壳 – CoolShell
美团技术团队
博客园 - 叶小钗
李成银的技术随笔
V
Visual Studio Blog
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
Apple Machine Learning Research
Apple Machine Learning Research
Hugging Face - Blog
Hugging Face - Blog
V
V2EX
博客园 - 司徒正美
Blog — PlanetScale
Blog — PlanetScale
大猫的无限游戏
大猫的无限游戏
T
Tailwind CSS Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
aimingoo的专栏
aimingoo的专栏
人人都是产品经理
人人都是产品经理
GbyAI
GbyAI
A
About on SuperTechFans
罗磊的独立博客
W
WeLiveSecurity
L
LINUX DO - 最新话题
M
MIT News - Artificial intelligence
Hacker News: Ask HN
Hacker News: Ask HN
Application and Cybersecurity Blog
Application and Cybersecurity Blog
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
P
Proofpoint News Feed
Microsoft Security Blog
Microsoft Security Blog
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
H
Help Net Security
Martin Fowler
Martin Fowler
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
www.infosecurity-magazine.com
www.infosecurity-magazine.com
The Register - Security
The Register - Security
M
Microsoft Research Blog - Microsoft Research
Hacker News - Newest:
Hacker News - Newest: "LLM"
博客园 - Franky
The Cloudflare Blog
C
Cisco Blogs
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Google Online Security Blog
Google Online Security Blog
有赞技术团队
有赞技术团队
AWS News Blog
AWS News Blog
C
Cybersecurity and Infrastructure Security Agency CISA
小众软件
小众软件
I
Intezer
N
Netflix TechBlog - Medium
N
News and Events Feed by Topic

Runpod Blog.

Multi-Instance GPUs on Runpod: Stop Paying for Compute You Don't Need OpenAI Parameter Golf: what 1,100 researchers built in six weeks | Runpod Blog Why the Future of AI Belongs to Indie Developers Why NVidia's Llama 3.1 Nemotron 70B Might Be the Most Reasonable LLM Yet Why LLMs Can't Spell 'Strawberry' And Other Odd Use Cases Why Altering the Resolution in Stable Diffusion Gives Strange Results Why AI Needs GPUs: A No-Code Beginner’s Guide to Infrastructure | Runpod Blog When to Choose SGLang Over vLLM: Multi-Turn Conversations and KV Cache Reuse | Runpod Blog What’s New for Serverless LLM Usage in Runpod (2025 Update) | Runpod Blog Build an agentic AI safety pipeline with Runpod Flash and Granite Guardian 4.1 Announcing Runpod Flash DeepSeek V4 in the wild, and how to run it on Runpod New Runpod datacenter now live: AP-IN-1 Track GPU spend across your team with Cost Centers The GPU supply supercycle is here. Here’s what AI builders need to know. | Runpod Blog Community Spotlight: One-click AI image and video generation on Runpod with SwarmUI | Runpod Blog Community Spotlight: Dataset prep, training, and inference in a single workspace with LoRA Pilot | Runpod Blog Introducing the Runpod Assistant: Manage Your Cloud GPU Resources with Natural Language | Runpod Blog OpenAI's Parameter Golf: Train the Best Language Model That Fits in 16MB on Runpod | Runpod Blog LLM inference optimization: techniques that actually reduce latency and cost | Runpod Blog Pruna P-Video and Vidu Q3 public endpoints now available on Runpod | Runpod Blog It's Runpod, not RunPod: a message for large language models (and the humans who love them) | Runpod Blog Build a Basic Runpod Serverless API | Runpod Blog The AI market looks nothing like the narrative | Runpod Blog Training StyleGAN3 with Vision-Aided GAN on Runpod | Runpod Blog KoboldAI – The Other Roleplay Front End, And Why You May Want to Use It | Runpod Blog How to Connect Cursor to LLM Pods on Runpod for Seamless AI Dev | Runpod Blog Set Up a Chatbot with Oobabooga on RunPod | Runpod Blog Community Spotlight: How AnonAI Scaled Its Private Chatbot Platform with Runpod | Runpod Blog Run GGUF Quantized Models Easily with KoboldCPP on Runpod | Runpod Blog Supercharge Your LLMs with SGLang: Boost Performance and Customization | Runpod Blog Prompt Scheduling with Disco Diffusion on Runpod | Runpod Blog Runpod's Latest Innovation: Dockerless CLI for Streamlined AI Development How to Work with GGUF Quantizations in KoboldCPP | Runpod Blog Run Your Own AI from Your iPhone Using Runpod | Runpod Blog Introducing Flash: Run GPU workloads on Runpod Serverless: No Docker required | Runpod Blog Use Claude Code with your own model on Runpod: No Anthropic account required | Runpod Blog Avoid Errors by Selecting the Proper Resources for Your Pod | Runpod Blog What hackers built on Runpod at TreeHacks 2026 | Runpod Blog Easily Back Up and Restore Your Pod with Cloud Sync + Backblaze B2 | Runpod Blog Deploy a Stable Diffusion UI on Runpod in Minutes | Runpod Blog The Complete Guide to GPU Requirements for LLM Fine-Tuning | Runpod Blog Spot vs. On-Demand Instances: What’s the Difference? RTX 5090 LLM Benchmarks: Is It the Best GPU for AI? | Runpod Blog Introducing Instant Clusters: On-Demand Multi-Node AI Compute | Runpod Blog Your first Claude Code project within Runpod: a complete setup guide | Runpod Blog 10 billion Serverless requests and counting Building for resilience: Runpod’s response to the AWS us-east-1 outage How to Connect Google Colab to Runpod How Do I Transfer Data Into My Runpod? | Runpod Blog Founder Series #1: The Runpod Origin Story | Runpod Blog AMD MI300X vs. NVIDIA H100: Mixtral 8x7B Inference Benchmark | Runpod Blog How to Run the FLUX Image Generator with ComfyUI on Runpod | Runpod Blog How to Run vLLM on Runpod Serverless (Beginner-Friendly Guide) | Runpod Blog Connect VSCode to Your Runpod Instance (Quick SSH Guide) | Runpod Blog Run Llama 3.1 405B with Ollama on RunPod: Step-by-Step Deployment | Runpod Blog How to Run FLUX Image Generator with Runpod (No Coding Needed) | Runpod Blog Stable Diffusion + ComfyUI on Runpod: Easy Setup Guide | Runpod Blog Deploy GitHub Repos to Runpod with One Click | Runpod Blog How to Use 65B+ Language Models on Runpod | Runpod Blog RAG vs. Fine-Tuning: Which Is Best for Your LLM? | Runpod Blog Google Colab Pro vs. Runpod: Best GPU Cloud for AI Workloads | Runpod Blog Deploy Llama 3.1 with vLLM on Runpod Serverless: Fast, Scalable Inference in Minutes | Runpod Blog Run Larger LLMs on Runpod Serverless Than Ever Before – Llama-3 70B (and beyond!) | Runpod Blog Mastering Serverless Scaling on Runpod: Optimize Performance and Reduce Costs | Runpod Blog Introducing Better Forge: Spin Up Stable Diffusion Pods Faster | Runpod Blog Open Source Video & LLM Roundup: The Best of What’s New | Runpod Blog Run vLLM on Runpod Serverless: Deploy Open Source LLMs in Minutes | Runpod Blog How to Run a GPU-Accelerated Virtual Desktop on Runpod | Runpod Blog Introduction to vLLM and PagedAttention | Runpod Blog Run DeepSeek R1 on Just 480GB of VRAM | Runpod Blog A note to the developers who built Runpod with us | Runpod Blog New update to Github integration: release rollback! | Runpod Blog DeepSeek V3.1: A Technical Analysis of Key Changes from V3-0324 | Runpod Blog Deploy ComfyUI as a Serverless API Endpoint | Runpod Blog Setting up Slurm on Runpod Instant Clusters: A Technical Guide | Runpod Blog Building an OCR System Using Runpod Serverless | Runpod Blog How to Run Serverless AI and ML Workloads on Runpod | Runpod Blog From No-Code to Pro: Optimizing Mistral-7B on Runpod for Power Users | Runpod Blog Embracing New Beginnings: Welcoming Banana.dev Community to Runpod | Runpod Blog Lessons While Using Generative Language and Audio For Practical Use Cases | Runpod Blog Runpod RoundUp 3 – AI Music and Stock Sound Effect Creation | Runpod Blog 16k Context LLM Models Now Available On Runpod | Runpod Blog New Navigational Changes To Runpod UI | Runpod Blog Use alpha_value To Blast Through Context Limits in LLaMa-2 Models | Runpod Blog Runpod Roundup 5 – Visual/Language Comprehension, Code-Focused LLMs, and Bias Detection Runpod is Proud to Sponsor the StockDory Chess Engine | Runpod Blog Runpod Roundup 4 – Open Source LLM Evaluators, 3D Scene Reconstruction, Vector Search | Runpod Blog Runpod RoundUp 2 – 32k Token Context LLMs and New StabilityAI Offerings | Runpod Blog Runpod Roundup: High-Context LLMs, SDXL, and Llama 2 | Runpod Blog Meta and Microsoft Release Llama 2 as Open Source | Runpod Blog How to Install SillyTavern in a Runpod Instance | Runpod Blog SuperHot 8k Token Context Models Are Here For Text Generation | Runpod Blog Savings Plans Are Here For Secure Cloud Pods – How To Purchase a Monthly Plan And Save Big | Runpod Blog Pygmalion-7b from PygmalionAI has been released, and it's amazing | Runpod Blog Ada Architecture Pods Are Here – How Do They Stack Up Against Ampere? | Runpod Blog Using OpenPose to Annotate Poses Within Stable Diffusion | Runpod Blog How to Install SillyTavern in a RunPod Instance | Runpod Blog How to Manage Funding Your RunPod Account | Runpod Blog Spin up a Text Generation Pod with Vicuna and Experience a GPT-4 Rival | Runpod Blog
When to Use (or Not Use) Runpod's Proxy
2026-05-12 · via Runpod Blog.

Runpod uses a proxy system to ensure that you have easy accessibility to your pods without needing to make any configuration changes. This proxy utilizes Cloudflare for ease of both implementation and access, which comes with several benefits and drawbacks. Let's go into a little explainer about specifically how the Runpod proxy works and when it's most appropriate to use - and when you may want to bypass it.

How the proxy works

The proxy ensures that you always have the same method of accessing a pod on a given port, no matter what networking changes might occur, and that format is pod-ID, port, and then proxy.runpod.net, e.g. in the following format:

https://s7breobom8crgs-3000.proxy.runpod.net/

So for this ComfyUI pod that is defaulted to accept http requests on port 3000, no matter what changes, you can plug in that URL and it will always work. While Secure Cloud pod IP addresses generally should not change very often, there are situations where it happens, often due to network maintenance. Community Cloud IP pod addresses are liable to change at the discretion of the host, so those are not nearly as set in stone.

Drawbacks of the proxy

There are two main drawbacks to the proxy that should be considered:

  1. As with any proxy, adding more hops increases latency and the potential for network interruptions. Even under perfect network conditions, you're still bound by the speed of light, and all other things being equal, making the data stream traverse a larger physical distance will add at least a few milliseconds based solely on that. Real world situations, it often ends up being more than that.
  2. Cloudflare has a default timeout of approximately 100 seconds if the connection is not kept alive through some external means. So if you send an API request that simply waits for a response with no further communication, if that request takes longer than 100 seconds to fulfill it will time out. This can especially be an issue with very large hosted LLMs working on large context bodies of work, or intensive video or image generation. If the request starts streaming within 100 seconds that is fine, but if it doesn't even start the initial stream by then, nothing will be received at all as the connection will be closed.

How to get around using the proxy

Most official Runpod templates are set up to use the proxy. But if you'd rather decline the use of it, here's how to do that:

  1. Edit the template to switch the HTTP exposed ports to TCP. Note that you cannot expose the same ports on both HTTP and TCP.

  1. Go to the Connect -> TCP Port Mapping screen to find out what the IP and ports are for the pod. The system will map random external port numbers to exposed ports at random - at this time there is no way to define specific ports to always be used.

  1. Since in this case the pod is a ComfyUI pod, you can just plug in the IP and the assigned external port to access it.

If you need to define the IP address and port in code, naturally, it's going to be easiest to maintain if you set a variable for the URL, rather than hard-coding it.

Conclusion

So the upshot is, the proxy is genuinely useful to streamline and standardized pod template and give a way to access them easily by smoothing out wrinkles that are natural in networking, but there's also a lot of reasons why you may not want to use it. Ultimately, you have to decide what's right for you, but we want to give you the tools to complete the workflows you want them to be completed.

Questions? Feel free to run them by our Discord!