Set deadlines for predictions
2025-10-16 · via Replicate's changelog

You can now set a deadline to automatically cancel a prediction if it doesn’t complete within a specified duration. This is useful when you’re building real-time or interactive experiences, such as virtual try-on for an online clothing store, where shoppers have usually moved on if an image takes more than 15 seconds to generate.

How it works

Set a deadline by including a Cancel-After header when creating a prediction. See our docs for details on the header format.

Here’s an example that sets a 1-minute deadline:
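A request of this kind might look like the sketch below. The Cancel-After: 1m and Prefer: wait headers come from this post; the model name, the prompt, the REPLICATE_API_TOKEN environment variable, and the create_prediction function name are assumptions for illustration. The call is wrapped in a shell function so that nothing is sent until you invoke it.

```shell
# Hypothetical helper: create a prediction that is canceled automatically
# if it has not finished within 1 minute.
# Assumptions: REPLICATE_API_TOKEN holds a valid API token, and the model
# and prompt below are placeholders you would replace with your own.
create_prediction() {
  curl --silent --show-error --request POST \
    --header "Authorization: Bearer ${REPLICATE_API_TOKEN}" \
    --header "Content-Type: application/json" \
    --header "Prefer: wait" \
    --header "Cancel-After: 1m" \
    --data '{"input": {"prompt": "a red jacket on a mannequin"}}' \
    "https://api.replicate.com/v1/models/black-forest-labs/flux-schnell/predictions"
}
```

Running create_prediction sends the request and prints the prediction object returned by the API.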

What happens when a deadline is reached

Replicate sets the prediction’s status to aborted if the deadline is reached before it starts running, and canceled if the deadline is reached while it’s running.

For public models, you’re only charged for predictions with a canceled status, not for aborted ones.
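The two outcomes can be told apart by the status field of the prediction object. The sketch below is only an illustration of that distinction: the function name is hypothetical, and it assumes you have already extracted the status string from the API response.

```shell
# Hypothetical helper: describe what a deadline-related status means.
# The status values "aborted" and "canceled" are the ones named above;
# the billing notes apply to public models only.
describe_deadline_status() {
  case "$1" in
    aborted)
      echo "deadline reached before the prediction started; not charged on public models" ;;
    canceled)
      echo "deadline reached while the prediction was running; charged on public models" ;;
    *)
      echo "no deadline-specific handling for status '$1'" ;;
  esac
}
```

For example, describe_deadline_status aborted reports that the prediction never started running and was not billed.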

Deadline vs sync mode wait duration

Prediction deadlines and sync mode serve different purposes. Use a prediction deadline (the Cancel-After header) to control when the prediction itself should be canceled. Use sync mode (the Prefer: wait header) to control how long the HTTP request stays open waiting for results.

You can also use both together. In the previous cURL example, Prefer: wait defaults to 1 minute and Cancel-After is explicitly set to 1 minute. This means the HTTP request stays open for up to 1 minute waiting for results, after which the prediction is canceled if it has not completed.

Alternatively, setting Cancel-After: 1m and Prefer: wait=10 means that the request returns after 10 seconds at the latest. If the prediction is still running at that point, you’ll get an incomplete prediction object, and the prediction will continue to run until it completes or is canceled at the 1-minute deadline.
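As a rough sketch of combining the two headers, the function below takes the deadline and the sync wait as arguments. The header values Cancel-After: 1m and Prefer: wait=10 come from the text; the function name, model, prompt, and REPLICATE_API_TOKEN variable are assumptions for illustration.

```shell
# Hypothetical helper: create a prediction with a separate deadline and
# sync-mode wait.
#   $1: value for the Cancel-After header, e.g. 1m
#   $2: number of seconds for Prefer: wait, e.g. 10
# Assumptions: REPLICATE_API_TOKEN is set; model and prompt are placeholders.
create_prediction_with_wait() {
  cancel_after="$1"
  wait_seconds="$2"
  curl --silent --show-error --request POST \
    --header "Authorization: Bearer ${REPLICATE_API_TOKEN}" \
    --header "Content-Type: application/json" \
    --header "Prefer: wait=${wait_seconds}" \
    --header "Cancel-After: ${cancel_after}" \
    --data '{"input": {"prompt": "a red jacket on a mannequin"}}' \
    "https://api.replicate.com/v1/models/black-forest-labs/flux-schnell/predictions"
}
```

Calling create_prediction_with_wait 1m 10 keeps the HTTP request open for at most 10 seconds while allowing the prediction itself up to 1 minute before it is canceled.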

Read more in the Replicate docs.