
























We’re thrilled to announce the addition of three more outstanding serverless Inference Providers to the Hugging Face Hub: Hyperbolic, Nebius AI Studio, and Novita. These providers join our growing ecosystem, enhancing the breadth and capabilities of serverless inference directly on the Hub’s model pages. They’re also seamlessly integrated into our client SDKs (for both JS and Python), making it super easy to use a wide variety of models with your preferred providers.
These partners join the ranks of our existing providers, including Together AI, Sambanova, Replicate, fal and Fireworks.ai.
The new partners enable a swath of new models: DeepSeek-R1, Flux.1, and many others. Find all the models supported by them below:
We're quite excited to see what you'll build with these new providers!



The following example shows how to use DeepSeek-R1 using Hyperbolic as the inference provider. You can use a Hugging Face token for automatic routing through Hugging Face, or your own Hyperbolic API key if you have one.
Install huggingface_hub from source (see instructions). Official support will be released soon in version v0.29.0.
from huggingface_hub import InferenceClient
client = InferenceClient(
provider="hyperbolic",
api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)
messages = [
{
"role": "user",
"content": "What is the capital of France?"
}
]
completion = client.chat.completions.create(
model="deepseek-ai/DeepSeek-R1",
messages=messages,
max_tokens=500
)
print(completion.choices[0].message)
And here's how to generate an image from a text prompt using FLUX.1-dev running on Nebius AI Studio:
from huggingface_hub import InferenceClient
client = InferenceClient(
provider="nebius",
api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)
# output is a PIL.Image object
image = client.text_to_image(
"Bob Marley in the style of a painting by Johannes Vermeer",
model="black-forest-labs/FLUX.1-schnell"
)
To move to a different provider, you can simply change the provider name, everything else stays the same:
from huggingface_hub import InferenceClient
client = InferenceClient(
- provider="nebius",
+ provider="hyperbolic",
api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)
import { HfInference } from "@huggingface/inference";
const client = new HfInference("xxxxxxxxxxxxxxxxxxxxxxxx");
const chatCompletion = await client.chatCompletion({
model: "deepseek-ai/DeepSeek-R1",
messages: [
{
role: "user",
content: "What is the capital of France?"
}
],
provider: "novita",
max_tokens: 500
});
console.log(chatCompletion.choices[0].message);
For direct requests, i.e. when you use the key from an inference provider, you are billed by the corresponding provider. For instance, if you use a Nebius AI Studio key you're billed on your Nebius AI Studio account.
For routed requests, i.e. when you authenticate via the hub, you'll only pay the standard provider API rates. There's no additional markup from us, we just pass through the provider costs directly. (In the future, we may establish revenue-sharing agreements with our provider partners.)
Important Note ‼️ PRO users get $2 worth of Inference credits every month. You can use them across providers. 🔥
Subscribe to the Hugging Face PRO plan to get access to Inference credits, ZeroGPU, Spaces Dev Mode, 20x higher limits, and more.
We also provide free inference with a small quota for our signed-in free users, but please upgrade to PRO if you can!
We would love to get your feedback! Here’s a Hub discussion you can use: https://huggingface.co/spaces/huggingface/HuggingDiscussions/discussions/49
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。