Installing and Configuring LiteLLM on Ubuntu 24.04
河在谈 · 2026-03-28 · via 博客园 (cnblogs) - 河在谈

1. Install Docker
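The post does not spell out the installation commands. A minimal sketch, assuming the Ubuntu 24.04 distribution packages `docker.io` and `docker-compose-v2` are acceptable (Docker's own apt repository is another option); the function names are illustrative only:

```shell
# Assumption: Ubuntu 24.04, installing Docker from the distro packages.
# Nothing runs until you call install_docker yourself.
install_docker() {
  sudo apt-get update
  sudo apt-get install -y docker.io docker-compose-v2
  sudo systemctl enable --now docker
}

# Print every command from the argument list that is not on PATH.
missing_commands() {
  for cmd in "$@"; do
    command -v "$cmd" >/dev/null 2>&1 || printf '%s\n' "$cmd"
  done
}

# Usage (run manually):
#   install_docker
#   missing_commands docker curl   # prints nothing when both are available
```

If `missing_commands docker curl` prints nothing, the remaining steps should have what they need.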

2. docker-compose.yml

services:
  litellm:
    image: docker.litellm.ai/berriai/litellm:main-stable
    volumes:
      - ./config.yaml:/app/config.yaml
    ports:
      - "4000:4000"
    command:
      - "--config=/app/config.yaml"
      - "--port=4000"
    extra_hosts:
      - "host.docker.internal:host-gateway"
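With the compose file in place, the service can be started and polled until it answers. This is a sketch under assumptions: it is run from the directory containing `docker-compose.yml`, and the proxy exposes an unauthenticated `/health/liveliness` endpoint on port 4000. The helper names `wait_for` and `start_litellm` are made up for illustration.

```shell
# Retry a command up to $1 times, sleeping $2 seconds between attempts.
# Returns 0 as soon as the command succeeds, 1 if every attempt fails.
wait_for() {
  attempts=$1
  delay=$2
  shift 2
  i=0
  while [ "$i" -lt "$attempts" ]; do
    "$@" && return 0
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}

# Start the stack in the background, then wait up to about 60 seconds
# for the proxy to respond (assumed health endpoint, see above).
start_litellm() {
  docker compose up -d
  wait_for 30 2 curl -fsS http://localhost:4000/health/liveliness >/dev/null
}

# Usage (run manually):
#   start_litellm && echo "proxy is up" || docker compose logs litellm
```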

3. config.yaml

model_list:
  - model_name: qwen3.5:35b-a3b-bf16
    litellm_params:
      model: ollama/qwen3.5:35b-a3b-bf16
      api_base: http://172.24.193.66:11434 
  - model_name: qwen3-coder-next:q8_0
    litellm_params:
      model: ollama/qwen3-coder-next:q8_0
      api_base: http://172.24.193.67:11434 
  - model_name: nemotron-cascade-2
    litellm_params:
      model: ollama/nemotron-cascade-2
      api_base: http://172.24.193.68:11434

# Set the access key (clients send it as "Authorization: Bearer <master_key>")
general_settings:
  master_key: sk-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
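The `sk-XXXX...` value above is a placeholder. One possible way to generate a random replacement is sketched below; it assumes a Linux host with `/dev/urandom`, and it keeps the `sk-` prefix used in this post. The function name is hypothetical.

```shell
# Print a random key: "sk-" followed by 32 hex characters (16 random bytes).
generate_master_key() {
  printf 'sk-%s\n' "$(head -c 16 /dev/urandom | od -An -tx1 | tr -d ' \n')"
}

# Usage (run manually), then paste the value into general_settings.master_key:
#   generate_master_key
```

Use the same value in the `Authorization: Bearer` header of every request.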

4. Test

curl -X POST http://localhost:4000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX" \
  -d '{"model":"qwen3.5:35b-a3b-bf16","messages":[{"role":"user","content":"Hello"}]}'
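To exercise every backend rather than just one, the request can be wrapped in a small helper. This is only a sketch: it assumes the proxy is reachable at `localhost:4000`, that the environment variable `LITELLM_KEY` holds the `master_key` from config.yaml, and that the `model` field must match a `model_name` entry from `model_list`. All function and variable names here are illustrative.

```shell
# Assumed base URL of the proxy; override by exporting LITELLM_URL.
LITELLM_URL="${LITELLM_URL:-http://localhost:4000}"

# Build a minimal chat payload. No JSON escaping is done, so keep the
# message free of double quotes and backslashes.
build_chat_payload() {
  printf '{"model":"%s","messages":[{"role":"user","content":"%s"}]}' "$1" "$2"
}

# Send one chat request: $1 = model_name, $2 = user message.
chat() {
  curl -sS -X POST "$LITELLM_URL/v1/chat/completions" \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer $LITELLM_KEY" \
    -d "$(build_chat_payload "$1" "$2")"
}

# Query each model_name defined in config.yaml in turn.
test_all_models() {
  for model in qwen3.5:35b-a3b-bf16 qwen3-coder-next:q8_0 nemotron-cascade-2; do
    printf '== %s ==\n' "$model"
    chat "$model" "Hello"
    printf '\n'
  done
}

# Usage (run manually):
#   export LITELLM_KEY=sk-...   # your master_key
#   test_all_models
```

An error for a single model usually points at that model's `api_base` host rather than at the proxy itself.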

Posted @ 2026-03-28 13:17 by 河在谈