Release v4.5.2 · oobabooga/textgen
oobabooga · 2026-04-16 · via Tags from textgen

Changes

Bug fixes

  • Fix Gemma 4 tool calling: handle double quotes and newline characters in arguments (#7477). Thanks, @mamei16.
  • Fix chat scroll getting stuck on thinking blocks (#7485).
  • Prevent the tool icon SVG from shrinking when tool calls are long (#7488). Thanks, @mamei16.
  • Fix the wrong chat being deleted when the selection changes before the deletion is confirmed (#7483). Thanks, @lawrence3699.
  • Fix bos/eos tokens not being set for models without a chat template. Defaults are now reset before reading model metadata.
  • Fix duplicate BOS token being prepended in ExLlamav3.
  • Fix version metadata not syncing on Continue (#7492).
  • Fix row_split not working with ik_llama.cpp: --split-mode row is now converted to --split-mode graph (#7489).
  • Fix "Start reply with" crash (#7497). 🆕 - v4.5.1.
  • Fix tool responses with Gemma 4 template (#7498). 🆕 - v4.5.1.
  • UI: Fix consecutive thinking blocks rendering with Gemma 4. 🆕 - v4.5.1.
  • Fix bos/eos tokens being overwritten after GGUF metadata sets them (#7496). 🆕 - v4.5.2.
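
The row_split fix listed above amounts to rewriting one command-line value when the backend is ik_llama.cpp. A minimal sketch of that idea follows, assuming the server arguments are held in a list of strings. The function name `convert_split_mode` and its `backend` parameter are hypothetical and are not taken from the textgen source; only the two flag values come from the release note.

```python
def convert_split_mode(args, backend):
    """Return a copy of args with '--split-mode row' rewritten to 'graph'.

    Hypothetical helper for illustration only; the rewrite is applied
    solely when backend is "ik_llama.cpp".
    """
    out = list(args)
    if backend != "ik_llama.cpp":
        return out
    # Look at each flag/value pair and swap the value in place.
    for i in range(len(out) - 1):
        if out[i] == "--split-mode" and out[i + 1] == "row":
            out[i + 1] = "graph"
    return out
```

Returning a copy rather than mutating the input keeps the caller's original argument list available for other backends.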

Dependency updates


Portable builds

Below you can find self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip/extract, and run.

Note

NVIDIA GPU: If nvidia-smi reports CUDA Version >= 13.1, use the cuda13.1 build. Otherwise, use cuda12.4.
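
The rule above can be sketched as a small helper. This is an illustration under the assumption that the nvidia-smi banner contains a field of the form `CUDA Version: X.Y`; the function name `pick_cuda_build` is hypothetical, and the returned strings simply mirror the build names used in the note.

```python
import re


def pick_cuda_build(nvidia_smi_output):
    """Choose a portable build name from the text printed by nvidia-smi."""
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", nvidia_smi_output)
    if match is None:
        raise ValueError("no 'CUDA Version' field found in nvidia-smi output")
    # Compare as an integer tuple so that, e.g., 13.10 sorts after 13.2.
    version = (int(match.group(1)), int(match.group(2)))
    return "cuda13.1" if version >= (13, 1) else "cuda12.4"
```

One way to use it is to pass in the text captured from `subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout` on a machine with the NVIDIA driver installed.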

ik_llama.cpp is a llama.cpp fork with new quant types. If unsure, use the llama.cpp column.

Windows

| GPU/Platform | llama.cpp | ik_llama.cpp |
|---|---|---|
| NVIDIA (CUDA 12.4) | Download (774 MB) | Download (1.09 GB) |
| NVIDIA (CUDA 13.1) | Download (696 MB) | Download (1.19 GB) |
| AMD/Intel (Vulkan) | Download (209 MB) | |
| AMD (ROCm 7.2) | Download (517 MB) | |
| CPU only | Download (191 MB) | Download (192 MB) |

Linux

| GPU/Platform | llama.cpp | ik_llama.cpp |
|---|---|---|
| NVIDIA (CUDA 12.4) | Download (758 MB) | Download (1.09 GB) |
| NVIDIA (CUDA 13.1) | Download (710 MB) | Download (1.21 GB) |
| AMD/Intel (Vulkan) | Download (225 MB) | |
| AMD (ROCm 7.2) | Download (330 MB) | |
| CPU only | Download (207 MB) | Download (218 MB) |

macOS

| Architecture | llama.cpp |
|---|---|
| Apple Silicon (arm64) | Download (182 MB) |
| Intel (x86_64) | Download (188 MB) |

Updating a portable install:

  1. Download and extract the latest version.
  2. Replace the user_data folder in the new install with the one from your existing install. All your settings and models will carry over.
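
The two steps above can be scripted. The sketch below is one possible approach, not an official updater: `carry_over_user_data` is a hypothetical name, both arguments are paths you supply, and it copies rather than moves so the old install stays usable until you delete it.

```python
import shutil
from pathlib import Path


def carry_over_user_data(old_install, new_install):
    """Replace new_install/user_data with a copy of old_install/user_data."""
    old_data = Path(old_install) / "user_data"
    new_data = Path(new_install) / "user_data"
    if not old_data.is_dir():
        raise FileNotFoundError(f"{old_data} does not exist")
    # Drop the fresh, empty user_data that ships with the new build.
    if new_data.exists():
        shutil.rmtree(new_data)
    shutil.copytree(old_data, new_data)
    return new_data
```

Copying can take a while and needs extra disk space if user_data holds large models; moving the folder instead avoids both at the cost of leaving the old install without its data.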

Starting with 4.0, you can also move user_data one folder up, next to the install folder. It will be detected automatically, making updates easier:

text-generation-webui-4.0/
text-generation-webui-4.1/
user_data/                    <-- shared by both installs
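
Switching an existing install to this shared layout is a single folder move. A minimal sketch, assuming the install directory currently contains its own user_data; `share_user_data` is a hypothetical helper name, and it refuses to overwrite a shared folder that already exists.

```python
import shutil
from pathlib import Path


def share_user_data(install_dir):
    """Move install_dir/user_data one level up, next to the install folder."""
    install = Path(install_dir)
    source = install / "user_data"
    target = install.parent / "user_data"
    if not source.is_dir():
        raise FileNotFoundError(f"{source} does not exist")
    if target.exists():
        raise FileExistsError(f"{target} already exists; merge it by hand")
    shutil.move(str(source), str(target))
    return target
```

For example, `share_user_data("text-generation-webui-4.0")` would leave user_data as a sibling of the install folder, matching the tree shown above.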