Release v4.6.2 · oobabooga/textgen
oobabooga · 2026-04-23 · via Tags from textgen

Changes

  • Tool call confirmation: Add inline approve/reject/always-approve buttons that appear before each tool call is executed. Enable via the new "Confirm tool calls" checkbox in the Chat tab.
  • Stdio MCP server support: In addition to HTTP MCP servers, you can now configure local subprocess-based MCP servers via user_data/mcp.json, using the same format as Claude Desktop and Cursor. [Tutorial]
  • preserve_thinking chat template parameter: New UI checkbox and --preserve-thinking CLI flag to control whether thinking blocks from prior turns are kept in the context.
  • UI: Sidebars overhaul: Sidebars now toggle independently and persist their state on page refresh. Default visibility adapts to viewport width.
  • llama.cpp: Pass --draft-min 48 by default for draftless speculative decoding.
  • Only show the "Reasoning effort" and "Enable thinking" controls for models whose chat template actually uses them.
  • Cache MCP tool discovery to avoid re-querying servers on each generation.
  • Add model download branch handling in download_model_wrapper (#7506). Thanks, @Th-Underscore.
  • UI: Improve border colors in light theme, fix code block copy button colors and centering, fix code block scrollbar flash during page load, improve past chats menu spacing.
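For the stdio MCP server entry above, here is a minimal sketch of what a user_data/mcp.json could contain, assuming the `mcpServers` layout used by Claude Desktop. The server label `filesystem`, the `npx` command, and the directory argument are illustrative placeholders, not taken from this release; the linked tutorial is the authority on which fields textgen actually reads.

```python
import json
from pathlib import Path


def build_mcp_config(allowed_dir: str) -> dict:
    """Return a Claude Desktop-style config describing one stdio server."""
    return {
        "mcpServers": {
            # "filesystem" is an arbitrary label chosen for this example.
            "filesystem": {
                # The server is launched as a local subprocess.
                "command": "npx",
                "args": ["-y", "@modelcontextprotocol/server-filesystem", allowed_dir],
            }
        }
    }


def write_mcp_config(user_data: Path, allowed_dir: str) -> Path:
    """Write the config to <user_data>/mcp.json and return the file path."""
    user_data.mkdir(parents=True, exist_ok=True)
    target = user_data / "mcp.json"
    target.write_text(
        json.dumps(build_mcp_config(allowed_dir), indent=2), encoding="utf-8"
    )
    return target
```

Calling `write_mcp_config(Path("user_data"), "/path/to/share")` would produce the file; editing the JSON by hand works just as well.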

Security

  • Fix SSRF vulnerabilities in URL fetching: add backslash and userinfo rejection, validate every redirect hop.
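The checks named in this fix can be sketched as follows. This is an illustration of the general technique, not textgen's actual code: `validate_url`, `follow_redirects`, and the `fetch_once` callable are hypothetical names, and the rejection of non-public IP addresses is an assumption added here because it is a common SSRF guard, not something the release note states.

```python
import ipaddress
import socket
from urllib.parse import urljoin, urlsplit


def resolve_host(host: str) -> list[str]:
    """Resolve a hostname to its IP addresses using the system resolver."""
    return [info[4][0] for info in socket.getaddrinfo(host, None)]


def validate_url(url: str, resolver=resolve_host) -> str:
    """Raise ValueError if the URL looks unsafe to fetch; return it otherwise."""
    if "\\" in url:
        # Parsers disagree on backslashes, so reject them outright.
        raise ValueError("backslash in URL")
    parts = urlsplit(url)
    if parts.scheme not in ("http", "https"):
        raise ValueError("unsupported scheme")
    if "@" in parts.netloc:
        # user:password@host can disguise the real destination.
        raise ValueError("userinfo in URL")
    if not parts.hostname:
        raise ValueError("missing host")
    for addr in resolver(parts.hostname):
        # Assumption: loopback, private, and link-local targets are refused.
        if not ipaddress.ip_address(addr).is_global:
            raise ValueError(f"non-public address {addr}")
    return url


def follow_redirects(url, fetch_once, resolver=resolve_host, max_hops=5):
    """Follow redirects manually, validating every hop before fetching it.

    fetch_once(url) is a hypothetical callable that performs ONE request
    without following redirects and returns (status, location_or_None, body).
    """
    for _ in range(max_hops + 1):
        validate_url(url, resolver)
        status, location, body = fetch_once(url)
        if status in (301, 302, 303, 307, 308) and location:
            url = urljoin(url, location)  # resolve relative Location headers
            continue
        return body
    raise ValueError("too many redirects")
```

Validating each hop matters because a public URL can redirect to an internal one. A check like this still leaves a gap between DNS resolution and the actual connection, so a production fetcher would also need to pin the resolved address.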

Bug fixes

  • Fix Gemma 4 thinking tags not hidden after tool calls (#7509).
  • Fix GPT-OSS channel tokens leaking in UI after tool calls.
  • Fix Slider preprocess not handling None from cleared number input. 🆕 - v4.6.1.
  • llama.cpp: Fix multimodal by using server's random media marker. 🆕 - v4.6.1.

Dependency updates

Portable builds

Below you can find self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip/extract, and run.

Note

NVIDIA GPU: If nvidia-smi reports CUDA Version >= 13.1, use the cuda13.1 build. Otherwise, use cuda12.4.

ik_llama.cpp is a llama.cpp fork with new quant types. If unsure, use the llama.cpp column.
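The CUDA rule in the note above can be expressed as a small helper. This is a sketch that assumes the `nvidia-smi` header contains a string of the form `CUDA Version: 12.4`; the function name `pick_cuda_build` is made up for this example.

```python
import re


def pick_cuda_build(nvidia_smi_output: str) -> str:
    """Return "cuda13.1" or "cuda12.4" based on the reported CUDA version."""
    match = re.search(r"CUDA Version:\s*(\d+)\.(\d+)", nvidia_smi_output)
    if match is None:
        raise ValueError("no CUDA version found in nvidia-smi output")
    # Compare as integer tuples so that e.g. 12.10 is not misread as 12.1.
    version = (int(match.group(1)), int(match.group(2)))
    return "cuda13.1" if version >= (13, 1) else "cuda12.4"
```

To feed it real output, pass `subprocess.run(["nvidia-smi"], capture_output=True, text=True).stdout`.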

Windows

| GPU/Platform | llama.cpp | ik_llama.cpp |
|---|---|---|
| NVIDIA (CUDA 12.4) | Download (766 MB) | Download (1.1 GB) |
| NVIDIA (CUDA 13.1) | Download (686 MB) | Download (1.19 GB) |
| AMD/Intel (Vulkan) | Download (196 MB) | |
| AMD (ROCm 7.2) | Download (499 MB) | |
| CPU only | Download (178 MB) | Download (194 MB) |

Linux

| GPU/Platform | llama.cpp | ik_llama.cpp |
|---|---|---|
| NVIDIA (CUDA 12.4) | Download (747 MB) | Download (1.09 GB) |
| NVIDIA (CUDA 13.1) | Download (696 MB) | Download (1.21 GB) |
| AMD/Intel (Vulkan) | Download (208 MB) | |
| AMD (ROCm 7.2) | Download (307 MB) | |
| CPU only | Download (190 MB) | Download (217 MB) |

macOS

| Architecture | llama.cpp |
|---|---|
| Apple Silicon (arm64) | Download (156 MB) |
| Intel (x86_64) | Download (162 MB) |

Updating a portable install:

  1. Download and extract the latest version.
  2. Replace the user_data folder in the new version with the one from your existing install. All your settings and models carry over.

Starting with 4.0, you can also move user_data one folder up, next to the install folder. It will be detected automatically, making updates easier:

text-generation-webui-4.0/
text-generation-webui-4.1/
user_data/                    <-- shared by both installs
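Moving user_data into the shared location shown above is a single folder move. The sketch below assumes the layout in the listing; the helper name `share_user_data` is hypothetical, and the function refuses to overwrite an existing shared folder. How a new install treats its own bundled user_data folder when a shared one exists is not covered here.

```python
import shutil
from pathlib import Path


def share_user_data(install_dir: Path) -> Path:
    """Move <install_dir>/user_data one folder up, next to the install folder."""
    source = install_dir / "user_data"
    target = install_dir.parent / "user_data"
    if target.exists():
        # Never clobber an existing shared folder; merging is a manual step.
        raise FileExistsError(f"{target} already exists; merge it by hand")
    if not source.is_dir():
        raise FileNotFoundError(f"{source} not found")
    shutil.move(str(source), str(target))
    return target
```

For example, `share_user_data(Path("text-generation-webui-4.0"))` would leave `user_data/` beside the install folder, where later versions extracted into the same parent directory can detect it.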