Release v4.6.2 · oobabooga/textgen

oobabooga · 2026-04-23 · via Tags from textgen

Changes

Tool call confirmation: Add inline approve/reject/always-approve buttons that appear before each tool call is executed. Enable via the new "Confirm tool calls" checkbox in the Chat tab.
Stdio MCP server support: In addition to HTTP MCP servers, you can now configure local subprocess-based MCP servers via user_data/mcp.json, using the same format as Claude Desktop and Cursor. [Tutorial]
preserve_thinking chat template parameter: New UI checkbox and --preserve-thinking CLI flag to control whether thinking blocks from prior turns are kept in the context.
UI: Sidebars overhaul: Sidebars now toggle independently and persist their state on page refresh. Default visibility adapts to viewport width.
llama.cpp: Pass --draft-min 48 by default for draftless speculative decoding.
Only show the "Reasoning effort" and "Enable thinking" controls for models whose chat template actually uses them.
Cache MCP tool discovery to avoid re-querying servers on each generation.
Add model download branch handling in download_model_wrapper (#7506). Thanks, @Th-Underscore.
UI: Improve border colors in light theme, fix code block copy button colors and centering, fix code block scrollbar flash during page load, improve past chats menu spacing.
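For the stdio MCP server entry above, here is a minimal sketch of what a user_data/mcp.json file might contain, written as a small Python script that generates it. It assumes the `mcpServers` layout commonly used by Claude Desktop and Cursor (`command`, `args`, and optional `env` per server). The server name `my-local-tools`, the script `my_mcp_server.py`, and the environment variable are hypothetical placeholders, not values from the release.

```python
import json
from pathlib import Path


def build_mcp_config() -> dict:
    """Return a config dict in the Claude Desktop / Cursor `mcpServers` layout."""
    return {
        "mcpServers": {
            # "my-local-tools" is a hypothetical server name.
            "my-local-tools": {
                # Command and arguments used to launch the server as a subprocess.
                "command": "python",
                "args": ["my_mcp_server.py"],  # hypothetical script
                # Optional environment variables passed to the subprocess.
                "env": {"EXAMPLE_API_KEY": "replace-me"},
            }
        }
    }


def write_mcp_config(path: Path) -> None:
    """Write the config as pretty-printed JSON, creating parent folders if needed."""
    path.parent.mkdir(parents=True, exist_ok=True)
    text = json.dumps(build_mcp_config(), indent=2) + "\n"
    path.write_text(text, encoding="utf-8")
```

Calling `write_mcp_config(Path("user_data") / "mcp.json")` from the install folder would produce the file. Check the linked tutorial for the exact keys textgen accepts before relying on this layout.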

Security

Fix SSRF vulnerabilities in URL fetching: add backslash and userinfo rejection, validate every redirect hop.
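The three checks named above (backslash rejection, userinfo rejection, and validating every redirect hop) can be illustrated with a standard-library sketch. This is not textgen's actual code: the function names `is_safe_url` and `follow_redirects`, the injected `fetch_once` callable, and the extra rule that blocks non-global IP addresses are all assumptions made for illustration.

```python
import ipaddress
import socket
from urllib.parse import urljoin, urlsplit

REDIRECT_CODES = {301, 302, 303, 307, 308}


def resolve_host(host):
    """Resolve a hostname to a list of IP address strings."""
    return [info[4][0] for info in socket.getaddrinfo(host, None)]


def is_safe_url(url, resolver=resolve_host):
    """Return True only if the URL passes every check below."""
    # Backslashes are parsed differently by different URL libraries: reject them.
    if "\\" in url:
        return False
    try:
        parts = urlsplit(url)
        host = parts.hostname
    except ValueError:
        return False
    if parts.scheme not in ("http", "https") or not host:
        return False
    # Reject userinfo ("user:pass@host"), which can disguise the real host.
    if "@" in parts.netloc:
        return False
    try:
        addresses = [ipaddress.ip_address(a) for a in resolver(host)]
    except (OSError, ValueError):
        return False
    # Assumption: only globally routable addresses are acceptable targets.
    return bool(addresses) and all(a.is_global for a in addresses)


def follow_redirects(url, fetch_once, resolver=resolve_host, max_hops=5):
    """Follow redirects manually, validating each hop before requesting it.

    `fetch_once(url)` is a caller-supplied function that performs ONE request
    without following redirects and returns (status, location_header, body).
    """
    for _ in range(max_hops + 1):
        if not is_safe_url(url, resolver):
            raise ValueError(f"blocked URL: {url}")
        status, location, body = fetch_once(url)
        if status in REDIRECT_CODES and location:
            url = urljoin(url, location)  # resolve relative redirect targets
            continue
        return body
    raise ValueError("too many redirects")
```

Validating only the first URL is not enough, because a harmless-looking public page can redirect to an internal address. That is why the loop re-checks the target on every hop. A production implementation would also need to connect to the address it validated, since a separate DNS lookup at request time can return a different result.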

Bug fixes

Fix Gemma 4 thinking tags not hidden after tool calls (#7509).
Fix GPT-OSS channel tokens leaking in UI after tool calls.
Fix Slider preprocess not handling None from cleared number input. 🆕 - v4.6.1.
llama.cpp: Fix multimodal by using server's random media marker. 🆕 - v4.6.1.

Dependency updates

Update llama.cpp to ggml-org/llama.cpp@6217b49
Update ik_llama.cpp to ikawrakow/ik_llama.cpp@286ce32
Update ExLlamaV3 to 0.0.30

Portable builds

Below you can find self-contained packages that work with GGUF models (llama.cpp) and require no installation! Just download the right version for your system, unzip/extract, and run.

Note

NVIDIA GPU: If nvidia-smi reports CUDA Version >= 13.1, use the cuda13.1 build. Otherwise, use cuda12.4.

ik_llama.cpp is a llama.cpp fork with new quant types. If unsure, use the llama.cpp column.
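The NVIDIA rule in the note can be written as a small helper. This is a sketch that assumes you pass in the "CUDA Version" value shown in the nvidia-smi header as a `major.minor` string. The function name `pick_cuda_build` is hypothetical; the build names come from the note above.

```python
import re


def pick_cuda_build(cuda_version: str) -> str:
    """Map a 'major.minor' CUDA version string to a portable build name."""
    match = re.fullmatch(r"\s*(\d+)\.(\d+)\s*", cuda_version)
    if match is None:
        raise ValueError(f"unrecognized CUDA version: {cuda_version!r}")
    major, minor = int(match.group(1)), int(match.group(2))
    # Compare as integer tuples rather than floats, so "13.10" is not read as 13.1.
    return "cuda13.1" if (major, minor) >= (13, 1) else "cuda12.4"
```

For example, `pick_cuda_build("12.8")` and `pick_cuda_build("13.0")` both select the cuda12.4 build, while `pick_cuda_build("13.1")` selects cuda13.1.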

Windows

GPU/Platform	llama.cpp	ik_llama.cpp
NVIDIA (CUDA 12.4)	Download (766 MB)	Download (1.1 GB)
NVIDIA (CUDA 13.1)	Download (686 MB)	Download (1.19 GB)
AMD/Intel (Vulkan)	Download (196 MB)	—
AMD (ROCm 7.2)	Download (499 MB)	—
CPU only	Download (178 MB)	Download (194 MB)

Linux

GPU/Platform	llama.cpp	ik_llama.cpp
NVIDIA (CUDA 12.4)	Download (747 MB)	Download (1.09 GB)
NVIDIA (CUDA 13.1)	Download (696 MB)	Download (1.21 GB)
AMD/Intel (Vulkan)	Download (208 MB)	—
AMD (ROCm 7.2)	Download (307 MB)	—
CPU only	Download (190 MB)	Download (217 MB)

macOS

Architecture	llama.cpp
Apple Silicon (arm64)	Download (156 MB)
Intel (x86_64)	Download (162 MB)

Updating a portable install:

Download and extract the latest version.
Replace the user_data folder in the new version with the one from your existing install. All your settings and models will carry over.

Starting with 4.0, you can also move user_data one folder up, next to the install folder. It will be detected automatically, making updates easier:

text-generation-webui-4.0/
text-generation-webui-4.1/
user_data/                    <-- shared by both installs
