feat: add xai media providers · openclaw/openclaw@f342da5
KateWilkins · 2026-04-23 · via Recent Commits to openclaw:main

@@ -63,6 +63,32 @@ they follow the same API shape.
 current image-capable Grok refs in the bundled catalog.
 </Tip>
 
+## OpenClaw feature coverage
+
+The bundled plugin maps xAI's current public API surface onto OpenClaw's shared
+provider and tool contracts where the behavior fits cleanly.
+
+| xAI capability             | OpenClaw surface                       | Status                                                              |
+| -------------------------- | -------------------------------------- | ------------------------------------------------------------------- |
+| Chat / Responses           | `xai/<model>` model provider           | Yes                                                                 |
+| Server-side web search     | `web_search` provider `grok`           | Yes                                                                 |
+| Server-side X search       | `x_search` tool                        | Yes                                                                 |
+| Server-side code execution | `code_execution` tool                  | Yes                                                                 |
+| Images                     | `image_generate`                       | Yes                                                                 |
+| Videos                     | `video_generate`                       | Yes                                                                 |
+| Batch text-to-speech       | `messages.tts.provider: "xai"` / `tts` | Yes                                                                 |
+| Streaming TTS              |                                        | Not exposed; OpenClaw's TTS contract returns complete audio buffers |
+| Speech-to-text             |                                        | Not exposed yet; needs a transcription provider surface             |
+| Realtime voice             |                                        | Not exposed yet; different session/WebSocket contract               |
+| Files / batches            | Generic model API compatibility only   | Not a first-class OpenClaw tool                                     |
+
+<Note>
+OpenClaw uses xAI's REST image/video/TTS APIs for media generation and the
+Responses API for model, search, and code-execution tools. Features that need
+new OpenClaw contracts, such as streaming STT or Realtime voice sessions, are
+documented here as upstream capabilities rather than hidden plugin behavior.
+</Note>
+
 ### Fast-mode mappings
 
 `/fast on` or `agents.defaults.models["xai/<model>"].params.fastMode: true`
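
The coverage table added in this hunk can also be read as data. The following is a minimal sketch, not OpenClaw code: `CoverageRow`, `XAI_COVERAGE`, and `unexposedCapabilities` are hypothetical names, and the rows are copied from the table as written in the diff.

```typescript
// Illustrative only: these names are hypothetical, not OpenClaw APIs.
// Row data is copied from the "OpenClaw feature coverage" table in the diff.
type CoverageStatus = "yes" | "not-exposed" | "compat-only";

interface CoverageRow {
  capability: string;
  surface: string | null; // null where the table lists no OpenClaw surface
  status: CoverageStatus;
}

const XAI_COVERAGE: CoverageRow[] = [
  { capability: "Chat / Responses", surface: "xai/<model> model provider", status: "yes" },
  { capability: "Server-side web search", surface: "web_search provider grok", status: "yes" },
  { capability: "Server-side X search", surface: "x_search tool", status: "yes" },
  { capability: "Server-side code execution", surface: "code_execution tool", status: "yes" },
  { capability: "Images", surface: "image_generate", status: "yes" },
  { capability: "Videos", surface: "video_generate", status: "yes" },
  { capability: "Batch text-to-speech", surface: "messages.tts.provider: \"xai\" / tts", status: "yes" },
  { capability: "Streaming TTS", surface: null, status: "not-exposed" },
  { capability: "Speech-to-text", surface: null, status: "not-exposed" },
  { capability: "Realtime voice", surface: null, status: "not-exposed" },
  { capability: "Files / batches", surface: "Generic model API compatibility only", status: "compat-only" },
];

// Capabilities that xAI offers upstream but that have no OpenClaw surface.
function unexposedCapabilities(rows: CoverageRow[]): string[] {
  return rows
    .filter((row) => row.status === "not-exposed")
    .map((row) => row.capability);
}
```

Calling `unexposedCapabilities(XAI_COVERAGE)` lists the three rows the `<Note>` describes as upstream-only capabilities.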

@@ -103,12 +129,17 @@ Legacy aliases still normalize to the canonical bundled ids:
 `video_generate` tool.
 
 - Default video model: `xai/grok-imagine-video`
-- Modes: text-to-video, image-to-video, and remote video edit/extend flows
-- Supports `aspectRatio` and `resolution`
+- Modes: text-to-video, image-to-video, remote video edit, and remote video
+  extension
+- Aspect ratios: `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `3:2`, `2:3`
+- Resolutions: `480P`, `720P`
+- Duration: 1-15 seconds for generation/image-to-video, 2-10 seconds for
+  extension
 
 <Warning>
 Local video buffers are not accepted. Use remote `http(s)` URLs for
-video-reference and edit inputs.
+video edit/extend inputs. Image-to-video accepts local image buffers because
+OpenClaw can encode those as data URLs for xAI.
 </Warning>
 
 To use xAI as the default video provider:
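
The limits this hunk documents for `video_generate` can be checked before a request is sent. This is a sketch under stated assumptions: `VideoRequest`, `VideoMode`, and `validateVideoRequest` are hypothetical names, the field `durationSeconds` is invented for illustration, and only the allowed values come from the diff. The diff gives no duration range for remote video edit, so none is enforced here.

```typescript
// Hypothetical validator; only the allowed values are taken from the diff.
const VIDEO_ASPECT_RATIOS: readonly string[] = ["1:1", "16:9", "9:16", "4:3", "3:4", "3:2", "2:3"];
const VIDEO_RESOLUTIONS: readonly string[] = ["480P", "720P"];

type VideoMode = "generate" | "imageToVideo" | "edit" | "extend";

// Inclusive [min, max] seconds. "edit" is absent because the docs state no range for it.
const DURATION_RANGE_SECONDS: { [mode: string]: [number, number] | undefined } = {
  generate: [1, 15],
  imageToVideo: [1, 15],
  extend: [2, 10],
};

interface VideoRequest {
  mode: VideoMode;
  aspectRatio?: string;
  resolution?: string;
  durationSeconds?: number; // hypothetical field name
}

// Returns a list of human-readable problems; an empty list means the request fits the documented limits.
function validateVideoRequest(req: VideoRequest): string[] {
  const errors: string[] = [];
  if (req.aspectRatio !== undefined && VIDEO_ASPECT_RATIOS.indexOf(req.aspectRatio) === -1) {
    errors.push(`unsupported aspectRatio: ${req.aspectRatio}`);
  }
  if (req.resolution !== undefined && VIDEO_RESOLUTIONS.indexOf(req.resolution) === -1) {
    errors.push(`unsupported resolution: ${req.resolution}`);
  }
  const range = DURATION_RANGE_SECONDS[req.mode];
  if (range !== undefined && req.durationSeconds !== undefined) {
    const [min, max] = range;
    if (req.durationSeconds < min || req.durationSeconds > max) {
      errors.push(`duration must be ${min}-${max} seconds for mode ${req.mode}`);
    }
  }
  return errors;
}
```

Returning a list rather than throwing lets a caller report every violated limit at once.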

@@ -132,6 +163,82 @@ Legacy aliases still normalize to the canonical bundled ids:
 
 </Accordion>
 
+<Accordion title="Image generation">
+The bundled `xai` plugin registers image generation through the shared
+`image_generate` tool.
+
+- Default image model: `xai/grok-imagine-image`
+- Additional model: `xai/grok-imagine-image-pro`
+- Modes: text-to-image and reference-image edit
+- Reference inputs: one `image` or up to five `images`
+- Aspect ratios: `1:1`, `16:9`, `9:16`, `4:3`, `3:4`, `2:3`, `3:2`
+- Resolutions: `1K`, `2K`
+- Count: up to 4 images
+
+OpenClaw asks xAI for `b64_json` image responses so generated media can be
+stored and delivered through the normal channel attachment path. Local
+reference images are converted to data URLs; remote `http(s)` references are
+passed through.
+
+To use xAI as the default image provider:
+
+```json5
+{
+  agents: {
+    defaults: {
+      imageGenerationModel: {
+        primary: "xai/grok-imagine-image",
+      },
+    },
+  },
+}
+```
+
+<Note>
+xAI also documents `quality`, `mask`, `user`, and additional native ratios
+such as `1:2`, `2:1`, `9:20`, and `20:9`. OpenClaw forwards only the
+shared cross-provider image controls today; unsupported native-only knobs
+are intentionally not exposed through `image_generate`.
+</Note>
+
+</Accordion>
+
+<Accordion title="Text-to-speech">
+The bundled `xai` plugin registers text-to-speech through the shared `tts`
+provider surface.
+
+- Voices: `eve`, `ara`, `rex`, `sal`, `leo`, `una`
+- Default voice: `eve`
+- Formats: `mp3`, `wav`, `pcm`, `mulaw`, `alaw`
+- Language: BCP-47 code or `auto`
+- Speed: provider-native speed override
+- Native Opus voice-note format is not supported
+
+To use xAI as the default TTS provider:
+
+```json5
+{
+  messages: {
+    tts: {
+      provider: "xai",
+      providers: {
+        xai: {
+          voiceId: "eve",
+        },
+      },
+    },
+  },
+}
+```
+
+<Note>
+OpenClaw uses xAI's batch `/v1/tts` endpoint. xAI also offers streaming TTS
+over WebSocket, but the OpenClaw speech provider contract currently expects
+a complete audio buffer before reply delivery.
+</Note>
+
+</Accordion>
+
 <Accordion title="x_search configuration">
 The bundled xAI plugin exposes `x_search` as an OpenClaw tool for searching
 X (formerly Twitter) content via Grok.
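
The image-generation text in this hunk says local reference images become data URLs while remote `http(s)` references pass through unchanged. A minimal sketch of that rule follows. The names `ImageReference`, `toReferenceUrl`, and `base64Encode` are hypothetical and do not describe OpenClaw's actual implementation. The encoder is written out by hand only to keep the example dependency-free; in Node, `Buffer.from(bytes).toString("base64")` gives the same result.

```typescript
// Hypothetical types and helpers illustrating the local-vs-remote reference rule.
type ImageReference =
  | { kind: "remote"; url: string }
  | { kind: "local"; bytes: Uint8Array; mimeType: string };

const B64_ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";

// Standard base64 with "=" padding, processing three input bytes per step.
function base64Encode(bytes: Uint8Array): string {
  let out = "";
  for (let i = 0; i < bytes.length; i += 3) {
    const b0 = bytes[i];
    const b1 = i + 1 < bytes.length ? bytes[i + 1] : 0;
    const b2 = i + 2 < bytes.length ? bytes[i + 2] : 0;
    out += B64_ALPHABET.charAt(b0 >> 2);
    out += B64_ALPHABET.charAt(((b0 & 3) << 4) | (b1 >> 4));
    out += i + 1 < bytes.length ? B64_ALPHABET.charAt(((b1 & 15) << 2) | (b2 >> 6)) : "=";
    out += i + 2 < bytes.length ? B64_ALPHABET.charAt(b2 & 63) : "=";
  }
  return out;
}

// Remote http(s) references are returned as-is; local bytes are wrapped in a data URL.
function toReferenceUrl(ref: ImageReference): string {
  if (ref.kind === "remote") {
    if (!/^https?:\/\//i.test(ref.url)) {
      throw new Error(`remote references must be http(s) URLs: ${ref.url}`);
    }
    return ref.url;
  }
  return `data:${ref.mimeType};base64,${base64Encode(ref.bytes)}`;
}
```

The same encoding step is what lets image-to-video accept local image buffers, while video edit and extend inputs still need remote URLs.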

@@ -209,6 +316,12 @@ Legacy aliases still normalize to the canonical bundled ids:
 - `grok-4.20-multi-agent-experimental-beta-0304` is not supported on the
   normal xAI provider path because it requires a different upstream API
   surface than the standard OpenClaw xAI transport.
+- xAI STT and Realtime voice are not registered as OpenClaw providers yet.
+  They require transcription/session contracts rather than the existing
+  batch TTS provider shape.
+- xAI image `quality`, image `mask`, and extra native-only aspect ratios are
+  not exposed until the shared `image_generate` tool has corresponding
+  cross-provider controls.
 </Accordion>
 
 <Accordion title="Advanced notes">
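
The "batch TTS provider shape" named in this hunk can be pictured as an interface whose single call resolves only once the whole clip exists. That is why streaming TTS, STT, and Realtime voice do not fit it. The sketch below is an assumption-heavy illustration: `BatchTtsProvider`, `TtsRequest`, and `normalizeTtsRequest` are hypothetical, and OpenClaw's real speech provider contract does not appear in this diff. Only the voice list, default voice, and format list come from the documentation text.

```typescript
// Hypothetical shapes; OpenClaw's real speech provider contract is not shown in this diff.
const XAI_VOICES: readonly string[] = ["eve", "ara", "rex", "sal", "leo", "una"];
const XAI_FORMATS: readonly string[] = ["mp3", "wav", "pcm", "mulaw", "alaw"];
const XAI_DEFAULT_VOICE = "eve";

interface TtsRequest {
  text: string;
  voiceId?: string; // falls back to the documented default voice
  format: string; // the docs state no default format, so callers must choose
}

// Batch shape: one request in, one complete audio buffer out.
// A streaming or session-based API would need a different return type.
interface BatchTtsProvider {
  synthesize(req: TtsRequest): Promise<Uint8Array>;
}

// Applies the default voice and rejects values outside the documented lists.
function normalizeTtsRequest(req: TtsRequest): { text: string; voiceId: string; format: string } {
  const voiceId = req.voiceId ?? XAI_DEFAULT_VOICE;
  if (XAI_VOICES.indexOf(voiceId) === -1) {
    throw new Error(`unsupported voice: ${voiceId}`);
  }
  if (XAI_FORMATS.indexOf(req.format) === -1) {
    throw new Error(`unsupported format: ${req.format}`);
  }
  return { text: req.text, voiceId, format: req.format };
}
```

For example, a request for `opus` output is rejected up front, matching the documented statement that the native Opus voice-note format is not supported.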

@@ -229,6 +342,23 @@ Legacy aliases still normalize to the canonical bundled ids:
 </Accordion>
 </AccordionGroup>
 
+## Live testing
+
+The xAI media paths are covered by unit tests and opt-in live suites. The live
+commands load secrets from your login shell, including `~/.profile`, before
+probing `XAI_API_KEY`.
+
+```bash
+pnpm test extensions/xai
+OPENCLAW_LIVE_TEST=1 OPENCLAW_LIVE_TEST_QUIET=1 pnpm test:live -- extensions/xai/xai.live.test.ts
+OPENCLAW_LIVE_TEST=1 OPENCLAW_LIVE_TEST_QUIET=1 OPENCLAW_LIVE_IMAGE_GENERATION_PROVIDERS=xai pnpm test:live -- test/image-generation.runtime.live.test.ts
+```
+
+The provider-specific live file synthesizes normal TTS, telephony-friendly PCM
+TTS, text-to-image generation, and reference-image editing. The shared image
+live file verifies the same xAI provider through OpenClaw's runtime selection,
+fallback, normalization, and media attachment path.
+
 ## Related
 
 <CardGroup cols={2}>
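
The opt-in behavior described under "Live testing" suggests a gate of roughly the following form. This is a guess at the logic, not the repository's test harness: `shouldRunXaiLive` and `imageProviderSelected` are hypothetical helpers, and treating `OPENCLAW_LIVE_IMAGE_GENERATION_PROVIDERS` as a comma-separated list where unset means "no filter" is an assumption. Only the environment variable names come from the commands in the diff.

```typescript
// Hypothetical gate for opt-in live suites; only the variable names come from the diff.
type Env = Record<string, string | undefined>;

// Live tests run only when explicitly opted in and an API key is present.
function shouldRunXaiLive(env: Env): boolean {
  const optedIn = env["OPENCLAW_LIVE_TEST"] === "1";
  const key = env["XAI_API_KEY"];
  const hasKey = key !== undefined && key.trim().length > 0;
  return optedIn && hasKey;
}

// Assumption: the providers variable is a comma-separated allow-list,
// and an unset or empty value means every provider is selected.
function imageProviderSelected(env: Env, provider: string): boolean {
  const raw = env["OPENCLAW_LIVE_IMAGE_GENERATION_PROVIDERS"];
  if (raw === undefined || raw.trim() === "") {
    return true;
  }
  const selected = raw.split(",").map((name) => name.trim());
  return selected.indexOf(provider) !== -1;
}
```

In a Node test file the gate would typically be called with `process.env`. Passing the environment in as a plain object keeps the helpers easy to unit test.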