惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Martin Fowler
Martin Fowler
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
T
Threat Research - Cisco Blogs
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
C
Cyber Attacks, Cyber Crime and Cyber Security
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
T
Troy Hunt's Blog
V
V2EX - 技术
Hacker News - Newest:
Hacker News - Newest: "LLM"
H
Heimdal Security Blog
T
Tor Project blog
IT之家
IT之家
Project Zero
Project Zero
GbyAI
GbyAI
Security Latest
Security Latest
S
Security Archives - TechRepublic
人人都是产品经理
人人都是产品经理
大猫的无限游戏
大猫的无限游戏
Spread Privacy
Spread Privacy
S
Security Affairs
A
Arctic Wolf
C
Cybersecurity and Infrastructure Security Agency CISA
I
Intezer
P
Palo Alto Networks Blog
宝玉的分享
宝玉的分享
Google DeepMind News
Google DeepMind News
T
Threatpost
I
InfoQ
F
Full Disclosure
Blog — PlanetScale
Blog — PlanetScale
Last Week in AI
Last Week in AI
Cisco Talos Blog
Cisco Talos Blog
N
Netflix TechBlog - Medium
MyScale Blog
MyScale Blog
H
Help Net Security
S
Securelist
Y
Y Combinator Blog
月光博客
月光博客
博客园_首页
Engineering at Meta
Engineering at Meta
酷 壳 – CoolShell
酷 壳 – CoolShell
J
Java Code Geeks
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
A
About on SuperTechFans
K
Kaspersky official blog
Microsoft Azure Blog
Microsoft Azure Blog
Vercel News
Vercel News
阮一峰的网络日志
阮一峰的网络日志
T
The Exploit Database - CXSecurity.com
B
Blog

Recent Commits to openclaw:main

test: merge chat side-result checks · openclaw/openclaw@ddd2c2a test: merge cron history checks · openclaw/openclaw@f7eb746 test: merge responsive navigation shell checks · openclaw/openclaw@c2e4b47 docs(changelog): add codex oauth fixes · openclaw/openclaw@628e6cd test: merge navigation routing cases · openclaw/openclaw@5d8cecb Tests: mock channel registry bundled fallback · openclaw/openclaw@2b08233 Secrets: avoid broad web search discovery for single plugin config · openclaw/openclaw@a464f59 test: merge config view browser checks · openclaw/openclaw@20cf511 fix(status): align oauth health with runtime · openclaw/openclaw@eed7116 feat: add macOS screen snapshots for monitor preview (#67954) thanks … · openclaw/openclaw@f377db1 fix: report shared auth scopes in hello-ok (#67810) thanks @BunsDev · openclaw/openclaw@0b6c39b Auto-reply: avoid eager bundled route fallback · openclaw/openclaw@3ea1bf4 Tests: narrow session binding contract setup · openclaw/openclaw@54e4e16 fix(macOS): enable undo/redo in webchat composer text input (#34962) · openclaw/openclaw@00951dc Tests: speed up channel setup promotion · openclaw/openclaw@82b529a Docs: refresh agent instructions · openclaw/openclaw@5775fe2 fix(auth): serialize OAuth refresh across agents to fix #26322 (#67876) · openclaw/openclaw@8e79080 test: allow ollama public surface boundary test · openclaw/openclaw@7d4f1a6 Docs: add test performance guardrails · openclaw/openclaw@89706d3 Tests: restore context-engine usage proof · openclaw/openclaw@e4c4f95 Tests: slim context engine runtime coverage · openclaw/openclaw@74c198f ci: retry failed custom checkouts · openclaw/openclaw@0ee5baf test: trim duplicate provider auth onboarding cases · openclaw/openclaw@1ffc02e matrix: fix sessions_spawn --thread subagent session spawning (#67643) · openclaw/openclaw@1ce2596 test: reduce auth choice fixture churn · openclaw/openclaw@857b9cd test: mock health status config boundaries · openclaw/openclaw@9d5ab4a test: mock onboard config io boundary · openclaw/openclaw@299694d test: mock legacy state plugin boundaries · openclaw/openclaw@2713089 test: mock channel install boundaries · openclaw/openclaw@b945248 test: mock doctor preview channel boundaries · openclaw/openclaw@b1a3ad4 test: trim doctor command hotspots · openclaw/openclaw@c66f16a test: isolate agent auth and spawn hotspots · openclaw/openclaw@9285935 test: stabilize MCP startup disposal race · openclaw/openclaw@dd9d2eb test: merge browser contract server suites · openclaw/openclaw@5817a76 test: narrow ollama provider discovery setup · openclaw/openclaw@a0d9598 build: declare qa-lab aimock runtime dependency · openclaw/openclaw@24431e5 test: speed up safe-bins exec harness · openclaw/openclaw@ee856ab test: preserve tool helpers in embedded runner mocks · openclaw/openclaw@acd86a0 refactor: move memory embeddings into provider plugins · openclaw/openclaw@77e6e4c test: reuse system-run temp fixtures · openclaw/openclaw@7e9ff0f test: trim hotspot wait overhead · openclaw/openclaw@12a59b0 Check: avoid duplicate boundary prep · openclaw/openclaw@baf11b8 test: reduce hotspot fixture overhead · openclaw/openclaw@3a59edd feat(ui): overhaul settings and slash command UX (#67819) thanks @Bun… · openclaw/openclaw@2cfb660 QA Matrix: exit cleanly on failure · openclaw/openclaw@42805d2 QA Matrix: isolate scenario coverage · openclaw/openclaw@7e659e1 Matrix: refresh crypto bootstrap state · openclaw/openclaw@94081d8 QA Lab: add provider registry · openclaw/openclaw@bb7e982 Matrix: add plugin changelog · openclaw/openclaw@4acab55 test: trim more hotspot overhead · openclaw/openclaw@f485311 test: trim remaining hotspot tests · openclaw/openclaw@6ba8626 test: narrow hotspot mocks · openclaw/openclaw@dbc8179 test: isolate gemini embedding request helpers · openclaw/openclaw@cd330f5 test: trim memory and mcp hotspots · openclaw/openclaw@fd48dfa test: slim provider registry mocks · openclaw/openclaw@2e08c77 test: harden Parallels update smoke · openclaw/openclaw@1a98090 feat: default Anthropic to Opus 4.7 · openclaw/openclaw@628b454 fix: harden node-host shell payload mutability checks · openclaw/openclaw@75c551e fix: land node-host approval binding for native binaries (#66731) (th… · openclaw/openclaw@29919bb CI: add daily schedule to CodeQL workflow (#67645) fix(gateway): capture config hash after plugin auto-enable to prevent… · openclaw/openclaw@8c11210 fix: repair sanitized replay tool results before send (#67620) (thank… fix: restrict HTML timeout short-circuit to transient statuses fix: keep TUI watchdog bound to active run (#67401) (thanks @xantorres) Gateway/skills: dedupe skills prefix-match + drop dead fallback on log Extensions/lmstudio: back off inference preload after consecutive fai… TUI/streaming: add watchdog that resets the activity indicator after … Agents/tool-loop: enable unknown-tool stream guard by default · openclaw/openclaw@36ed367 Gateway/skills: invalidate session skills snapshot on config write fix: classify HTML provider error pages correctly (#67642) (thanks @s… fix(skills): remove unused model-usage import (#67641) · openclaw/openclaw@55f05df docs(changelog): credit codex fix superseded PRs · openclaw/openclaw@e485f24 fix(openai-codex): normalize stale transport metadata in resolution a… · openclaw/openclaw@90801ba CI: pin Docker-related GitHub Actions (#67632) · openclaw/openclaw@f697b01 Android: modernize WebView and discovery API usage (#67627) · openclaw/openclaw@44a6e50 fix(deps): bump hono to 4.12.14 and @hono/node-server to 1.19.14 (GHS… fix(deps): bump dompurify to 3.4.0 (#67614) CI: add explicit permissions to all workflow jobs (fixes code-scannin… fix: register bundled TTS providers and route overrides correctly (#6… fix: align host tilde paths with OS home (#62804) (thanks @stainlu) fix: flush creds queue before reconnect socket open (#67464) (thanks … · openclaw/openclaw@405c63f fix: strip standalone <function> tool call tags from visible text (#6… · openclaw/openclaw@78df859 fix(agents): preserve cli session metadata before transcript persist … · openclaw/openclaw@898fd04 docs(changelog): move cli transcript entry · openclaw/openclaw@c1817c6 fix(agents): normalize cli transcript api field · openclaw/openclaw@3a3fae0 docs(changelog): note cli transcript persistence · openclaw/openclaw@6c343f1 fix(agents): persist cli transcript turns · openclaw/openclaw@b8ef507 fix(msteams): harden security-sensitive flows (#65841) · openclaw/openclaw@c56b56e [Dashboard] Fix exec approval modal overflow for long command content… · openclaw/openclaw@053c5b0 Docs: remove QA changelog entry · openclaw/openclaw@7fd5771 QA: fix private runtime source loading (#67428) · openclaw/openclaw@d5933af docs(gateway): correct protocol.md schema path, hello-ok example, aut… · openclaw/openclaw@489404d CI: pin Node 22 runners to 22.18.0 · openclaw/openclaw@4ffa621 models.authStatus: normalize provider ids + tighten env-backed escape… · openclaw/openclaw@f2fdb9d Update CHANGELOG.md · openclaw/openclaw@7694a92 test(parallels): clean up npm update guard jobs · openclaw/openclaw@045ea7b Plugins: prefer scanDir override paths · openclaw/openclaw@b2974da fix(dreaming): default storage.mode to "separate" so phase blocks sto… · openclaw/openclaw@8c392f0 fix(memory-core): skip dreaming transcript ingestion via session stor… · openclaw/openclaw@a1b01f0 fix: dedupe replayed exec.finished node events (#67281) · openclaw/openclaw@5dcf526
fix(amazon-bedrock): add known model context windows to discovery (#6… · openclaw/openclaw@2a15a3b
wirjo · 2026-04-23 · via Recent Commits to openclaw:main

@@ -21,8 +21,121 @@ import {

2121

const log = createSubsystemLogger("bedrock-discovery");

22222323

const DEFAULT_REFRESH_INTERVAL_SECONDS = 3600;

24-

const DEFAULT_CONTEXT_WINDOW = 32000;

24+

const DEFAULT_CONTEXT_WINDOW = 32_000;

2525

const DEFAULT_MAX_TOKENS = 4096;

26+27+

// ---------------------------------------------------------------------------

28+

// Known model context windows (Bedrock API does not expose token limits)

29+

// ---------------------------------------------------------------------------

30+31+

/**

32+

* Bedrock's ListFoundationModels and GetFoundationModel APIs return no token

33+

* limit information — only model ID, name, modalities, and lifecycle status.

34+

* There is currently no Bedrock API to discover context windows or max output

35+

* tokens programmatically.

36+

*

37+

* This map provides correct context window values for known models so that

38+

* session management, compaction thresholds, and context overflow detection

39+

* work correctly. If AWS adds token metadata to the API in the future, this

40+

* table should become a fallback rather than the primary source.

41+

*

42+

* Inference profile prefixes (us., eu., ap., global.) are stripped before lookup.

43+

*

44+

* Sources: https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html

45+

* https://platform.claude.com/docs/en/about-claude/models

46+

*/

47+

const KNOWN_CONTEXT_WINDOWS: Record<string, number> = {

48+

// Anthropic Claude

49+

"anthropic.claude-3-7-sonnet-20250219-v1:0": 200_000,

50+

"anthropic.claude-opus-4-7": 1_000_000,

51+

"anthropic.claude-opus-4-6-v1": 1_000_000,

52+

"anthropic.claude-opus-4-6-v1:0": 1_000_000,

53+

"anthropic.claude-sonnet-4-6": 1_000_000,

54+

"anthropic.claude-sonnet-4-6-v1:0": 1_000_000,

55+

"anthropic.claude-sonnet-4-5-20250929-v1:0": 200_000,

56+

"anthropic.claude-sonnet-4-20250514-v1:0": 200_000,

57+

"anthropic.claude-opus-4-5-20251101-v1:0": 200_000,

58+

"anthropic.claude-opus-4-1-20250805-v1:0": 200_000,

59+

"anthropic.claude-haiku-4-5-20251001-v1:0": 200_000,

60+

"anthropic.claude-3-5-haiku-20241022-v1:0": 200_000,

61+

"anthropic.claude-3-haiku-20240307-v1:0": 200_000,

62+

// Amazon Nova

63+

"amazon.nova-premier-v1:0": 1_000_000,

64+

"amazon.nova-pro-v1:0": 300_000,

65+

"amazon.nova-lite-v1:0": 300_000,

66+

"amazon.nova-micro-v1:0": 128_000,

67+

"amazon.nova-2-lite-v1:0": 300_000,

68+

// MiniMax

69+

"minimax.minimax-m2.5": 1_000_000,

70+

"minimax.minimax-m2.1": 1_000_000,

71+

"minimax.minimax-m2": 1_000_000,

72+

// Meta Llama 4

73+

"meta.llama4-maverick-17b-instruct-v1:0": 1_000_000,

74+

"meta.llama4-scout-17b-instruct-v1:0": 512_000,

75+

// Meta Llama 3

76+

"meta.llama3-3-70b-instruct-v1:0": 128_000,

77+

"meta.llama3-2-90b-instruct-v1:0": 128_000,

78+

"meta.llama3-2-11b-instruct-v1:0": 128_000,

79+

"meta.llama3-2-3b-instruct-v1:0": 128_000,

80+

"meta.llama3-2-1b-instruct-v1:0": 128_000,

81+

"meta.llama3-1-405b-instruct-v1:0": 128_000,

82+

"meta.llama3-1-70b-instruct-v1:0": 128_000,

83+

"meta.llama3-1-8b-instruct-v1:0": 128_000,

84+

// NVIDIA Nemotron

85+

"nvidia.nemotron-super-3-120b": 256_000,

86+

"nvidia.nemotron-nano-3-30b": 128_000,

87+

"nvidia.nemotron-nano-12b-v2": 128_000,

88+

"nvidia.nemotron-nano-9b-v2": 128_000,

89+

// Mistral

90+

"mistral.mistral-large-3-675b-instruct": 128_000,

91+

"mistral.mistral-large-2407-v1:0": 128_000,

92+

"mistral.mistral-small-2402-v1:0": 32_000,

93+

// DeepSeek

94+

"deepseek.r1-v1:0": 128_000,

95+

"deepseek.v3.2": 128_000,

96+

// Cohere

97+

"cohere.command-r-plus-v1:0": 128_000,

98+

"cohere.command-r-v1:0": 128_000,

99+

// AI21

100+

"ai21.jamba-1-5-large-v1:0": 256_000,

101+

"ai21.jamba-1-5-mini-v1:0": 256_000,

102+

// Google Gemma

103+

"google.gemma-3-27b-it": 128_000,

104+

"google.gemma-3-12b-it": 128_000,

105+

"google.gemma-3-4b-it": 128_000,

106+

// GLM

107+

"zai.glm-5": 128_000,

108+

"zai.glm-4.7": 128_000,

109+

"zai.glm-4.7-flash": 128_000,

110+

// Qwen

111+

"qwen.qwen3-coder-next": 256_000,

112+

"qwen.qwen3-coder-30b-a3b-v1:0": 256_000,

113+

"qwen.qwen3-32b-v1:0": 128_000,

114+

"qwen.qwen3-vl-235b-a22b": 128_000,

115+

};

116+117+

/**

118+

* Resolve the real context window for a Bedrock model ID.

119+

* Strips inference profile prefixes (us., eu., ap., global.) before lookup.

120+

*/

121+

function resolveKnownContextWindow(modelId: string): number | undefined {

122+

const stripped = modelId.replace(/^(?:us|eu|ap|apac|au|jp|global)\./, "");

123+

const candidates = [modelId, stripped];

124+

for (const candidate of candidates) {

125+

if (KNOWN_CONTEXT_WINDOWS[candidate] !== undefined) {

126+

return KNOWN_CONTEXT_WINDOWS[candidate];

127+

}

128+

const withoutVersionSuffix = candidate.replace(/:0$/, "");

129+

if (

130+

withoutVersionSuffix !== candidate &&

131+

KNOWN_CONTEXT_WINDOWS[withoutVersionSuffix] !== undefined

132+

) {

133+

return KNOWN_CONTEXT_WINDOWS[withoutVersionSuffix];

134+

}

135+

}

136+

return undefined;

137+

}

138+26139

const DEFAULT_COST = {

27140

input: 0,

28141

output: 0,

@@ -163,7 +276,7 @@ function toModelDefinition(

163276

reasoning: inferReasoningSupport(summary),

164277

input: mapInputModalities(summary),

165278

cost: DEFAULT_COST,

166-

contextWindow: defaults.contextWindow,

279+

contextWindow: resolveKnownContextWindow(id) ?? defaults.contextWindow,

167280

maxTokens: defaults.maxTokens,

168281

};

169282

}

@@ -192,7 +305,7 @@ function resolveBaseModelId(profile: InferenceProfileSummary): string | undefine

192305

}

193306

if (profile.type === "SYSTEM_DEFINED") {

194307

const id = profile.inferenceProfileId ?? "";

195-

const prefixMatch = /^(?:us|eu|ap|jp|global)\.(.+)$/i.exec(id);

308+

const prefixMatch = /^(?:us|eu|ap|apac|au|jp|global)\.(.+)$/i.exec(id);

196309

if (prefixMatch) {

197310

return prefixMatch[1];

198311

}

@@ -282,7 +395,9 @@ function resolveInferenceProfiles(

282395

reasoning: baseModel?.reasoning ?? false,

283396

input: baseModel?.input ?? ["text"],

284397

cost: baseModel?.cost ?? DEFAULT_COST,

285-

contextWindow: baseModel?.contextWindow ?? defaults.contextWindow,

398+

contextWindow: baseModel?.contextWindow

399+

?? resolveKnownContextWindow(baseModelId ?? profile.inferenceProfileId ?? "")

400+

?? defaults.contextWindow,

286401

maxTokens: baseModel?.maxTokens ?? defaults.maxTokens,

287402

});

288403

}