惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

W
WeLiveSecurity
The GitHub Blog
The GitHub Blog
Engineering at Meta
Engineering at Meta
Microsoft Azure Blog
Microsoft Azure Blog
The Register - Security
The Register - Security
Stack Overflow Blog
Stack Overflow Blog
博客园 - 三生石上(FineUI控件)
T
Threat Research - Cisco Blogs
S
SegmentFault 最新的问题
V2EX - 技术
V2EX - 技术
Hacker News: Ask HN
Hacker News: Ask HN
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
P
Proofpoint News Feed
J
Java Code Geeks
Microsoft Security Blog
Microsoft Security Blog
M
MIT News - Artificial intelligence
AI
AI
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
P
Proofpoint News Feed
Hacker News - Newest:
Hacker News - Newest: "LLM"
B
Blog
N
News and Events Feed by Topic
N
News | PayPal Newsroom
Google DeepMind News
Google DeepMind News
酷 壳 – CoolShell
酷 壳 – CoolShell
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
WordPress大学
WordPress大学
C
Cybersecurity and Infrastructure Security Agency CISA
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
博客园 - 【当耐特】
U
Unit 42
腾讯CDC
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
The Cloudflare Blog
H
Help Net Security
Recent Announcements
Recent Announcements
P
Privacy & Cybersecurity Law Blog
IT之家
IT之家
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Security Archives - TechRepublic
Security Archives - TechRepublic
L
LINUX DO - 热门话题
Martin Fowler
Martin Fowler
MongoDB | Blog
MongoDB | Blog
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
H
Heimdal Security Blog
博客园 - 聂微东
S
Securelist
大猫的无限游戏
大猫的无限游戏
Cloudbric
Cloudbric
Cisco Talos Blog
Cisco Talos Blog

Release notes from OmniClip-RAG

Release v0.4.8 · msjsc001/OmniClip-RAG Release v0.4.7 · msjsc001/OmniClip-RAG Release v0.4.6 · msjsc001/OmniClip-RAG Release v0.4.5 · msjsc001/OmniClip-RAG Release v0.4.4 · msjsc001/OmniClip-RAG Release v0.4.3 · msjsc001/OmniClip-RAG Release v0.4.2 · msjsc001/OmniClip-RAG Release v0.4.1 · msjsc001/OmniClip-RAG Release v0.4.0 · msjsc001/OmniClip-RAG
Release v0.3.3 · msjsc001/OmniClip-RAG
msjsc001 · 2026-03-17 · via Release notes from OmniClip-RAG

OmniClip RAG / 方寸引 v0.3.3

Summary

v0.3.3 is the Tika stability closure release after v0.3.2.

This release focuses on one practical outcome: if a user selects a Tika-supported format and the file itself is healthy, the build pipeline should actually index it instead of reporting a vague "skipped" result. At the same time, the Tika runtime installer should finally show visible progress instead of behaving like a black box.

Highlights

  • Tika indexing is now compatibility-first:
    • first try PUT /tika with Accept: text/plain
    • then fall back to PUT /rmeta with Accept: application/json
    • only fail after both strategies are exhausted
  • Tika success is now defined as extractable body text, not "XHTML must exist"
  • expected local-file skips are now separated from real parser failures:
    • zero-byte or unreadable files stay skippable
    • sidecar/protocol failures are reported as true failures
  • the Tika Runtime card now shows inline install progress:
    • current stage
    • current download item
    • byte progress
    • install target
  • project docs were refreshed to match the current product shape:
    • Core Features / 核心特性
    • explicit open-source acknowledgements
    • a permanent in-repo Tika stabilization plan

Included In This Release

Tika build/index path

  • replaced the old XHTML-only parse contract with a compatibility-first multi-strategy parser
  • added structured Tika parse results and failure reasons
  • added unified content normalization for plain text, rmeta JSON, and XHTML
  • improved build reporting so users can distinguish:
    • indexed files
    • expected skips
    • true parse failures

Tika installer UX

  • switched the Tika auto-install flow to a progress-aware worker path
  • surfaced installation stage, byte progress, current item, and target directory inside the page
  • improved busy/disabled state handling while install or redetect operations are running

Documentation

  • updated README and README.zh-CN to v0.3.3
  • renamed the feature overview section to Core Features / 核心特性
  • added an open-source acknowledgements section before the license block
  • recorded this stabilization work in Tika建库稳定性与安装进度闭环计划

Release Shape

  • GitHub source push: code, docs, tests, release notes
  • Packaged build output:
    • dist/OmniClipRAG-v0.3.3/
    • dist/OmniClipRAG-v0.3.3-win64.zip
  • Still intentionally not bundled into the app package:
    • Runtime payloads
    • Tika JAR / JRE
    • model cache
    • user data / indexes / logs / exports

Notes

This release does not change the architecture boundary that matters most:

  • PDF still stays on its own isolated route
  • Tika still stays isolated from the Markdown and PDF stores
  • the packaged app remains lean, while heavy runtime assets stay outside the EXE payload