惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

C
Cyber Attacks, Cyber Crime and Cyber Security
The Last Watchdog
The Last Watchdog
Forbes - Security
Forbes - Security
S
Security @ Cisco Blogs
TaoSecurity Blog
TaoSecurity Blog
T
Troy Hunt's Blog
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
H
Hacker News: Front Page
W
WeLiveSecurity
WordPress大学
WordPress大学
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
N
News | PayPal Newsroom
D
DataBreaches.Net
博客园_首页
Y
Y Combinator Blog
F
Fortinet All Blogs
罗磊的独立博客
Apple Machine Learning Research
Apple Machine Learning Research
T
Tailwind CSS Blog
Hacker News - Newest:
Hacker News - Newest: "LLM"
Hugging Face - Blog
Hugging Face - Blog
The GitHub Blog
The GitHub Blog
B
Blog RSS Feed
C
CERT Recently Published Vulnerability Notes
P
Privacy & Cybersecurity Law Blog
Help Net Security
Help Net Security
S
SegmentFault 最新的问题
Recorded Future
Recorded Future
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
N
News and Events Feed by Topic
Schneier on Security
Schneier on Security
V
Vulnerabilities – Threatpost
A
About on SuperTechFans
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
L
LangChain Blog
酷 壳 – CoolShell
酷 壳 – CoolShell
GbyAI
GbyAI
L
LINUX DO - 热门话题
Simon Willison's Weblog
Simon Willison's Weblog
雷峰网
雷峰网
G
Google Developers Blog
Cyberwarzone
Cyberwarzone
I
Intezer
Google DeepMind News
Google DeepMind News
AWS News Blog
AWS News Blog
C
Check Point Blog
AI
AI
博客园 - 【当耐特】
有赞技术团队
有赞技术团队
博客园 - 司徒正美

Release notes from OmniClip-RAG

Release v0.4.8 · msjsc001/OmniClip-RAG Release v0.4.7 · msjsc001/OmniClip-RAG Release v0.4.6 · msjsc001/OmniClip-RAG Release v0.4.5 · msjsc001/OmniClip-RAG Release v0.4.4 · msjsc001/OmniClip-RAG Release v0.4.3 · msjsc001/OmniClip-RAG Release v0.4.2 · msjsc001/OmniClip-RAG Release v0.4.1 · msjsc001/OmniClip-RAG Release v0.4.0 · msjsc001/OmniClip-RAG
Release v0.3.3 · msjsc001/OmniClip-RAG
msjsc001 · 2026-03-17 · via Release notes from OmniClip-RAG

OmniClip RAG / 方寸引 v0.3.3

Summary

v0.3.3 is the Tika stability closure release after v0.3.2.

This release focuses on one practical outcome: if a user selects a Tika-supported format and the file itself is healthy, the build pipeline should actually index it instead of reporting a vague "skipped" result. At the same time, the Tika runtime installer should finally show visible progress instead of behaving like a black box.

Highlights

  • Tika indexing is now compatibility-first:
    • first try PUT /tika with Accept: text/plain
    • then fall back to PUT /rmeta with Accept: application/json
    • only fail after both strategies are exhausted
  • Tika success is now defined as extractable body text, not "XHTML must exist"
  • expected local-file skips are now separated from real parser failures:
    • zero-byte or unreadable files stay skippable
    • sidecar/protocol failures are reported as true failures
  • the Tika Runtime card now shows inline install progress:
    • current stage
    • current download item
    • byte progress
    • install target
  • project docs were refreshed to match the current product shape:
    • Core Features / 核心特性
    • explicit open-source acknowledgements
    • a permanent in-repo Tika stabilization plan

Included In This Release

Tika build/index path

  • replaced the old XHTML-only parse contract with a compatibility-first multi-strategy parser
  • added structured Tika parse results and failure reasons
  • added unified content normalization for plain text, rmeta JSON, and XHTML
  • improved build reporting so users can distinguish:
    • indexed files
    • expected skips
    • true parse failures

Tika installer UX

  • switched the Tika auto-install flow to a progress-aware worker path
  • surfaced installation stage, byte progress, current item, and target directory inside the page
  • improved busy/disabled state handling while install or redetect operations are running

Documentation

  • updated README and README.zh-CN to v0.3.3
  • renamed the feature overview section to Core Features / 核心特性
  • added an open-source acknowledgements section before the license block
  • recorded this stabilization work in Tika建库稳定性与安装进度闭环计划

Release Shape

  • GitHub source push: code, docs, tests, release notes
  • Packaged build output:
    • dist/OmniClipRAG-v0.3.3/
    • dist/OmniClipRAG-v0.3.3-win64.zip
  • Still intentionally not bundled into the app package:
    • Runtime payloads
    • Tika JAR / JRE
    • model cache
    • user data / indexes / logs / exports

Notes

This release does not change the architecture boundary that matters most:

  • PDF still stays on its own isolated route
  • Tika still stays isolated from the Markdown and PDF stores
  • the packaged app remains lean, while heavy runtime assets stay outside the EXE payload