惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

NISL@THU
NISL@THU
有赞技术团队
有赞技术团队
WordPress大学
WordPress大学
U
Unit 42
腾讯CDC
宝玉的分享
宝玉的分享
Y
Y Combinator Blog
V
Visual Studio Blog
C
Check Point Blog
N
Netflix TechBlog - Medium
云风的 BLOG
云风的 BLOG
博客园 - 聂微东
酷 壳 – CoolShell
酷 壳 – CoolShell
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
P
Privacy & Cybersecurity Law Blog
V
Vulnerabilities – Threatpost
The Hacker News
The Hacker News
人人都是产品经理
人人都是产品经理
Google DeepMind News
Google DeepMind News
Vercel News
Vercel News
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
N
News and Events Feed by Topic
aimingoo的专栏
aimingoo的专栏
S
SegmentFault 最新的问题
Engineering at Meta
Engineering at Meta
Cyberwarzone
Cyberwarzone
The Last Watchdog
The Last Watchdog
S
Secure Thoughts
Recorded Future
Recorded Future
阮一峰的网络日志
阮一峰的网络日志
博客园 - Franky
E
Exploit-DB.com RSS Feed
V
V2EX
S
Security Affairs
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
IT之家
IT之家
爱范儿
爱范儿
小众软件
小众软件
Last Week in AI
Last Week in AI
C
Cybersecurity and Infrastructure Security Agency CISA
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
O
OpenAI News
The Cloudflare Blog
Cloudbric
Cloudbric
L
Lohrmann on Cybersecurity
H
Hacker News: Front Page
C
Cisco Blogs
CTFtime.org: upcoming CTF events
CTFtime.org: upcoming CTF events
Webroot Blog
Webroot Blog
月光博客
月光博客

Show HN

Release v1.3.0 — AI-Powered Migration Explanation & Migrations Folder Support · migradiff/migra Show HN: HumanForScale – See how big things are DropLock Show HN: Ego lite – why our browser agent writes JavaScript not CLI commands Extend AI · sound like you, everywhere SnapState — Your workspace, perfected. Helios. Is plug-in solar worth it? GitHub - riddleling/docOCR: macOS CLI and HTTP OCR tool for converting document images to Markdown. Geostakes — Stake. Guess. Win. Show HN: AI-org – org-mode powered by AI GitHub - PepperDev/totpgate: Lightweight SPA TOTP port knocking daemon GitHub - ppnpm/clinlang: Shorthand writing for doctors. Write shortly and later convert into structured case documentations. cartographer-skill/skills/cartographer/SKILL.md at main · spinchange/cartographer-skill GitHub - schildep/verified-polygon-intersection: Formally verified polygon intersection FreeCal — calendars for your organisation Show HN: Self Publish Studio Owl VIP Email Alerts | Gmail Notifications for Specific Senders 1 Million Pixels Show HN: [Geo-Cast] hear what other people are saying ClawChat GitHub - migradiff/migra: The actively maintained fork of migra — PostgreSQL schema diff and migration script generator. vibebnb · bring back Airbnb's vibe filters GitHub - jmaczan/tiny-vllm: Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM Reposeek - Find the repo to build on GitHub - hunvreus/heypi: Chat agents for your team, with approvals and sandboxed tools. Slack, Discord, Telegram, webhooks. Show HN: Heirlooms – pass your legacy to family after stop breathing Show HN: I launched a micro-gig marketplace and used it to buy my own GTM plan Show HN: AionOS – self-healing microkernel in Zig (boots on real hardware) GitHub - njbrake/dotpi: My ds4 + pi configuration for success Sverklo - Repo Memory for Coding Agents GAIA Atlas - Local Stellar Map GitHub - joshduffy/claude-handoff-guard: Hook-enforced ownership for AI coding session handoffs Prezlo — The AI Visibility Platform for Professionals GitHub - kenm47/nvEnvy: Fast, keyboard-driven note-taking app for macOS — a modern rebuild of nvALT in Swift/SwiftUI. vibewarz — bot-vs-bot arena Inkfeed TV Explorer — 10,000 Free TV Channels Oort — The prompt stack solo devs ship with GitHub - Bella3202019/promptloop: Claude Code for prompt eval Stefan Le Noach Show HN: A smarter CSS selector generator Phoenix Code - Free Open Source Code Editor | Successor to Brackets Korean OEM/ODM Manufacturer Sourcing Platform | OEMKorea GitHub - OWASP/www-project-agent-memory-guard: OWASP Foundation web repository GitHub - fynyky/elemental: Simple reactive front-end library HolaClaw: run OpenClaw securely in Mac Semiconductor Review GitHub - vaddisrinivas/tab-council: Chrome MV3 extension that turns AI tabs into a structured model council Stillis - The Social Stock Market OpenHive — Agents working together Repolog — SEO, Performance, Security & AI Readiness audits Integuru - Generate fast, reliable APIs for any platform JobTrue — Your home base for the job search. LocalizeASO - ASO Localization for App Store Screenshots and Metadata RFC Reader — search & read IETF RFCs Kotlin Stdlib API Search — Autocomplete & Doc Reference KeptWell — Your family's medical binder, replaced. Cliparr | Self-hosted video clipper for Plex, Jellyfin, and local files. GitHub - glebmish/claude-code-replay: Replay Claude Code session logs to reconstruct lost project files, commit by commit. One Tile GitHub - aavilagallego/TheFoundry: The Foundry is a User Friendly - Enterprise Ready Multi-Agent System (MAS) bootstrapping framework. Monitoring | Firecrawl EverFree — Free, GitHub-backed notes Show HN: Orbital Package System (Ops) Free Furigana Converter: Kanji to Hiragana | EZFurigana How I built Ensemble · Brutal Cut Claude Code Costs ~50% Without Quality Loss | Headroom GitHub - stateflow-dev/adaptive-runtime: Adaptive Runtime Layer for Stateful AI Systems Show HN: Artwork in the style of Mark Rothko GitHub - fayazara/Screendrop: A native macOS menu bar app for taking screenshots, recording the screen, annotating captures, and sharing them when needed. It is built for a fast local workflow: capture something, preview it immediately, mark it up, save it, copy it, or upload it from the same floating preview. deepface.dev GitHub - gitricko/hermes-webtop: An agent that grows with you Sensonym - Forget Flashcards, Learn Languages by Doing Steam 上的 Code: Terraform GitHub - Thinklanceai/agentkeeper: Crash-resistant cognitive continuity for AI agents — checkpoint/restore, cross-model state reconstruction, semantic recall, and compression. Your agent survives crashes, restarts, and model switches. Tap — The browser with no tabs. Blinken · Bring back the blink GitHub - remontsuri/EV-QA-Framework: ML-powered QA framework for EV battery systems — telemetry validation, anomaly detection, SOH prediction, CAN bus (2.0B + J1939) emulation, DBC parser, Prometheus metrics, Grafana dashboard Show HN: Sixbpm – a free thing that slows your breathing down MapZap — 300 Local Business Leads in 60 Seconds Alphaset - Organic data, expert-grown Show HN: OSSentinel.live – AI-powered open source security monitoring Show HN: Claude Code AskUserQuestion which works for subagents/teams/workflows GitHub - SharkUI/SharkBay: SharkBay is a local-first macOS workbench for software projects. It helps you keep a set of local repositories visible, open project-scoped terminals and browser tabs, inspect Git state, and coordinate agent work through local Markdown task records. GitHub - doiito/gliding_horse: Gliding Horse is a multi-agent orchestration framework built in Rust that supports PDCA scheduling and knowledge graph-based agents, with comprehensive Chinese documentation, and is suitable for building enterprise-level AI agent systems. GitHub - 0xJaksun/lithium-core: Storage engine for AI agents to navigate, store, and retrieve structured data. PostgreSQL ltree, built-in versioning, scoped queries. GitHub - GiorgosXou/MLPico: Static-allocation MLP inference in ANSI C using 2-slot circular buffer with fixed stride indexing. An easy to use, minimal MLP alternative to GiorgosXou/NeuralNetworks enhanced with PROGMEM, int-quantization etc. GitHub - skorotkiewicz/acp-p2p: A P2P (POC) of the ACP for decentralized agent communication GitHub - yeet-src/airtop: htop for the airwaves — a live 802.11 (Wi-Fi) RF dashboard in your terminal GitHub - tamarillo-ai/theta-spec: harness agnostic configuration standard Show HN: DRD – Git for distributed consensus failures" GitHub - Heidar-An/Mira: Search for files semantically - no exact filenames required. GitHub - djadmin/fort: macOS CLI: endpoint security audit + SOC 2 readiness reports GitHub - exlee/rik: rik - limited agent edition Pieces | The pre-information market GitHub - jmilinovich/grove: Open-source MCP server over a git-backed Obsidian vault. Single-user, self-host. Six tools, hybrid search, provenance/blame, auto-link discovery. GitHub - Astralchemist/rig: Local-first semantic knowledge graph with magnetic-pull retrieval GitHub - sediman-agent/OpenSkynet: Your 24/7 Terminator CSP Radar GitHub - wartzar-bee/tokenscope: See what your AI-coding session actually cost — and what is eating your context. Local, read-only CLI for Claude Code.
shivvr — semantic embedding & cognitive agent service
kordlessagai · 2026-06-17 · via Show HN

shivvr 🔪 v0.3.0

Ephemeral semantic embedding & cognitive agent service.

Chunk text. Embed with GTR-T5-base. Search via hybrid FST-BM25 vector rank fusion. Stream turn-based cognitive agent reasoning via native Model Context Protocol (MCP).

Capabilities

Ingest

Sentence-boundary chunking + GTR-T5-base embeddings (768d). Stores in RwLock<HashMap> — pure ephemeral compute.

FST-BM25 Hybrid Search

RRF blending dense vectors and sparse lexical indices. FST dictionary scanning provides microsecond safe query guardrails and intent-entity score boosting.

Cognitive GhostAgent

Turn-based agent loop with integrated memory search, document ingestion, session indexing, and sandboxed command execution. Powered by OpenAI/Anthropic.

Native MCP Server

Complete Model Context Protocol HTTP/SSE server endpoint (/mcp/sse) allowing immediate, zero-config integration with Claude Code, Antigravity, and Codex.

Crypto

Per-agent orthogonal matrix rotation on embeddings. Cosine similarity preserved under encryption. Keys are in-memory only.

Dual embedding

organize role uses GTR-T5-base (768d). retrieve role uses OpenAI text-embedding-ada-002 (1536d) — pass your own key or set server-side.

API

MethodEndpointDescription
GET/healthStatus, model info, live counts
GET/mcp/sseMCP Server SSE handshake
POST/mcp/messageMCP Server JSON-RPC message router
POST/sessions/:id/agent/chatStream non-blocking GhostAgent cognitive turns (SSE)
POST/sessions/:id/ingestChunk + embed text into session
GET/sessions/:id/search?q=...Semantic search (supports RRF hybrid & lexical_only)
GET/sessions/:idSession metadata
DELETE/sessions/:idDelete session
GET/tempList temp stores with TTL
POST/temp/:name/ingestIngest into temp store (2 hr TTL)
GET/temp/:name/search?q=...Search temp store
DELETE/temp/:nameDelete temp store
POST/agent/:id/registerRegister per-agent orthogonal key
POST/agent/:id/encryptEncrypt embeddings
POST/agent/:id/decryptDecrypt embeddings
POST/invertReconstruct text from embedding vector

Quick start

# Ingest into session
curl -X POST https://shivvr.nuts.services/sessions/my-session/ingest \
  -H "Content-Type: application/json" \
  -d '{"text": "Supreme Raven is protected by Known Opossum.", "source": "vault_specs"}'

# Autonomous Agent Conversational Chat (Streams Thoughts, ToolCalls, & Answer via SSE)
curl -i -X POST http://localhost:8085/sessions/my-session/agent/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "Who protects the Supreme Raven?"}'

# Vector-Lexical Hybrid RRF Search
curl "http://localhost:8085/sessions/my-session/search?q=Known+Opossum&hybrid=true"

# High-speed Lexical-Only BM25 Search (Bypasses ONNX embedder)
curl "http://localhost:8085/sessions/my-session/search?q=Opossum&lexical_only=true"

# Synchronize Claude Code or Antigravity with shivvr's Native MCP Server
nemesis8 mcp add http://localhost:8085/mcp/sse

Search parameters

ParamDefaultDescription
qrequiredQuery text
n5Number of results
hybridfalseBlend semantic vectors + BM25 scores (Reciprocal Rank Fusion)
lexical_onlyfalseBypass vector embedder, execute pure BM25 search
guardrailtrueEnable FST toxic term scanning and automatic query blocking
roleorganizeorganize (768d local) or retrieve (1536d OpenAI)
time_weight0.0Blend semantic + recency score (0–1)
decay_halflife_hours168Recency decay half-life in hours
include_nearbyfalseReturn temporally adjacent chunks
agent_idAgent ID for encrypted search
openai_api_keyPer-request OpenAI key for retrieve role (overrides server key)

Environment

VariableDefaultDescription
PORT8080Listen port
MODEL_PATHmodels/gtr-t5-base.onnxGTR-T5-base ONNX embedder
TOKENIZER_PATHmodels/tokenizer.jsonTokenizer
OPENAI_API_KEYEnables OpenAI completions and retrieve embeddings
ANTHROPIC_API_KEYEnables Anthropic completions and GhostAgent loops
NUTS_AUTH_JWKS_URLEnable auth (open dev mode if unset)
NUTS_AUTH_VALIDATE_URLhttps://auth.nuts.services/api/validateAPI token validation endpoint

Stack

LayerChoice
RuntimeRust + Tokio + axum
CognitionGhostAgent cognitive RAG turn loop (OpenAI / Anthropic compat)
MCP ServerHTTP/SSE JSON-RPC 2.0 Model Context Protocol transport layer
Hybrid IndexTantivy FST deterministic phrase engine + BM25F field indexer
EmbeddingGTR-T5-base (768d) via ONNX Runtime 2.0 — local, required
StorageEphemeral RwLock<HashMap> — no disk, no volume mounts
GPUCUDA 12.6 via ort EP on Cloud Run L4 — CPU fallback automatic
Inversionvec2text gtr-base (projection + T5 enc/dec) — optional