慣性聚合 高效追蹤和閱讀你感興趣的部落格、新聞、科技資訊
閱讀原文 在慣性聚合中打開

推薦訂閱源

博客园 - 司徒正美
V
V2EX
T
Tailwind CSS Blog
有赞技术团队
有赞技术团队
aimingoo的专栏
aimingoo的专栏
Apple Machine Learning Research
Apple Machine Learning Research
IT之家
IT之家
Blog — PlanetScale
Blog — PlanetScale
A
About on SuperTechFans
月光博客
月光博客
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
Martin Fowler
Martin Fowler
博客园 - 聂微东
The GitHub Blog
The GitHub Blog
V
Visual Studio Blog
WordPress大学
WordPress大学
酷 壳 – CoolShell
酷 壳 – CoolShell
Engineering at Meta
Engineering at Meta
GbyAI
GbyAI

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python)
你的收件箱知道太多:Parsli 適合對隱私感到擔憂的人
Olga Bragins · 2026-05-24 · via DEV Community

這是一份提交給Gemma 4挑戰:用Gemma 4

所建構的內容 我所建構的

每隔幾週就會有關客戶數據洩露或遭到攻擊的SaaS平台的新聞。同時,人們仍然樂呵呵地將他們的Gmail帳戶連接到隨機的AI產品,僅僅為了回答一個問題:「我的包裹在哪裡?」

Parsli 是一個本機優先的AI助理,它能在不將你的收件箱傳送至別人雲端的情況下回答這個問題.

它能本機連接到Gmail,解析與運送相關的郵件,提取追蹤信息,分類運送事件,並從市場、承運商、取貨點、海關通知和隨機商店(它們仍然像2009年一樣發送郵件)的混亂中建立時間線。

Screenshot of the main page

運送郵件一旦仔細查看,會令人驚訝地顯示出你的個人資料。它們悄悄透露你購物的地方、你使用的藥房、你購買的昂貴物品、你出行的時間,甚至有時候你甚至不在家。我不希望將這股個人行為的資料交給另一間標榜「我們嚴肅看待你的隱私」的創業公司。

Parsli 仍然是一個早期原型,但這是一個我真心計劃要持續開發的工具。我透過不同市場平台從不同國家訂購商品,而運輸追蹤很快就在不同承運商、語言和通知格式之間變得混亂不堪。這最初是一個關於本地 AI 工作流程的實驗,但逐漸變成了我自己實際想要使用的東西。下一步包括添加 SMS、螢幕截圖和語音訊息作為輸入來源——運輸更新並非僅僅集中於電子郵件,而是分散在多個渠道。

我還想讓系統可觀察,而不是變成另一個「黑色方盒」AI代理程式。除了儲存運送事件外,Parsli還持續記錄規則匹配、模型決策、信心水平、提取的實體、處理時間、token使用情況和分類推理。一旦離開快樂路徑的示範,郵件解析幾乎會立即變成邊緣案例的地獄,因此擁有完整的決策軌跡使調試顯著變得更容易.

示範

代码

https://github.com/olgazju/parsli

我如何使用Gemma 4

Parsli將Gemma 4用作確定性萃取流程上的推理層。

很多運送郵件根本不需要LLM。亞馬遜、聯合包裹、以色列郵政和網絡上的一半人一直在重複發送相同的模板,因此像HTML清理、追蹤號提取、發票篩選和明顯的運送更新這類事情都是通過確定性規則和語言套件處理的。在每封郵件上浪費模型調用既慢又沒有意義。

但一旦郵件開始偏離標準模板——一份多語言的海關通知、一份帶有隱藏在散文中的追蹤信息的取貨點通知、一個格式化所有內容都不同的市場——單純的確定性規則就不再足夠了。這就是Gemma作為貨物分類器以及規則之上的審計層出現的地方。

此流程首先確定性地提取結構化候選項,然後將模棱兩可的情況發送到模型進行驗證、信心估計、出貨狀態分類以及一般的「這真的有道理嗎」檢查,在結果被持續保存之前。

我儲存整個決策軌跡:哪些規則觸發了、模型輸出、信心分數、token 使用情況、時間以及最終答案來自規則還是模型。在我的實際郵箱中,在48封相關郵件中,55% 由規則解決,模型僅作為廉價審計表示同意,38% 模型實際糾正了規則錯誤的部分,其餘則分佈在邊緣情況中。單獨使用規則可能只能達到60%的效果。單獨使用模型可以處理所有情況,但會很慢且浪費資源。它們一起覆蓋了彼此的盲點。

我使用google/gemma-4-e4b在M2 MacBook Pro上透過LM Studio以無頭模式執行,用於本地推論。這個模型大小對這個工作負載來說已經足夠了。一旦你去除HTML垃圾和郵件混亂,運送追蹤是一個狹窄的結構化問題——你是在有限狀態集中進行分類,而不是寫詩。E4B給了我所需的推理品質,同時速度足夠快,可以在沒有專用GPU伺服器的情况下在本地運行,這正是整個目的。