惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

C
Comments on: Blog
酷 壳 – CoolShell
酷 壳 – CoolShell
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
李成银的技术随笔
美团技术团队
博客园 - 三生石上(FineUI控件)
爱范儿
爱范儿
Simon Willison's Weblog
Simon Willison's Weblog
Cisco Talos Blog
Cisco Talos Blog
博客园 - 司徒正美
Jina AI
Jina AI
S
SegmentFault 最新的问题
Recorded Future
Recorded Future
大猫的无限游戏
大猫的无限游戏
月光博客
月光博客
E
Exploit-DB.com RSS Feed
J
Java Code Geeks
腾讯CDC
V
V2EX
NISL@THU
NISL@THU
M
MIT News - Artificial intelligence
量子位
T
Tor Project blog
T
Threatpost
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
博客园 - Franky
Scott Helme
Scott Helme
U
Unit 42
博客园 - 聂微东
Hacker News - Newest:
Hacker News - Newest: "LLM"
雷峰网
雷峰网
Vercel News
Vercel News
GbyAI
GbyAI
MyScale Blog
MyScale Blog
Microsoft Security Blog
Microsoft Security Blog
Recent Commits to openclaw:main
Recent Commits to openclaw:main
aimingoo的专栏
aimingoo的专栏
H
Hackread – Cybersecurity News, Data Breaches, AI and More
有赞技术团队
有赞技术团队
W
WeLiveSecurity
T
Tailwind CSS Blog
S
Schneier on Security
Hugging Face - Blog
Hugging Face - Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Y
Y Combinator Blog
I
Intezer
Last Week in AI
Last Week in AI
D
Darknet – Hacking Tools, Hacker News & Cyber Security

DEV Community

Gemma 4 challenge inspired me to build my first app! 96. LoRA: Fine-Tune a Billion-Parameter Model on a Laptop From a Student Who Used CircuitVerse to a GSoC Contributor — My Community Bonding Story I asked Claude to explain the chip war and ended up understanding modern geopolitics differently Stop Manually Checking for Server Updates: Automate With Email Notifications Nostalgia Meets Cybersecurity: Spotting Modern Scams in a Retro OS Simulator - Forward or Fraud CRACKING CODING INTERVIEW From Python to Production Pipeline :A Practical guide to Apache Airflow Antigravity 2.0: Google Just Changed What It Means to Be an Engineer I Built a Free Sticker Maker Because Every Other One Hid the Export How I bypassed Blazor WebAssembly's Virtual DOM using raw WASM pointers Distributed Tracing for LLM Agents: When MCP Makes Tool Calls Observable The Zero-Budget Memory Setup Behind My AI Agent Workflow No database. No framework. Just files, startup order, correction logs, and discipline. I Built an AI Second Brain with Gemma 4 The Most Exciting Google I/O 2026 Announcement for Me: HTML-in-Canvas CrisisLens: Compressing Disaster Scenes into 200-Byte Emergency Payloads with Gemma 4 I'm 15 and I built a todo app with Telegram Stars payments — only legal way for me to monetize before turning 18 Crypto Branding After the Token Launch Building an on-chain alerts bot in Python without any blockchain library FinePrint — An AI Pocket Lawyer That Decodes Predatory Contracts Using Gemma 4 How to Connect OpenAI with Supabase in 10 Minutes for a Lightning-Fast AI MVP One AI Gateway for AWS Bedrock, Google Vertex AI, Gemini, and Anthropic Reading Log #9 — Aoashi The Tacit Dimension Thinking, Fast and Slow Web3 Onboarding Is Not a Wallet Problem. It Is a Trust Problem. FHE Prompt Privacy: The Metadata Leak Your Demo Still Has Software Might Be Becoming Agent-Aware: What if software starts coordinating itself? The Silent Killers of Go Concurrency: Mutexes, Semaphores, and Goroutine Leaks Lynx framework first look Building Aries AI: A Solo-Built AI Abacus Tutor on OpenAI + Supabase + Render + Razorpay I built a paid Telegram bot. Here's what Telegram Stars actually pay. Transfer Fees, Metadata, and Soulbound Tokens: A Tour of Solana Token Extensions Improving AI resume matching with prompt iteration — 7.37 to 8.37/10 7 things you can do with Rogue Studio that no other AI IDE will let you do Why I Think WordPress Still Matters Reading Log #7 — Aoashi Guns, Germs, and Steel Distinction Open Models and the Sub-Saharan Region What 12 Months of AI-Generated Pull Requests Taught My Engineering Team Feature Flags in .NET 8: ASP.NET Core, Minimal APIs, Blazor The Quiet Architecture of Systems That Refuse to Die From OOP to SOLID: Everything You Need to Know in One Article I Scanned 5 Common LangChain Agent Patterns. Every Single One Was Over-Permissioned. Production-Ready MCP Servers in 60 Seconds (Auth, Rate Limits, Audit Logs Included) Dari OOP ke SOLID: Semua yang Perlu Kamu Tahu dalam Satu Artikel The Most Important Part of Google I/O 2026 Wasn’t a Model — It Was the Infrastructure When SafetyCo Goes to War: Anthropic, the DOD, and the Limits of Ideals-Based Frameworks Why AI Memory Resolves Too Much — And What to Preserve Instead What Gemma 4 Means for the Future of Local AI (And Why It Matters More Than GPT-5) The Classroom Gap: Why Applied AI Has Yet to Transform How the World Learns Cell-to-Sentence (C2S): LLM-Powered scRNA-seq Annotation with Gemma 4 GitHub rust-2026-template — my Rust starter in 2026 Stop Editing JSON by Hand How I Turned an Old Movie Recommendation Project Into a Cinematic AI Platform Linux Command Line: The 25 Commands I Use Every Day (2026) The Multilingual SEO Trap: When Your Meta Description Speaks the Wrong Language young-colleague-job-worries What I Learned About Token Design on Solana as a Web2 Developer 19/30 Days System Design Questions! My first Android App - NightLock Tabula vs Camelot vs pdfplumber in 2026: Which Python Library Actually Wins? AI Agent Failure Loops: When Persistence Becomes a Quality Bug Experienced devs are slower with AI and they don't even know it Building a No-KYC Poker Bot: What I Learned Automating Crypto Tables React.lazy + chunk errors: how to recover users stuck after a deploy How I Built Clinical Trials API - From Public Data to RapidAPI in 2 Weeks Where is the Code Editor?! - Reception for Antigravity 2.0 I built a tool to catch AI coding agents misbehaving — and put zero AI in it Reading Log #5 — Aoashi Seeing Like a State Distinction [Boost] How to Build a Clinical Trial Search App in 5 Minutes - Clinical Trials API Tutorial Gemma For Dummies: I Knew Nothing. Now I'm Running AI on My Laptop. I gave an AI a Kill Switch. Here's what I learned about trust in local-first tooling. Notification System Technical Specification What ElumKit v0.1 already does (and the one primitive I missed) Why Every Student Developer Should Know About Microsoft Imagine Cup 🚀 Mikplanu: Empowering Education through Edge AI Sovereignty 터미널 AI 에이전트 구축 (v9) What If Your Portfolio Verifier Could Actually See Your UI? Node.js Event Loop Architecture — How a Single-Threaded Runtime Handles Massive Concurrency From Concept to Code: Bringing Your Vision to Life with Michael K. Laweh Caching Layers in 2026: CDN, App, DB, Query: What Goes Where Stop Wasting Tokens on Android Automation Building a GamepadTester: A Developer’s Perspective on Reading Controller Input in the Browser Your Inbox Knows Too Much: Parsli for the Privacy Paranoid I Ran Every Gemma 4 Model on My Home Lab. E4B Crushes E2B. Here's the Data. How I Use an Online TI-84 Calculator for Quick Math While Coding Building a Blog Platform with Docker #5: Add a Dockerfile + Deploy to Clouderized I Scanned 10 Popular F-Droid Apps With My Security Scanner — Open Source Secure How Microsoft Azure Ensures Reliability, Scalability, and Business Continuity Shelfie: I Built a Book Scanner That Runs Entirely on a $75 Raspberry Pi (Using Gemma 4) Beyond the Hype: Why Google AI Studio Might Become the Bridge Between African Ideas and Global Innovation I built a GitHub Action that blocks PRs when your Figma file is over budget eBPF From Scratch: from the eBPF VM to writing your own tools (tested on a live Cilium cluster) The Case of the Misidentified null A Decade After: Why We Still Can't Get the Treasure Hunt Engine Right I Solved 512+ LeetCode Problems, and Here’s What I Learned 🧠 Deeper into Dataform 2: Other API features Deeper into Dataform 1: Exploring the API Guild — A Free Autonomous Coding Agent That Escalates Through Gemma 4 Models The Web Should Become a VM rabbitholes: how I built a Chrome extension with no server to touch your data
How Bf-Tree Keeps Mini-Pages Small, Hot, and Cheap to Evict
Athreya aka · 2026-05-25 · via DEV Community

Hello, I'm Maneshwar. I'm building git-lrc, a Micro AI code reviewer that runs on every commit. It is free and source-available on Github. Star git-lrc to help devs discover the project. Do give it a try and share your feedback for improving the project.

Mini-pages are central to Bf-Tree performance, but they cannot grow forever.

If every insert, update, and tombstone remained in memory indefinitely, insertion cost would increase and memory would fill with stale records.

Bf-Tree continuously reorganizes mini-pages through merges, copying, and eviction.

When Does a Mini-Page Merge?

The paper describes two situations where a mini-page is merged back into its leaf page:

  1. The mini-page becomes too large
  2. The mini-page becomes cold (rarely accessed)

Since records inside a mini-page remain sorted, insertion cost rises as it grows.

Mini-page size increases

512B → 1KB → 2KB → ...

Insertion overhead rises

Enter fullscreen mode Exit fullscreen mode

When a mini-page exceeds roughly 2KB, Bf-Tree merges it with the leaf page and produces a larger 4KB mini-page mirroring the leaf.

The merge process is roughly:

Locate leaf page
      ↓
Check available space
      ↓
Split leaf if needed
      ↓
Merge mini-page records
      ↓
Discard old mini-page

Enter fullscreen mode Exit fullscreen mode

The old memory can then be reused.

This keeps write buffering useful without letting mini-pages become expensive secondary indexes.

Copy-On-Access Prevents Hot Data From Being Evicted

Bf-Tree stores mini-pages in a circular buffer.

Suppose a mini-page is close to eviction but suddenly becomes active again.

Instead of keeping it in place:

Old mini-page
      ↓
Copy to buffer tail
      ↓
Mark old copy as tombstone

Enter fullscreen mode Exit fullscreen mode

The frequently accessed mini-page effectively receives a second life.

Hot data moves away from eviction boundaries.

Cold data moves toward disk.

This resembles a lightweight cache promotion strategy.

Evicting Records Instead of Evicting Entire Pages

One interesting detail in the paper is that Bf-Tree can evict individual cold records.

Each record maintains a reference bit:

Accessed recently → keep
Not accessed → evict

Enter fullscreen mode Exit fullscreen mode

During copy-on-access:

  • Hot records remain in memory
  • Cold cache records are discarded immediately
  • Dirty records (insertions/tombstones) trigger merge with leaf pages

So a mini-page gradually evolves:

Before:
[A][B][C][D][E]

Only A and D are hot

After:
[A][D]

Enter fullscreen mode Exit fullscreen mode

Memory becomes concentrated around active keys.

This is much finer-grained than traditional page caches, where an entire page often survives because one record is frequently accessed.

Leaf Splits Still Exist

Despite the redesign, Bf-Tree has not eliminated classic B-Tree behavior.

Leaf pages still split when full.

The difference is where inserts land first:

Traditional B-Tree:

Insert → Leaf page → Possible split

Enter fullscreen mode Exit fullscreen mode

Bf-Tree:

Insert → Mini-page
            ↓
       Merge/Evict
            ↓
       Possible leaf split

Enter fullscreen mode Exit fullscreen mode

A split happens later, after buffering absorbs multiple writes.

Remaining mini-page records are compared against the split key and distributed across new leaf pages.

This delays structural modification and reduces immediate disk work.

The Bigger Pattern

Across merges, copy-on-access, record eviction, and splits, Bf-Tree repeatedly applies the same idea:

Keep hot data close, push cold data out, and postpone expensive disk operations as long as possible.

Mini-pages are not just a cache—they behave more like a continuously self-cleaning write buffer.

AI agents write code fast. They also silently remove logic, change behavior, and introduce bugs -- without telling you. You often find out in production.

git-lrc fixes this. It hooks into git commit and reviews every diff before it lands. 60-second setup. Completely free.*

Any feedback or contributors are welcome! It's online, source-available, and ready for anyone to use.

⭐ Star it on GitHub:


AI agents write code fast. They also silently remove logic, change behavior, and introduce bugs -- without telling you. You often find out in production.

git-lrc fixes this. It hooks into git commit and reviews every diff before it lands. 60-second setup. Completely free.

See It In Action

See git-lrc catch serious security issues such as leaked credentials, expensive cloud operations, and sensitive material in log statements

git-lrc-intro-60s.mp4

Why

  • 🤖 AI agents silently break things. Code removed. Logic changed. Edge cases gone. You won't notice until production.
  • 🔍 Catch it before it ships. AI-powered inline comments show you exactly what changed and what looks wrong.
  • 🔁 Build a