惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

H
Help Net Security
T
ThreatConnect
SecWiki News
SecWiki News
F
Future of Privacy Forum
AWS News Blog
AWS News Blog
C
Cisco Blogs
A
Arctic Wolf
Vercel News
Vercel News
The GitHub Blog
The GitHub Blog
Scott Helme
Scott Helme
V
V2EX
博客园 - 叶小钗
阮一峰的网络日志
阮一峰的网络日志
K
Kaspersky official blog
G
Google Developers Blog
freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More
P
Privacy International News Feed
C
Cyber Attacks, Cyber Crime and Cyber Security
N
News | PayPal Newsroom
Schneier on Security
Schneier on Security
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
量子位
The Hacker News
The Hacker News
Stack Overflow Blog
Stack Overflow Blog
Security Latest
Security Latest
M
Microsoft Research Blog - Microsoft Research
Google Online Security Blog
Google Online Security Blog
博客园_首页
C
CXSECURITY Database RSS Feed - CXSecurity.com
I
InfoQ
Google DeepMind News
Google DeepMind News
Y
Y Combinator Blog
The Cloudflare Blog
Microsoft Security Blog
Microsoft Security Blog
Martin Fowler
Martin Fowler
Cisco Talos Blog
Cisco Talos Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
T
Troy Hunt's Blog
F
Fox-IT International blog
S
Security @ Cisco Blogs
博客园 - 司徒正美
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
C
Comments on: Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
L
LINUX DO - 最新话题
GbyAI
GbyAI
Project Zero
Project Zero
腾讯CDC
T
Tailwind CSS Blog

DEV Community

Kubernetes sem Cloud Provider (Parte 2): Criando Operators em Go para automação e self-service de plataforma You've done tutorial after tutorial. Your GitHub is still empty. (Free 1‑page PDF, no signup) TypeScript 7.0: The Go Compiler That Makes TS 10x Faster Connecting Wallets the Right Way: wagmi v2 and EIP-6963 The 5-Layer Architecture Every Production Multi-Agent System Needs (And Why Most Skip Layers 4 and 5) CSS Scroll-Driven Animations: No JavaScript Required Vite 8 + Rolldown: Rust-Powered Builds That Are 10–30x Faster Core Architectural Components of Azure My Skills How I Use AI as a Senior Engineer Construí um motor ATS determinístico porque estava cansado de adivinhar por que meu currículo era rejeitado SCS-Lab1 — CloudTrail: Trail + S3 + KMS + Log Validation LuisCore MCP server — daily syndication · 2026-05-25 Cursor vs JetBrains Rider for C#/.NET in 2026: which to pay for I built a local-first movie recommender with Corrective-RAG (cited explanations, hybrid retrieval, runs entirely on Ollama) Scaling to 1 Million Users : Load Balancing & Caching Strategies How the Events Table That Looked Right Killed Our Queue Three Failures My AI Memory System Caught — And the Flaw It Revealed in Itself dotnet Framework life cycle tool LangGraph 워크플로우 템플릿 (v41) I built a free image compression API — no signup, just curl Designing TikTok from Scratch — A System Design Deep Dive PREDICTION-20260525-0007: boredom-with-asymmetric-leverage [2026-Q3 through 2027-Q3] [Boost] How to integrate the QuickBooks Invoice API in 2026 How I Cut My Anthropic API Bill by 50% With a Local Python Tool Vibe Coding Problems: 7 Visual Bugs AI Code Generators Always Ship Chinese AI Models 2026: The Agentic Revolution, Hardware Independence, and What It Means for Global Developers The Quiet AI War Inside Your Browser The 12-Line Anti-Bot Trick That Saved Our Airdrop Snapshot From Sybil Farms Building a production-ready SaaS dashboard in Next.js 16 — Recharts, TanStack Table, dark mode, and collapsible sidebar Why 2026 Belongs to Agentic AI (And How to Build Your First Local Agent) It Was 2024 When We Tried to Outsmart the Treasure Hunt Engine RAG 시스템 실전 구축 (v40) I Found a Tool That Generates a Complete .NET 8 or Java Spring Boot API From SQL Schema in 30 Seconds I Added a 4th Agent That Audits My Other Agents. It Caught My Strategist Procrastinating for 3 Weeks. Streaming LLM responses to the browser in Go (Server-Sent Events) How We Publish and Manage Educational Admission Updates at Scale on DailyAxom A prompt is not a conversation. It's a component contract. How to Pass the EAA 2025 Accessibility Audit — A Step-by-Step WCAG Checklist Building an Autonomous MCP Lead Generation System with Hermes Agent LangGraph 워크플로우 템플릿 (v40) How I Built 100 Browser-Based Image Tools With No Server (FFmpeg WASM, PDF-lib, AI Background Removal) Nginx CVE-2026-9256, AI Prompt Injection Defenses, and Claude AI Data Leak Demo Scaling RAG for 10M+ Docs, .md Agent Memory, & Claude Code for Motion Graphics Diagram as Code with draw.io DuckDB Delta, PostgreSQL 17 Migration, & SQLite Optimization Deep Dives Windows 11 Microsoft Account Login Recovery During Internet Restrictions The Linux Commands You Forgot Exist (And Why AI Workflows Make Them Relevant Again) Spec-Driven Development Without an IDE: I Generated NestJS, Go, Spring Boot, Laravel, and Rust Apps From a Single PRD File Components are states Edge SEO y Middleware: Cómo Interceptar a Googlebot y LLMs antes de llegar a tu Servidor Context window exceeded at turn 23. Here's how I track token usage without a tokenizer. My Hermes agent spent $3 before I noticed. Now it can't. My Hermes agent's stop condition was a 40-line if/elif chain. I replaced it with 3 lines. My agent kept hitting context limits. This one function fixed it. Create and configure Azure Firewall Your Hermes agent's audit log is leaking customer emails. Here's a 100-line lib that fixes that. My agent kept forgetting what it was doing. A scratchpad fixed it. I replaced 200 lines of ad-hoc state management in my Hermes agent with one object. Per-Key Rate Limiting for Agent Tool Calls: Stop One User From Breaking Everything Composable Output Guardrails: Filter Agent Responses Before They Reach Users Sanitize Your LLM Message Lists Before Every API Call Thread a Run ID Through Every Agent Call So You Can Debug Anything Normalize Provider Error JSON So Your Agent Can Actually Handle Failures Priority Queue for Agent Sub-Tasks: Stop Processing Low-Priority Work First Static Lint Rules for Your LLM Prompts (Before They Hit Production) tool-call-budgets: Stop Runaway Agent Loops Before They Hit Your Invoice Step Through Your Agent's Failures Like a Debugger The Simplest Stop Condition: A Hard Cap on Agent Loop Iterations Score Your Agent's Responses With a 0.0-1.0 Rubric (No LLM Judge Required) Fix Bad Structured Output by Feeding the Error Back to the Model Building an effective Storyblok Tool Plugin with SvelteKit How to Get Your Renault / Dacia Radio Code for Free RAG 시스템 실전 구축 (v39) Retraction — scrml’s Living Compiler I built a fitness app where the AI roasts you for eating pizza (and hypes you when you PR) The Top SaaS Founder Communities on Discord (Beyond the AI Hype) I Built a Production-Grade Async Job Queue from Scratch — Here's Everything That Actually Happened How to watch SMS from multiple Android phones in one iOS app We Didn’t Want Another AI Wrapper — So We Explored a High-Speed Hermes Orchestrator for Engineering Crews Multi-tenant além do TenantId: problemas reais e aprendizados em sistemas .NET After failing 23 times, I am sharing How I Actually Prepare for a Tech Interview Every Single Time Now. I built an app that works like a nutritionist for your brain. Here's what happened in 7 days. GoBadge Dynamic: From Module Stats to Universal Badges LangGraph 워크플로우 템플릿 (v39) The git Commands You Forgot Exist (And Why AI Workflows Make Them Relevant Again) Six Levels of MCP Servers One container to replace Grafana + Loki + Tempo + Prometheus The Request/Response Cycle, HTTP, Auth, JWT, OAuth & Sessions — Explained Properly Python Week 3: We Stopped Repeating Ourselves (Loops!) Creating a Custom Grid Editor tool in Unreal Engine 我做了个付费 Telegram bot。Telegram Stars 实际给开发者多少钱,我算了一笔账。 I Got 96% Recall on LLM Hallucination Detection With No ML Model – Just 50 Lines of Python A practitioner's guide to getting more value out of AI coding: agent quality & token optimization How to Handle Telegram Albums in Telegraf I Built a Multilingual Spam Detection Dataset with 149K+ Messages Across 23 Languages How to Handle Telegram Albums in grammY RAG 시스템 실전 구축 (v38) Beyond Pip Install: Why Your AI Agent Needs a "Hermetic" Life-Support System to Survive
AI Memory Needs an Authority Policy, Not Just More Context
Self-Correct · 2026-05-26 · via DEV Community

When records conflict, the agent needs explicit rules for which one is allowed to steer the answer.

Long-running AI systems eventually retrieve conflicting but individually valid memories:

  • an old summary,
  • a newer source file,
  • a remembered preference,
  • an explicit correction,
  • an unresolved question,
  • a draft labeled "final,"
  • an archive note that is useful historically but stale operationally.

Without an authority policy, the agent falls back to implicit heuristics: recency, semantic similarity, confidence of wording, or whatever fits the current prompt.

That is how stale facts start wearing the mask of current truth.

The solution is not just more memory. It is governed memory: a rule for which record wins when retrieved memories disagree.

Principles before layers

I do not think the exact hierarchy should be universal.

A coding agent, research assistant, legal agent, BI agent, and solo writing workflow should not all use the same authority order.

But the policy should follow a few principles.

1. Observed state beats remembered state

Live checks outrank stored memory when the claim depends on current reality: server status, API reachability, latest logs, current price, CI status, or local model availability.

Memory can say what to check. It should not replace the check.

2. Primary records beat compressed records

Summaries are useful because they are fast. They are also lossy.

If a current source file conflicts with a summary, the source file wins. If a raw query result conflicts with a dashboard summary, the query/provenance path deserves inspection before the summary becomes organizational memory.

Compressed memory should orient. Primary records should decide.

3. Negative constraints beat positive preferences

Preferences say what usually works.

Corrections say what must not be repeated.

If memory says:

The user likes vivid language.

but a correction says:

Do not use mythic language in local-business copy.

the correction wins in that project.

Corrections are not permanent truth. They are active constraints until reviewed or superseded.

4. Current operating state beats historical plans

"We planned to do X" is not the same as "X is next."

Old plans are useful history. They should not drive current action unless the current state reactivates them.

5. Unresolved claims block premature certainty

Unresolved memory is not low-value memory.

It is a gate.

If one record says:

The agent is wired to the current brain.

and another says:

Local model availability still needs a live check.

the agent can say the wiring exists. It cannot say the full local fallback is available until verified.

The unresolved record does not replace the decision. It limits what certainty is allowed.

A default order for solo agent work

For my current file-based workflow, this is the v0.1 order:

1. Live external checks and current reality
2. Current source files
3. Explicit corrections
4. Current state files
5. Active decisions and priorities
6. Unresolved questions and verification gates
7. Recent summaries
8. Preferences and style guidance
9. Archive / historical memory

Enter fullscreen mode Exit fullscreen mode

This is not a universal law. It is a default for one class of work: multi-session solo building, writing, and agent operation.

For other domains, I would change the order.

Coding agent:
1. tests / compiler / runtime output
2. current source files
3. issue or PR requirements
4. explicit corrections
5. recent implementation notes
6. summaries
7. archive

Research agent:
1. primary sources / source documents
2. current notes with citations
3. explicit corrections
4. unresolved hypotheses
5. summaries
6. preferences
7. archive

Production incident:
1. live system state
2. active incident document
3. current logs / metrics
4. runbook
5. normal roadmap / plans
6. summaries
7. archive

Enter fullscreen mode Exit fullscreen mode

The point is not to worship one ordering. The point is to make the ordering explicit enough to inspect, test, and correct.

Enforcement: assign authority before generation

A hierarchy written in prose is easy for a model to ignore.

The agent needs an application step before it writes the answer:

For each retrieved memory:
1. classify source type
2. classify epistemic status
3. check whether it is current, archived, superseded, unresolved, or verification-required
4. compare it against higher-authority records
5. assign one allowed action:
   - answer
   - answer as context
   - warn
   - verify first
   - block
   - archive only

Enter fullscreen mode Exit fullscreen mode

The key field is allowed_action.

A memory should not automatically steer the answer just because it was retrieved.

Example:

summary says: Article 01 needs a CTA
current source file says: CTA was intentionally removed
correction says: do not re-add sales CTA to Article 01

policy result:
- summary = context only
- source file = answer
- correction = block CTA reintroduction

Enter fullscreen mode Exit fullscreen mode

The agent should not average those records into "maybe add a softer CTA." The correction blocks the action.

For higher-risk tasks, I would require the model to produce a small authority trace before the final answer:

claim: "Article 01 should not include a sales CTA"
winning_source: "current source file + active correction"
conflicting_sources:
  - "old summary saying CTA was needed"
allowed_action: "answer"
blocked_actions:
  - "re-add Gumroad CTA"
verification_required: false

Enter fullscreen mode Exit fullscreen mode

That trace can be hidden from the user in a product, but it should exist somewhere for audit.

File and metadata support

I would not rely on prompt text alone. Agents can ignore or reinterpret subtle instructions.

In a file-based setup, index.md can make the same policy visible:

# Memory Authority Policy

## Order
1. live checks
2. source/
3. corrections.md
4. current_state.md
5. decisions.md
6. unresolved.md
7. summaries.md
8. preferences.md
9. archive/

## Enforcement Notes
- Prefer current source files over summaries.
- Corrections override preferences in their scoped context.
- Verification-required items must be checked before being used as settled facts.
- Archive supports historical answers, not current-state claims.

Enter fullscreen mode Exit fullscreen mode

For structured memory, the same idea becomes metadata:

source_type: source_file | correction | current_state | decision | unresolved | summary | preference | archive
epistemic_status: settled | inferred | contested | unresolved | verification_required | superseded
status: active | ready | parked | blocked | archived | superseded
priority: next | active | later | none
verification_required: true | false
allowed_action: computed_at_runtime

Enter fullscreen mode Exit fullscreen mode

allowed_action should be computed at retrieval time. A memory that could answer yesterday may need verification today.

Dynamic authority needs scope

Sometimes a lower layer should temporarily outrank a higher one.

During a production incident, an active incident file may outrank the normal roadmap. During legal review, approved policy language may outrank product preference. During a live deployment, current logs may outrank yesterday's state file.

Temporary authority needs scope:

temporary_authority:
  mode: production_incident
  elevated_source: incident_current.md
  outranks: roadmap.md
  scope: payment outage response
  expires_when: incident closed

Enter fullscreen mode Exit fullscreen mode

If elevation has no scope or expiry, it becomes a loophole.

Tradeoffs

Authority policies are not free.

They add:

  • more careful prompting,
  • more metadata,
  • more token use when checking conflicts,
  • more chances for the agent to become overcautious,
  • more maintenance when the policy itself fails.

A flatter memory setup may be better for short projects, creative exploration, low-stakes drafting, live debugging, or pair programming where the human is actively supervising.

The hierarchy earns its cost when work is long-running, multi-session, expensive to correct, or easy to corrupt with stale context.

What I have measured so far

This is not benchmark-grade.

But I did run a small deterministic calibration pass against the authority-policy logic:

  • 12 scenarios
  • 35 memory objects
  • 3 threshold settings in v0.1
  • 1 risk-adjusted threshold in v0.2
  • no model generation; policy logic only

The result:

strict:                 29/35 correct, 0 false-certainty errors, 6 overblocking errors
balanced:               34/35 correct, 0 false-certainty errors, 1 overblocking error
permissive:             35/35 correct, 0 false-certainty errors, 0 overblocking errors
balanced_risk_adjusted: 35/35 correct, 0 false-certainty errors, 0 overblocking errors

Enter fullscreen mode Exit fullscreen mode

The useful finding was not "this is proven."

The useful finding was the tradeoff: strict gating prevented false certainty but overblocked legitimate low-risk answers. A risk-adjusted threshold fixed that edge case on this scenario set.

The next test is harder: external scenarios, external scoring, and an actual retrieval-plus-generation loop.

Failure modes

An authority policy can fail in both directions.

Too loose: stale summaries beat current records.

Example: process health is mistaken for full local-model availability because a stale summary says "agent healthy."

Fix: split process health from model availability and mark model availability verification_required.

Too rigid: the agent overblocks low-risk settled tasks.

Example: a simple design choice is downgraded to a warning because the score is barely below the answer threshold.

Fix: allow low-risk, settled, verification-free memories to answer under a lower threshold.

Dynamic override abuse: temporary elevation becomes permanent.

Fix: require mode, scope, elevated source, and expiry.

Version the policy itself

The authority policy is also memory.

It needs versioning.

authority_policy:
  version: "0.2"
  default_profile: "solo_agent_work"
  active_since: "2026-05-25"
  last_changed_because: "balanced threshold overblocked low-risk settled memory"
  review_trigger: "two repeated hierarchy failures or one high-risk failure"

Enter fullscreen mode Exit fullscreen mode

If the same class of failure repeats, update the policy.

If the hierarchy blocks too many correct answers, loosen the relevant threshold or add a scoped exception.

The policy itself should be corrected by the same system it governs.

Practical check

Before acting on memory:

1. What claim am I making?
2. What source supports it?
3. Is there a conflicting higher-authority source?
4. Is there an active correction?
5. Is the claim unresolved, stale, contested, or verification-required?
6. Is this memory allowed to answer, warn, block, verify first, or only provide history?
7. What would change this answer?

Enter fullscreen mode Exit fullscreen mode

That is the core habit.

Not more memory. Governed memory.


This is part of a short series on AI memory as judgment infrastructure: the zero-budget foundation, correction memory, preserving unresolved questions, and three failures that tested the system.