惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

GbyAI
GbyAI
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
P
Proofpoint News Feed
L
Lohrmann on Cybersecurity
S
Secure Thoughts
Attack and Defense Labs
Attack and Defense Labs
人人都是产品经理
人人都是产品经理
Stack Overflow Blog
Stack Overflow Blog
W
WeLiveSecurity
O
OpenAI News
SecWiki News
SecWiki News
博客园 - Franky
NISL@THU
NISL@THU
Microsoft Azure Blog
Microsoft Azure Blog
T
Tor Project blog
Microsoft Security Blog
Microsoft Security Blog
aimingoo的专栏
aimingoo的专栏
Security Latest
Security Latest
H
Hacker News: Front Page
Google Online Security Blog
Google Online Security Blog
P
Privacy & Cybersecurity Law Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
D
Darknet – Hacking Tools, Hacker News & Cyber Security
月光博客
月光博客
李成银的技术随笔
Spread Privacy
Spread Privacy
F
Full Disclosure
F
Fortinet All Blogs
T
The Exploit Database - CXSecurity.com
Vercel News
Vercel News
AWS News Blog
AWS News Blog
WordPress大学
WordPress大学
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
V
Visual Studio Blog
J
Java Code Geeks
博客园 - 三生石上(FineUI控件)
G
Google Developers Blog
云风的 BLOG
云风的 BLOG
博客园 - 司徒正美
Engineering at Meta
Engineering at Meta
Last Week in AI
Last Week in AI
P
Palo Alto Networks Blog
宝玉的分享
宝玉的分享
T
True Tiger Recordings
N
News and Events Feed by Topic
酷 壳 – CoolShell
酷 壳 – CoolShell
Cisco Talos Blog
Cisco Talos Blog
N
News | PayPal Newsroom
S
SegmentFault 最新的问题
Jina AI
Jina AI

Hacker News - Newest: "AI"

Hating AI Is Good Ask HN: Are there any social media sites that are AI positive? Show HN: We dropped Go for Rust in our real-time telephony AI media plane Nvidia says it has ‘largely conceded’ China’s AI chip market to Huawei Can AI solve this Bongard problem? Home — Noada Ask HN: Does anyone believe role-play AI is effective for training? Lovable’s AI built a 100% accessible site – or did it? | Axess Lab Designing a AI Access Layer for Systems of Record HiAI - HiAI IDE - HUAWEI Developer GitHub - openclaw-easy/ViralMint: Open-source viral content pipeline — scout trends, analyze competitors, generate AI videos, auto-publish. AGPL-3.0. Show HN: SoMatic – Vision-based OS automation framework for AI agents Physics AI – Free Physics Solver Online (Step-by-Step) PocketWebTools Forma – Smart AI Autofill for Job and Placement Forms (100% Local, No Cloud) Introducing AI Agent Support for Grid Global Accounts Let the AI cook | ivan.codes AI Can Seem More Human Than Real Humans in a Classic Turing Test, Study Finds Built a live multi-agent AI operations workspace for software engineering teams KiroGraph: Local code knowledge graph for AI, optimized for token efficiency GitHub - the-ai-coop/open-letter Intuit CEO says company’s 17% workforce cut had ‘nothing to do with AI’ AMD Ups Ante With 192GB Ryzen AI Max PRO 400 Chips for AI Systems I Taught an AI to Be Our On-Call Engineer AI token streaming isn't about SSE vs WebSockets — /dev/knill AI Engineering from Scratch Google search AI mode, the biggest update Gemini Omni Flash AI Video Generator | Free Online Twelve Ways to Be Wrong About AI-Assisted Coding Linki – open-source AI SDR for LinkedIn sequences and cold email Transforming Digital Pathology with AI GitHub - enzoferraripapa-arch/ai-vprocess-ops: Engineering memory for AI coding agents: requirements, decisions, evidence, traceability, and V-process/ALM handoff China has named, defined and started governing agentic AI WebMCP: I Made My Website AI Agent Ready (Here's How) Bezos defends billionaires, hypes AI, talks taxes and praises Trump in CNBC interview Growing an AI orchestration platform to $3k MRR in 4 weeks Do you enjoy reading any type of AI written text? Dust raises $40M Series B to scale multiplayer AI for human-agent collaboration SkinMax App | Your Personal Skin Care Coach Client Challenge AI red teaming agents change how LLMs get tested Standard Charter CEO Replaces 8000 "Lower Value Humans" with AI Design advice you can actually use SpaceX IPO filing lays bare losses and Musk control as it stakes future on AI Show HN: SafeRun – Replay debugging and inline prevention for AI agents 3 GitHub - sathvikc/agent-chat-bridge: Turn any AI agent chat session into an async agent. Register a timer, shell command, or webhook — the bridge automatically resumes the session with your prompt when the trigger fires. The Google AI Pro plan just got a quiet downgrade, here is the new deal Google is dethroning OpenAI as the king of consumer AI Ordo · Smart earbuds with cameras & AI TBN Protocol — Full Demo What I'd audit on an AI-built SaaS before its first paying customer The AI Client in WordPress 7.0 Show HN: SafeRun – Replay debugging and inline prevention for AI agents 2 White House briefs AI firms on plans for model review Invasion of the literary bots What Models? — Pick the right model for your GPU in seconds An AI system to help scientists write expert-level empirical software How Many Questions Can the World Afford to Ask AI? Meta Begins AI-Driven Layoffs, Report Says. Can They Boost the Struggling Stock? Benchmarking Open-EndedInference Optimization by AI Agents The Elements of Power (AI Supply Chain) JAM: DSP audio engine programmable via AI chat The SpaceX IPO filing is filled with AI bets, Starship dreams, and Elon Musk at the center Free AI Rewriter - Revise Can one run AI on source code with the prompt "Find below-avg swear rate files"? twitter.com The Developer's Guide to AI When AI can write your code, do you still need a CMS? Congress Banned a Gun Registry. AI Doesn't Need One Cloudflare CEO on how he chooses which employees to replace with AI Replacing NZ public servants with AI could come with hidden costs, critics warn How America Turned Against AI According to the Poll Data: A (Very Big) Compilation GitHub - brucehoult/k3_ai: Utility to start a program on the A100 "AI" cores on SpacemiT K3 machines. Claude.AI Pro Plan quotas too small for deep research AI slop? What about human slop? | NadathurX Token Offset · Offset the environmental cost of AI Show HN: AI Editor for Websites AI Resist List Wheelly.ai — AI in every app, with one hotkey AI atlas reveals hidden whole-body-damage caused by obesity AI robot is now a Buddhist monk Advanced AI models bring government to ‘reflection point,’ CIA official says Linus Torvalds admits he has a 'love-hate relationship with AI' Singapore inks AI deals with Google, OpenAI as ChatGPT-maker commits $234 million to local ecosystem San Francisco turns to AI to save whales from ship strikes as deaths soar What will better AI mean? Why Compiled AI makes AI Enterprise ready The Wake-Up Call for 2026 and 2027 · Greg Herlein The AI people have been right a lot Learn how to build AI products through practice The Alaska Permanent Fund as Loose Precedent for AI Data Center ‘UBI’ Payments Client Challenge The AI bots are coming and the young are booing, not applauding How to sell RL envs and data to AI labs [video] hty Guidelines for Human-AI Interaction - Microsoft Research Bezos brushes off concerns of an AI bubble: 'You shouldn't worry about it' AI Safety Is Underfunded by Design On people writing about their use of AI – Manu PreyReach — Find local-business leads with one prompt — sourced live from Google Places.
GitHub - jaroslavsoucek-art/Giovanni: AI Chief of Staff methodology for Claude Code. Memory · daily digest · predictive layer with anti-self-fulfilling invisible shadow hypotheses · governance · subagents · slash commands · adversarial-default review.
JarSou · 2026-05-21 · via Hacker News - Newest: "AI"

Extended hand and second brain.

A methodology framework for running an AI Chief of Staff inside Claude Code — memory architecture, daily digest, predictive layer, governance discipline, custom subagents, slash commands, living constitution, per-stakeholder modeling, adversarial-default review.

Distilled from a working domain-specific implementation, then sanitized into a domain-agnostic framework you can fork and fill with your own context.

In 10 seconds

Most "AI assistant" repos hand you prompts. Giovanni hands you the system layer underneath the prompts: how to structure memory so it doesn't drift, how to run a daily digest that survives weeks of compounding context, how to model stakeholders as time-series (not snapshots), how to predict reactions without contaminating the prediction (shadow hypotheses invisible to the principal — see The moat below), how to enforce honesty via lint and adversarial review.

The runtime lives in your fork, not here. This repo is templates, schemas, agents, workflows, and governance — domain-agnostic on purpose.

The moat — predictive layer with invisible shadow hypotheses

Most AI assistants either don't predict counterparty behavior, or predict it in plain sight — which contaminates the prediction. The moment you read "the model expects Sarah to push back on Series B timing", you walk into the 1:1 framing the conversation around it. The prediction becomes self-fulfilling or self-preventing; either way, the loop is broken. The model's "track record" becomes a record of how surfaced predictions changed your behavior, not how well it reads your stakeholders.

Giovanni's predictive layer is three pieces, designed against this trap:

flowchart LR
    digest["Daily digest +<br/>stakeholder updates"] -->|"writes silently"| shadow["📦 Shadow hypotheses<br/>memory/shadow/pending/<br/><br/>Invisible to principal.<br/>Not in digest, not in briefing, not in chat."]
    shadow -->|"+90 days"| review["🔍 /shadow-review<br/>quarterly adversarial lookback<br/>'where did the model miss?'<br/>default-skeptical on uncertainty"]
    review -->|"per-actor verdict"| calibration["📊 /calibration-report<br/>monthly · per-actor · per-tier<br/>healthy: 60–80 / 20–40 / 5–15 %"]
    calibration -.tunes.-> branchout["🔮 /branch-out<br/>3 tiers (no percentages)<br/>max t+2 horizon<br/>hard-stop on shallow actors"]
    situation["High-stakes situation"] -->|"active query"| branchout
    branchout -->|"visible to principal"| principal["👤 Principal acts on<br/>3-tier scenario tree"]
Loading
  1. Branch-out (visible) — active simulation for a specific situation. Three likelihood tiers, no fake percentages (numeric probabilities on small-N stakeholder predictions are vibes with arithmetic decoration). Max t+2 horizon (further is fiction). Hard-stop on shallow actors: if 2+ key actors have <5 observed touches, /branch-out refuses to run rather than emit caveat-degraded "best effort" predictions.

  2. Shadow hypotheses (invisible — the moat) — predictions the principal never sees during the prediction window. Stored in memory/shadow/pending/. Not rendered in digests. Not in 1:1 briefs. Not in chat. They become visible only at /shadow-review, after the horizon date has passed and the outcome is structurally determined. The quarterly review runs an adversarial lookback: "what are the strongest arguments this hypothesis was NOT fulfilled?" — default-skeptical on uncertainty, because generous verdicts inflate accuracy and corrupt calibration. >80 % accuracy triggers an immediate re-audit because it usually means tier labels have drifted.

  3. Per-actor calibration (monthly)/calibration-report aggregates hit rates per actor, per tier. Framework-level accuracy is meaningless; what matters is which specific stakeholders the model reads well and which it doesn't. The score is per-relationship, and it tunes the triage heuristic that gates branch-out runs.

The shadow piece is what lets you measure whether the model actually sees your stakeholders, or just generates plausible-sounding narrative. You can't fake your way through 6 months of invisible predictions and adversarial review. See docs/prediction.md for the full binding rationale (anti-self-fulfilling rule, no-recommendation principle, canonical- moves discipline, calibration healthy-range bands).

Architecture

flowchart TB
    P[Principal / you] --> Commands["Slash commands<br/>/digest · /branch-out · /review · /shadow-review · ..."]
    Commands --> Workers["Worker agents<br/>isolated context, tool-scoped<br/>(source-puller, researcher, adversarial-reviewer, prediction-runtime, ...)"]
    Workers --> State
    subgraph State["Framework state — commit-tracked"]
      direction LR
      M["Memory<br/>4-layer:<br/>MAP → shortcut → shards → deep"]
      K["Living constitution<br/>knowledge/<br/>anchored, supersedes-pointer"]
      ST["Stakeholder profiles<br/>per-actor time-series"]
    end
    State --> Workflows["Workflows<br/>daily digest · branch-out · shadow lookback · calibration"]
    Workflows -.feeds back.-> State
    Governance["Governance<br/>lint · hooks · INDEX/MAP auto-regen<br/>hard limits · audit cadence"] -.governs.- State
    Governance -.governs.- Workers
Loading

Who this is for

  • Anyone running a high-context, multi-stakeholder program (founders, chiefs of staff, heads of strategy / legal / operations) who needs an assistant that remembers across weeks without rotting into noise.
  • People who already use Claude Code and want schema-level discipline instead of stitching together yet another prompt library.
  • Builders who want to study one worked architecture of memory + governance + predictive simulation before designing their own.

Not for: people looking for an out-of-the-box assistant. The work is in filling Giovanni with your domain context and running it for months.

Quick start

# Option A — "Use this template" button on GitHub (top-right) for a clean fork.
# Option B — manual clone:
git clone https://github.com/jaroslavsoucek-art/Giovanni.git my-chief-of-staff
cd my-chief-of-staff

# Validate framework lint passes on the vanilla repo:
./scripts/lint.sh

# Read the fork-and-fill walkthrough:
$EDITOR docs/setup-guide.md

For the synthetic test domain (Alex Park / Lattice Finance — used to stress-test every template), see docs/test-domain.md and the memory/examples/*.example.md files.

Status

Setup1 architecture complete; runtime unvalidated. 8/8 specialist architects shipped; all framework layers have templates, schemas, agents, workflows, and lint integration. Not yet end-to-end runtime-tested — no fork to actual operational domain, no independent cross-validation, no Setup2 fork-and-fill walkthrough yet (see docs/setup1-complete.md § "What Setup1 did NOT include"). Hobby project — no commercial support, no roadmap promises. Built part-time by extracting the system layer from a real high-stakes program (expansion of an e-commerce platform into 6 EU markets) and stripping out the domain content.

Next stage (Setup2): fork Giovanni into a clean repo, fill with your own domain content, run actual workflows. This is where the runtime gets validated. See docs/setup1-complete.md for the bootstrap summary, docs/setup-guide.md for the fork-and-fill walkthrough (WIP — iterates as Setup2 surfaces real-world friction).

What's in scope

Layer Files Purpose
4-layer memory architecture memory/templates/, memory/examples/, memory/README.md MAP → operational shortcut → topic shards → deep storage. Graduation criteria, hard limits, audit cadence.
Living constitution knowledge/constitution.template.md, knowledge/README.md, knowledge/INDEX.template.md Single source of truth, commit-traceable, anchor IDs, supersedes-pointer, auto-INDEX.
Per-stakeholder profiles memory/templates/stakeholder.template.md, 3 Lattice examples, docs/stakeholder-profiles.md Sentiment trajectory time-series, communication style, predicted reactions, 6-value relationship-type enum.
Daily digest workflow .claude/workflows/daily-digest.md, memory/digest-{state,sources}.template.md, docs/digest.md 12-step procedure, parallel source-puller fan-out, drift detection with 7d ack expiry, brief auto-gen ≤48h, predictive integration.
Predictive layerthe moat memory/templates/branch-out.template.md, shadow-hypothesis.template.md, calibration-actor-score.template.md, memory/branch-out/canonical-moves.md, docs/prediction.md Branch-out (3-tier no-percentages, max t+2, hard-stop shallow actors). Shadow hypothesesinvisible to the principal at generation (anti-self-fulfilling rule, the binding constraint of the layer), reviewed quarterly with adversarial lookback (default-skeptical on uncertainty). Calibration scoring per-actor monthly with healthy-range bands. See the dedicated section above.
Custom subagents .claude/agents/ (8 architects + 8 workers) 7 operational worker agents + 8 framework architects. Generic, model-tagged, tool-scoped, isolated context.
Slash commands .claude/commands/ (8 commands + registry + design doc) /digest, /branch-out, /shadow-review, /calibration-report, /consistency-check, /market-radar, /review, /redline.
Adversarial-review-as-default .claude/agents/adversarial-reviewer.md, .claude/workflows/adversarial-review.md, docs/adversarial.md SHIP/REWRITE/KILL verdict (no compounds), strongest-counter-case requirement, default-critical, suspend conditions documented.
Governance + lint scripts/lint.{sh,py}, scripts/lint_rules/ (11 rules), scripts/build-{knowledge-index,memory-map}.sh, .claude/hooks/ (8 hooks), docs/governance.md Pluggable Python lint framework, INDEX/MAP auto-regen, hard-limit enforcement (300-line, 2% strikethrough), audit cadence (14d light / 35d full), classification rules.

What's NOT in scope

  • No domain content. No stakeholders by name (except Lattice synthetic test domain in examples), no real decision logs, no project specifics.
  • No vendor lock-in. Works with Claude Code today; designed to migrate to platform-native primitives (Anthropic memory tool, Dreaming, Antigravity SDK) as they ship.
  • No commercial support. MIT license; fork at your own risk.
  • No automatic value. Giovanni is templates + workers + workflows + governance. Value comes from filling it with your domain context and running it for months.

Test domain

docs/test-domain.md defines a synthetic 2nd domain (Alex Park / Lattice Finance — Series A B2B treasury automation SaaS) used to validate every template + workflow is genuinely generic. Every architect's output is stress-tested against this domain. See memory/examples/*.example.md for filled artifacts.

Stats (post-Setup1)

  • 8 architect agents + 8 operational agents = 16 total
  • 8 slash commands + 11 lint rules + 8 hooks + 8 generic scripts
  • 13 memory templates + 14 Lattice examples
  • 1 living constitution template + 1 INDEX template + 1 governance config template
  • 10 workflow/policy/design docs
  • ~104 files, ~17K lines, 19 commits

Contributing

See CONTRIBUTING.md. Hard "no domain content" rule, generic-first check before opening a PR, critical-mode default review. Hobby project — PRs may sit. Read the realistic-expectations section before opening anything bigger than a typo.

License

MIT — see LICENSE.

Origin

See docs/origin.md. Sanitized clean-room extraction from a private domain-specific implementation; no proprietary content carried over.