慣性聚合 高效追讀感興趣之博客、新聞、科技資訊
閱原文 以慣性聚合開啟

推薦訂閱源

博客园 - 司徒正美
V
V2EX
T
Tailwind CSS Blog
有赞技术团队
有赞技术团队
aimingoo的专栏
aimingoo的专栏
Apple Machine Learning Research
Apple Machine Learning Research
IT之家
IT之家
Blog — PlanetScale
Blog — PlanetScale
A
About on SuperTechFans
月光博客
月光博客
T
The Blog of Author Tim Ferriss
宝玉的分享
宝玉的分享
Martin Fowler
Martin Fowler
博客园 - 聂微东
The GitHub Blog
The GitHub Blog
V
Visual Studio Blog
WordPress大学
WordPress大学
酷 壳 – CoolShell
酷 壳 – CoolShell
Engineering at Meta
Engineering at Meta
GbyAI
GbyAI

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python)
二人工智能之评而同者,非二评也:吾之学验言而用之也
Michel Faure · 2026-05-24 · via DEV Community

一夕之间,二度查核,分数如一

五月十七日暮,吾成版本0.4.1之。对案之具 乃决意呈诸外审。吾将宣言及十四则悉数誊入ChatGPT-4o会话,复将同一内容尽录于Claude.ai网页会话。静候。少顷,二者之评皆至。一方得分八分,他方亦然。于理论之具,二者之讥皆近同——布迪厄之引,无操作之实——所陈简略之议亦无二致,论M1-M5之新器,角度亦如一。吾初念之持,仅三十秒耳。二独立之评者,分数同,批驳亦同——此道之校准无误,吾可刊行矣。

然吾止之。盖其会通之中,有物若双购自同肆之验风之器,铮铮作响。

何谓双会之AI,非二度之测也?

吾甚明其会通所测。二言模型,训于重合甚巨之文类——技文、公GitHub库、Stack Overflow论辩、十载博客——所生之误相协。其共通者,乃其共学之交集,非吾所呈之外实。当二者皆觉其理器不称,吾非学其真。吾学其两文类之共统计,认此为是体之常弊。

诚然,二人之评者虽会于一时,然其独立之见犹分。然此喻实为惑人。二人之评者,其生平各异,解读不同,时怀异见。二大语言模型,则共基之质,无此文理。经交叉验证,依古典认识论,必假源于独立。以同文料训练二统计模型,其独立非自具,必待证之,而鲜有能证者。

此直觉或可免吾数日之徒劳重写。然犹存臆测。吾欲得实据以探之。

三日三探

初探,是夜。余尝别营一事,乃游戏开发之库,遂研其理。WebFetch上周。有位克劳德助手曾告我,此工具返"二十五兆字节PDF之全文OCR",吾乃依此而建摄入之管。吾于当前会话中运行此令,以验之。原始之输出,印于控制台:maxContentLength size of 10485760 exceeded。此证乃技之不可能。此管乃基于虚无之机制。吾未尝试之,盖因助者之措辞,自信而有条理,似真而可信也。

次日,再探之。同游戏开发者之根基。一对话之审察,使我指向一技术博客,其被描述为"含十八域执政之务"— 正是吾所求之物。未及刮削,吾已先行之。WebFetch于博客之索引。粗返:八开发纪年之文,所宣布之主题,无冠词乃一通顺之幻象。其模式"某某之博客含X"者,模型轻易生成,盖因其文法合理,然无内证以验其事之实。

是日第三次探,及于其道本身。外有Claude,经claude.ai共享,曾审吾doctrine-counterpart前日已存档,并确认之,截图附焉。六份SKILL.md文件,皆破 YAML 前置之章— 诊断确凿,用以证七分半之贬。是日,吾行。yaml.safe_load于仓中十二SKILL.md之文皆然。十一有二可,一有十二坏所宣之系统瑕疵,实无其事。真断裂之文件,视效评估者未得辨识——盖其已断定之故也。体制之弊不举一例。其所见者,乃GitHub之Markdown渲染吞其前文,呈为表格,此乃宿主渲染之异态,与原始源码之状无关。

外有三证,皆被驳斥。非有偏合,非存微辞——三证皆为其探所破。无质验之,此率不可见也。

诸法之则——Counterpart Toolkit v0.7 之 Am.R12

余于五月廿日,此三事之后二日,更 R12。其正文:

"凡外AI(非Claude、ChatGPT、对辩之属)所陈之辞,若涉(a)可察之器之行,(b)外物之实,(c)可取真值之系之构——皆须验而后信,方为构架之据。验之费约等一壳令。不验而信之害,则全管皆托于虚无之机。二外AI之评会于同断,实为一源,非二——跨基独立之证,需一人或异械之探(如录、度、试运行)。"

孰能争二评者胜于一者乎?无人——然此法当细析之。双大语言模型,会聚非佐证。乃孤子会聚—一统于数理,然无验于外。三质反诘,以应六月前,一元智械之长所闻之难。

首,共训之数。二模其文类相重于要者,于同之句域生共误。其合度量其学之交集。似此,会通之机,当更易于此。寻常之态陈词——预训练所识为成文者——而非诚然之论。此非同义也。

次之,工具有关之幻象。论及特定工具之行为,或外部资源之实况,二模型往往幻生统计最可能之结果,此常谬,且几必自信言之。吾之三探,即此之真象也。

三,则此不对称之费。物探之费,一壳令,十五秒,时或不逮。构架之决,基于未验之会,可费数日之重作,且需一管以自本而建。修订之R12,于此不对称之费,令探为必,而后可任任何取此索为入之令。

终章

修习之反,非疑也。乃探也。而探之始,在于己——先以R12验己之断,而后施于人,核己之仓(repo)以yaml.safe_load 莫轻信目验之审,或谄或贬。若代理者无实质异见,非为对手,乃能言之打字匠也。二代理者苟合,未尝考验,非为二评者,实乃同一打字匠之复本。此律一言可蔽之。

# Before adopting an external claim about a tool / resource / structure:
$ <the material command that could have falsified it>

Enter fullscreen mode Exit fullscreen mode

此库:github.com/michelfaure/doctrine-counterpart,Am.R12显于CLAUDE.md,其三肇始之事载于v0.7-candidates.md。若尔后之架构决断,有一可避未验之信,则此规已偿其值矣。


相契工具包,版次0.7,修订R12,于公元2026年5月20日,就N=3之多元基质事件。有三外显之申索,有三虚妄之证伪,各有一壳命令。许诺CC-BY-4.0。