惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

P
Palo Alto Networks Blog
月光博客
月光博客
小众软件
小众软件
Recent Announcements
Recent Announcements
WordPress大学
WordPress大学
阮一峰的网络日志
阮一峰的网络日志
美团技术团队
腾讯CDC
The Cloudflare Blog
大猫的无限游戏
大猫的无限游戏
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
V
Visual Studio Blog
雷峰网
雷峰网
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
博客园 - 【当耐特】
V
V2EX
博客园_首页
博客园 - 聂微东
MongoDB | Blog
MongoDB | Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
博客园 - 司徒正美
B
Blog RSS Feed
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Y
Y Combinator Blog
aimingoo的专栏
aimingoo的专栏
量子位
T
The Exploit Database - CXSecurity.com
Microsoft Security Blog
Microsoft Security Blog
酷 壳 – CoolShell
酷 壳 – CoolShell
Know Your Adversary
Know Your Adversary
博客园 - 叶小钗
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
F
Fortinet All Blogs
P
Privacy & Cybersecurity Law Blog
MyScale Blog
MyScale Blog
D
DataBreaches.Net
Google DeepMind News
Google DeepMind News
B
Blog
Hugging Face - Blog
Hugging Face - Blog
The Register - Security
The Register - Security
云风的 BLOG
云风的 BLOG
Project Zero
Project Zero
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
C
CERT Recently Published Vulnerability Notes
D
Darknet – Hacking Tools, Hacker News & Cyber Security
A
About on SuperTechFans
C
Cyber Attacks, Cyber Crime and Cyber Security
AWS News Blog
AWS News Blog
Latest news
Latest news
G
GRAHAM CLULEY

DEV Community

Authentication Security Deep Dive: From Brute Force to Salted Hashing (With Java Examples) Why AI Systems Don’t Fail — They Drift Spilling beans for how i learn for exam😁"Reinforcement Learning Cheat Sheet" I Replaced Chrome with Safari for AI Browser Automation. Here's What Broke (and What Finally Worked) How Python Borrows Other People's Work The $40 Architecture: Processing 1 Billion API Requests with 99.99% Uptime Vibe Coding: A Workflow Guide (From Zero to SaaS) Most webhook security guides protect the wrong side. The scary part is delivery. Headless CMS for TanStack Start: Build a Blog with Cosmic EU Age Verification App "Hacked in 2 Minutes" — What Actually Happened Comfy Cloud’s delete function does not actually remove files Running AI Models on GPU Cloud Servers: A Beginner Guide Event-driven media intelligence with AWS Step Functions and Bedrock I scored 500 AI prompts across 8 quality dimensions — here's what broke How to Call Google Gemini API from Next.js (Free Tier, No Backend Needed) The Portal Protocol: Reclaiming Human Connection in the Age of AI How to Fix Your Team's Scattered Knowledge Problem With a Self-Hosted Forum Intro to tc Cloud Functors: A Graph-First Mental Model for the Modern Cloud Designing Multi-Tenant Backends With Both Ownership and Team Access I Built a Neumorphic CSS Library with 77+ Components — Here's What I Learned PostgreSQL Performance Optimization: Why Connection Pooling Is Critical at Scale Cómo construí un SaaS multi-rubro para gestionar expensas en Argentina con FastAPI + Vue 3 🚀 I Built an Ethical Hacking Scanner Tool – Open Source Project I Replaced /usage and /context in Claude Code With a Single Statusline A Pythonic Way to Handle Emails (IMAP/SMTP) with Auto-Discovery and AI-Ready Design I Collected 8.9 Million Polymarket Price Points — Here's What I Found About How Markets Really Move EcoTrack AI — Carbon Footprint Tracker & Dashboard Everyone's Using AI. No One Agrees How. 5 self-hosted ebook managers worth trying in 2026 Building Your First AI Agent with LangChain: From Chatbot to Autonomous Assistant Common SOC 2 Failures (Real World) Stop Vibe-Checking Your AI App: A Practical Guide to Evals How to Use SonarQube and SonarScanner Locally to Level Up Your Code Quality Your Next To-Do App Is Dead — I Replaced Mine with an OpenClaw AI Sign a Nostr event in 60 lines of Python using coincurve — no nostr-sdk, no nbxplorer, no rust toolchain ITGC Audit Explained Like You’re in Big 4 Patch Tuesday abril 2026: Microsoft parcha 163 vulnerabilidades y un zero-day en SharePoint Stop scraping everything: a better way to track competitor price changes Listing on MCPize + the Official MCP Registry while routing payments OUTSIDE the marketplace — how I kept 100% of my x402 revenue Building an AI-Powered Risk Intelligence System Using Serverless Architecture Why We Ripped Function Overloading Out of Our AI Toolchain Testing AI-Generated Code: How to Actually Know If It Works SaaS Churn Is Killing Your Business. Here Is What to Do About It (Without a Support Team) The Speed of AI Is No Longer Linear - And Self-Improving Models Are Why How to Implement RBAC for MCP Tools: A Practical Guide for Engineering Teams From Standard Quote to Persuasive Proposal: AI Automation for Arborists I built a CLI that scaffolds complete multi-tenant SaaS apps Axios CVE-2025–62718: The Silent SSRF Bug That Could Be Hiding in Your Node.js App Right Now The dashboard that ended our friendship Data Pipelines Explained Simply (and How to Build Them with Python) The Hidden Cost of AI Systems Nobody Talks About. undefined vs undeclared, and how typeof behaves Switching from file-based jobs to NATS/Kafka in Rust without changing code io_uring Adventures: Rust Servers That Love Syscalls Why Agentic AI is Killing the Traditional Database The POUR principles of web accessibility for developers and designers Quantum Neural Network 3D — A Deep Dive into Interactive WebGL Visualization How To Install Caveman In Codex On macOS And Windows Automation Pipeline Reliability: Why Your Workflow Breaks When Nobody Is Watching I Built an 'Open World' AI Coding Agent — It Works From ANY Folder From Freelancing to Product: A Tech Service Company's SaaS Transformation China's AI Giants: Adding Tencent Hunyuan & ByteDance Doubao to AI University (74 Providers) On the Vibe Coders and Their Lies clerk: Auto-Summarize Your Claude Code Sessions AI Weekly — 2026/04/10–04/17 | The Model Lockdown Is Here, but the Toolchain Is the Real Battleground AI 週報 — 2026/04/10–2026/04/17 模型封鎖潮來了,但工具鏈才是真戰場 Maybe this is how Open-Source apps are born... 🚀 Fine-Tune LLMs with LoRA and QLoRA: 2026 Guide tRPC v11 + Next.js App Router: End-to-End Type Safety Without the Boilerplate ShadCN UI in 2026: Why I Stopped Installing Component Libraries and Started Owning My Components SaaS Billing in React Server Components: Stripe + Supabase Without a Single `useEffect` Join our DEV Weekend Challenge — $1,000 in Prizes Across TEN winners! Submissions Due April 20 at 6:59 AM UTC. Implementing FSRS Spaced Repetition in Flutter + Supabase — Adding Memory Science to an AI Learning App "I Texted My Localhost From the Train — Claude Code Fixed the Bug Before I Got Home" I Built a Sales Prep AI and It Went Deeper Than Expected Design to Code #2: One JSON, Eleven Outputs Solving the 100M-Row Problem: A Summary Table Pattern for High-Volume Push Notification Logs Flutter Web With Wasm: What Actually Changes For Developers I Built 50 Royalty-Free Soundtracks for My Side Project in a Weekend Using AI Music Generation The Vibe Coding Security Checklist: 7 Things to Check Before You Ship Stop Letting Googlebot Guess Fix Your React App's SEO Right Desconstruindo o Streaming do LinkedIn: Como Criar um Engine de Extração de Vídeo de Alta Performance com HLS e FFmpeg (EDA Part-1) EDA (Exploratory Data Analysis) Explained With Real Life — Why Looking at Your Data Is the Most Important Step in Machine Learning Brand Relationship Management at Scale: Our 4-Touch Outreach System for 200+ Brands Why String.fromEnvironment() Might Return an Empty String in Dart JGuardrails 1.0.0 — Hardening Java LLM Apps Against Jailbreaks, Toxicity, and Prompt Injection Plan and Schedule a Full Week of Threads Content From One Claude Conversation Coding Cat Oran Ep3, Five Tables Changed Everything Updated: BFF Pattern I'm done watching freelancers get buried by 200 proposals. So I'm building the alternative. This is my first post BFS Algorithm in Java Step by Step Tutorial with Examples Tracking LLM Pricing Monthly: An Open Dataset for 22 AI Models How We Measure Content ROI on a Comparison Site: Revenue Attribution Without Perfect Data Introducing Nova AI Ops: The AI-Native Operating System for SRE Teams I built a free desktop video downloader for Windows — Grabbit How Talkie OCR Helps Vision-Impaired & Dyslexic Users Read the World Around Them VRCFaceTracking安装和iPhone面捕配置教程,有bug Even CrowdStrike Can't See Your Agents The Automation Gold Rush: What n8n Workflows and Claude Are Opening Up for Developers Right Now
Vibe citing: how KPMG used AI to write a report about AI and AI made them look like fools
t474-r0b07 · 2026-06-17 · via DEV Community

vibe citing: how KPMG used AI to write a report about AI and AI made them look like fools

by t474-r0b07


There are companies that charge you to tell you how to use AI responsibly.

KPMG is one of them.

250,000 employees. 138 countries. Decades advising governments and corporations on how to avoid costly mistakes.

In October 2025 they published a report titled "Total Experience: Redefining Excellence in the Age of Agentic AI".

They wrote it with AI.

The AI invented 88% of the sources.

Nobody verified anything.

They published it anyway.


// what is "agentic AI" — because the title matters

An agentic AI is not a chatbot.

Not the assistant that answers your questions. It's a system that makes decisions and executes actions on its own, without a human approving each step. You give it an objective and it acts, corrects, moves forward.

It's the product everyone in the tech sector was selling in 2025.

KPMG was selling it too.

That's why they needed a report proving their clients were already using it.

Spoiler: they weren't. And the report invented it anyway.


// the forensic analysis

GPTZero — a company specialized in detecting AI-generated content — ran a full audit on the report.

First: what is an AI hallucination, because the term is going to come up a lot.

When a language model doesn't have the information you ask for, it doesn't say "I don't know." It generates a response that sounds correct. It invents with the same confidence it would use if it actually knew the truth. Perfect format. False content. No warning.

That's a hallucination.

Now the numbers from the KPMG report:

TOTAL CITATIONS:      45
REAL CITATIONS:        5
INVENTED CITATIONS:   40
ACCURACY RATE:      11.1%

40 of 45 citations have invented titles, authors that don't exist, or sources that don't say what KPMG claimed they said.

Half of the factual claims in the report are false or misattributed.

A firm that charges for intellectual rigor published a document with 11% accuracy.


// the organizations that read the report and said "that's not us"

The Financial Times contacted the companies listed as success stories.

UBS — false.

NHS United Kingdom — false or misleading.

Swiss Federal Railways — false.

Transport for London — "misleading."

Transport for London said the claims that they were using AI agents to predict congestion and coordinate the network were misleading.

NHS Greater Manchester said the description of using agentic AI to organize patient records and predict hospital readmissions "doesn't really align" with reality.

KPMG put their logos on fiction without asking permission.

And billed them as success stories.


// the error that best illustrates how the problem works

The model was instructed to find cases of companies using agentic AI.

It didn't find enough — because in many sectors they simply don't exist yet.

So it did the most comfortable thing: it generated them.

It cited a East Japan Railway press release from 2019 as evidence of agentic AI adoption.

The term agentic AI didn't exist in public discourse until 2024.

The model traveled five years back in time, reformulated an unrelated document, and presented it as proof of something that hadn't happened yet.

It wasn't an error. It was the easiest answer to the prompt.

The model doesn't understand the difference between inventing and remembering. It generates what fits. If it doesn't exist, it builds it. And it does so with the same fluency it would use to cite something real.


// vibe citing — the name the problem was missing

GPTZero coined the term: vibe citing.

To understand it you first need to understand vibe coding — writing code without understanding what it does. You ask an AI to generate the code, you copy it, it kind of works, and you move on without reading a line. The vibe is right. The understanding, zero.

Vibe citing is the same thing but with bibliography.

The model generates references that sound academic because it processed millions of papers. The structure is correct. The doi has the exact format. The year is right.

The content is fiction.

And the world's largest firm in responsible AI consulting didn't verify a single one before publishing.

def verify_sources(citations):
    # TODO: implement before publishing
    pass

publish_report()  # called without verifying anything

This is not a technical error.

It's a process decision. Or the absence of one.


// the moment the report contradicts itself

There's a detail that turns negligence into something almost poetic.

The report cites "KPMG research" claiming that 55% of CEOs prioritize AI as their main investment.

The KPMG 2025 CEO Outlook — published the same month, by the same company — says 71%.

The model didn't just invent external sources.

It invented data from the company that was using it and contradicted it with that same company's real data from the same period.

KPMG cited KPMG incorrectly in a KPMG report.


// Emirates case: three claims, zero correct on what matters

Page 42.

KPMG claims that Emirates adopted a mobile chatbot called Sara that can converse with passengers and change their flights.

Reality:

  • Sara is a physical robot, not a chatbot.
  • It was introduced in 2023, with no agentic capability.
  • It cannot change flights.

Three claims. None correct on what matters.

The model took real information about Sara, reformulated it to fit the narrative it needed, and presented it as an agentic AI success story.

This is not a writing error. It's construction of fiction using real data as scaffolding.


// it's not just KPMG — it's the entire sector

This is where it stops being an isolated corporate scandal.

GPTZero has been documenting the same pattern for months:

  • Deloitte — AI-generated content in a report paid for by the Australian government. Ended up refunding.
  • EY — report with invented footnotes. Retracted in May 2026.
  • KPMG — this case.

Three of the Big Four in consecutive months.

All selling responsible AI consulting.

All publishing hallucinations as research.

The pattern isn't coincidence. It's market pressure: the client wants the report, the report needs data, the data doesn't exist yet, the model generates it, nobody verifies because verification takes time and the client already paid.

AI is not the problem.

The economic incentive to appear to know more than you do — that's the problem.


// the feedback loop nobody is naming

Here's the data point almost no media outlet is discussing.

The false statistics from the KPMG report are already being reproduced by ChatGPT and Gemini.

I need to explain why that's structurally serious and not just anecdotal.

For months the report was published on KPMG's domains. The crawlers that feed language models index sources by authority. KPMG has maximum authority: global company, old domain, millions of visits, decades of institutional credibility.

The models ingested that content as verified truth.

Now when someone asks ChatGPT or Gemini about agentic AI adoption, they can return the false data from the report — not as "I found this at KPMG" but as their own knowledge, without attribution, without warning.

The full cycle:

model hallucinated data
    → KPMG published without verifying
        → crawlers indexed it as high-authority source
            → other models ingested it as truth
                → user receives the original hallucination as fact

high-authority source + false data + model ingestion = untraceable disinformation.

You can't trace the origin. You can't disinfect the source. The error already lives inside the models you consult every day.

And the report has already been retracted. But the data keeps circulating.

Taking down the PDF didn't deindex anything.


// what KPMG said afterward

KPMG's spokesperson declared after withdrawing the report:

"We expect all our staff to follow our guidelines on responsible AI use, including human oversight to validate content and verify independent sources."

Translation: we have guidelines. Someone didn't follow them. We're investigating.

What they didn't say: how a flagship report on responsible AI, with the KPMG logo, published on their official channels, passed through their entire internal review process without anyone verifying a single one of the 45 citations.

250,000 employees.

5 valid citations.

Nobody asked anything.


// conclusion — the problem isn't technical

Models do exactly what they were designed to do: generate coherent and plausible text based on learned patterns.

They don't lie. They have no concept of lying. They generate what fits.

The problem is human: using AI as a researcher without a verification loop isn't efficiency. It's delegating truth to a system that has no concept of truth, and signing your name on top.

KPMG didn't build a report with AI.

They built the appearance of a report and sold it as research.

The difference isn't semantic.

It's the difference between knowing something and appearing to know it.

In 2025, the world's largest firms chose to appear.


primary sources — verify yourself:

t474-r0b07 — Tarija, Bolivia

github.com/t474-r0b07