
Data2Story turns a CSV file into a verified interactive news article using seven AI agents
Jonathan Kemper · 2026-06-20 · via The Decoder
Three-part diagram showing how the Data Journalist Agent transforms a CSV dataset about card choices into a multimodal website with text, an interactive demo, and charts through research, data analysis, and narrative storytelling.
Data2Story turns a raw dataset into a verifiable, multimodal web article, shown here with a dataset on the card choices of 1,354 respondents. | Image: Lin et al.

The authors demo the system on a dataset that's gotten little coverage so far, the 2026 FIFA World Cup schedule. From the schedule and host cities, it generates a climate-focused article with an interactive map.

About four in ten matches are slated for locations the players' union FIFPRO classifies as extremely high heat risk, with humidity rather than air temperature as the main driver. The authors stress these are typical climate conditions, not a forecast for the actual tournament.

Six screenshots of three automatically generated data stories covering the 2026 FIFA World Cup and climate, ArXiv submissions from 1991 to 2026, and time-use diaries, each with a title image and matching data visualization.
Data2Story generates stories from datasets with zero human input, from World Cup stadium climates to ArXiv trends to how people spend their day. | Image: Lin et al.

An "Inspector" panel makes every claim traceable

The system's core feature is the "Inspector," a panel showing structured evidence for each sentence and asset. Every annotated sentence, chart, and interactive element gets its own index card displaying either the exact line of code (plus the data file behind it) or the external URL backing a claim.

Screenshot of a generated article about playing cards, with statements linked via arrows to two types of evidence, an external reference article and a Python script that reproduces the stated value of 20.1 percent.
The Inspector links each statement to either an external source or a runnable script that recalculates the figure from the data. | Image: Lin et al.

As a result, 93 percent of all visible statements can be traced back to their origin. That doesn't mean they're correct, the researchers stress, just verifiable. Doubt a figure? Run the code. The baseline for human-written articles is 25 percent, partly because journalists rarely publish analysis code. The gap reflects both a hole in journalism practice and a strength of the system, the researchers claim.
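To make the traceability idea concrete, here is a minimal sketch of what an evidence record and the traceable share could look like. The `EvidenceCard` class, its field names, the file names, and the example statements are all hypothetical and are not taken from the Data2Story code. The sketch only illustrates the distinction the researchers draw: a statement counts as traceable when it points to code plus data or to an external URL, whether or not the claim itself is correct.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EvidenceCard:
    """Hypothetical evidence record for one visible statement."""
    statement: str
    script_path: Optional[str] = None  # code that recomputes the figure
    data_path: Optional[str] = None    # data file the script reads
    source_url: Optional[str] = None   # external reference backing the claim

    def is_traceable(self) -> bool:
        # Traceable means an origin is recorded, not that the claim is correct.
        has_code = self.script_path is not None and self.data_path is not None
        return has_code or self.source_url is not None


def traceable_share(cards: list[EvidenceCard]) -> float:
    """Fraction of statements backed by code plus data, or by a URL."""
    if not cards:
        return 0.0
    return sum(card.is_traceable() for card in cards) / len(cards)


# Illustrative cards only; none of these come from a real article.
cards = [
    EvidenceCard("Illustrative statement A", script_path="analysis.py", data_path="data.csv"),
    EvidenceCard("Illustrative statement B", source_url="https://example.com/report"),
    EvidenceCard("Illustrative statement C"),
    EvidenceCard("Illustrative statement D", script_path="analysis.py", data_path="data.csv"),
]
print(f"{traceable_share(cards):.0%}")  # prints 75%
```

In this toy example three of four statements carry an origin, so the share is 75 percent. The 93 and 25 percent figures reported by the researchers would come from the same kind of count over real articles.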

Seven agents, one editorial workflow

Behind each article sits a chain of seven specialized agents the team calls a "virtual newsroom." The "Detective" runs web searches for context, since a table alone rarely tells the full story. For the World Cup data, it links host cities to FIFPRO heat risk ratings and Open-Meteo climate data.

The "Analyst" runs code instead of guessing numbers. The "Editor" picks which findings drive the narrative. The "Designer" chooses the right medium, say a map for geography or an audio clip for music. The "Programmer" builds the HTML page, the "Auditor" checks layout for errors, and the "Inspector" ties everything back to sources.
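The division of labor described above can be sketched as a simple chain of stages that pass a shared draft along. This is an illustration under stated assumptions, not the authors' implementation: the `make_stage` helper, the `Draft` dictionary, and the strictly sequential order are hypothetical, and each stage here only logs what its role would do instead of calling a model.

```python
from typing import Callable

Draft = dict  # shared state handed from stage to stage (hypothetical shape)
Stage = Callable[[Draft], Draft]


def make_stage(role: str, task: str) -> Stage:
    """Build a placeholder stage that only records what the role would do."""
    def stage(draft: Draft) -> Draft:
        draft.setdefault("log", []).append(f"{role}: {task}")
        return draft
    return stage


# Role names follow the article; the task strings are one-line summaries.
NEWSROOM: list[Stage] = [
    make_stage("Detective", "search the web for context"),
    make_stage("Analyst", "run code to compute figures"),
    make_stage("Editor", "pick the findings that drive the narrative"),
    make_stage("Designer", "choose a medium for each finding"),
    make_stage("Programmer", "build the HTML page"),
    make_stage("Auditor", "check the layout for errors"),
    make_stage("Inspector", "link statements back to sources"),
]


def run_newsroom(csv_path: str) -> Draft:
    """Pass one draft through every stage in order."""
    draft: Draft = {"csv_path": csv_path}
    for stage in NEWSROOM:
        draft = stage(draft)
    return draft


result = run_newsroom("dataset.csv")
for entry in result["log"]:
    print(entry)
```

Running the Inspector as a final step is a simplification. In the real system it ties intermediate results from the other roles back to the finished article, so it likely needs access to more than the last stage's output.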

Pipeline diagram of the virtual newsroom with the roles Detective, Analyst, Editor, Designer, Programmer, and Auditor, which process the data step by step into a finished HTML article, while the Inspector links all intermediate results to the final article.
Each agent role in Data2Story's virtual newsroom handles one step from research to layout. The Inspector links every statement back to its source. | Image: Lin et al.

The base model is Claude Opus 4.7 running on Claude Code. For images, video, and audio, the system pulls in OpenRouter models like gpt-5.4-image-2, seedance-2.0, and lyria-3-pro-preview.

53 readers rate agent articles higher than human originals

The researchers paired 18 public datasets with matching human-written originals from three distinct sources. They used the concise briefings from The Economist, the lavishly designed long reads from The Pudding, and the community datasets from TidyTuesday. 53 recruited readers rated both versions across five categories: visual design, narrative rhythm, data transparency, verifiability of claims, and insight gained.

Data2Story won all five categories. The biggest lead was in transparency, at +1.49 on a seven-point scale. Overall, 74 percent preferred the agent article, 25 percent the human version, and 2 percent called it a draw.

By source, the picture shifts. The agent won clearly in data-heavy Economist briefings and TidyTuesday pieces. Against Pudding reports, which design teams often spend weeks crafting, it was a statistical tie. The agent couldn't beat handcrafted presentation.

Bar charts comparing agent and human across 18 article pairs. The agent writes more but shorter sentences (82.2 vs. 56.6 sentences and 16.0 vs. 20.9 words per sentence) and covers 50.4 percent of the human perspective compared to 35.1 percent the other way around.
Across 18 article pairs, Data2Story covers about half the human perspective, while journalists catch only a third of the agent's, most strikingly in The Economist. | Image: Lin et al.

When measuring which statements from the human-written article also appear in the agent-generated article, Data2Story covers about half. Conversely, only 35 percent of the agent’s statements are found in the human text.
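The two-way comparison is easy to misread, so here is a small sketch of how such an asymmetric coverage score works. It assumes each article has already been reduced to a set of finding labels and that matching is exact. Both are stand-ins, since how the researchers actually match statements between texts is not described here. The `coverage` function and the example labels are hypothetical.

```python
def coverage(reference: set[str], candidate: set[str]) -> float:
    """Share of the reference findings that also appear in the candidate."""
    if not reference:
        return 0.0
    return len(reference & candidate) / len(reference)


# Illustrative labels only; real statements would need fuzzy or semantic matching.
human = {"finding_a", "finding_b", "finding_c", "finding_d"}
agent = {"finding_a", "finding_b", "finding_e", "finding_f", "finding_g", "finding_h"}

human_covered = coverage(human, agent)  # 2 of 4 human findings appear in the agent text
agent_covered = coverage(agent, human)  # 2 of 6 agent findings appear in the human text
print(f"{human_covered:.1%} {agent_covered:.1%}")  # prints 50.0% 33.3%
```

The overlap is the same in both directions. Only the denominator changes, which is why an agent that writes more statements can cover half of the human article while the human article covers a smaller share of the agent's.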

The agent adds plenty of its own angles but only partly captures the editorial core. The gap is widest in short, formulaic Economist briefings, where the agent reproduces 73 percent of human findings, likely because those texts hew closely to standard statistics the agent calculates anyway.

Where humans still win

The researchers flag three areas where human authors stay ahead. On editorial perspective, reporters explain things the data can't. A Repair Cafe report traces low repair rates to manufacturers of phones, cars, and tractors deliberately blocking access to diagnostic tools and parts. That's a theory grounded in reporting, not data. The agent shows what breaks, but the "why" stays hidden.

Comparison of two article versions on Repair Cafes. The human report above includes explanatory text about the right to repair, and the agent version below shows a bar chart of repair rates sorted by the top twenty product types.
The human report explains why repairs fail. Data2Story only charts repair rates by product type. | Image: Lin et al.

On creative design, a Pudding piece on stand-up comedy turns the full transcript of an Ali Wong show into a user interface. Next to each line sits a circle sized to the length of the laugh. For the same content, the agent just embeds a static YouTube thumbnail.

Comparison of two article versions on a stand-up show. The human Pudding report above uses the full transcript as a user interface, and the agent version below shows a static Netflix thumbnail and play button.
The Pudding team turns the entire transcript into the interface. Data2Story embeds a clickable thumbnail. | Image: Lin et al.

On dense single graphics, an Economist visualization on the space race layers government and commercial providers, success rates, and annotations into one image. The agent scatters the same data across several charts, and the main point gets lost.

Comparison of two space race visualizations. The densely annotated Economist graphic above shows government and commercial launch providers in a single view, and the interactive agent version below uses a year slider and bare launch numbers without annotations.
The Economist packs government and commercial launches plus annotations into one graphic. Data2Story spreads the data across an interactive view without the notes. | Image: Lin et al.

A collaborator, not a replacement

The authors frame Data2Story as a newsroom tool. Humans bring perspective and reporting, while agents handle computation, graphics, and machine-verifiable sourcing.

It could prove most useful for topics newsrooms can't cover for lack of capacity: niche datasets that would otherwise never become a readable story. One limitation is that Data2Story currently runs on full autopilot. A version with human-in-the-loop feedback is left for future work. The site is live at data2story.github.io, and the code is on GitHub.

Machine-verifiability is exactly where current AI systems keep stumbling. A recent Peking University benchmark found that leading models often give the right answer in document analysis but cite the wrong sources, a problem the researchers call "attribution hallucination."

Another study suggests AI search agents often don't research at all but mostly confirm what they already know from training. Data2Story tries to close this gap by having the Analyst calculate figures with runnable code instead of guessing and having the Inspector link every statement to its source. Perplexity takes a similar tack with "Search as Code," where models write their own web searches instead of calling a black-box API.