The Extreme Inefficiency of RL for Frontier Models — Toby Ord - 惯性聚合

推荐订阅源

Engineering at Meta

大猫的无限游戏

酷壳 – CoolShell

罗磊的独立博客

WordPress大学

博客园 - 司徒正美

Visual Studio Blog

SegmentFault 最新的问题

钛媒体：引领未来商业与生活新知

博客园 - Franky

奇客Solidot–传递最新科技情报

让小产品的独立变现更简单 - ezindie.com

博客园 - 三生石上(FineUI控件)

Apple Machine Learning Research

宝玉的分享

Tailwind CSS Blog

The Blog of Author Tim Ferriss

博客园 - 【当耐特】

The GitHub Blog

美团技术团队

DataBreaches.Net

Proofpoint News Feed

The Cloudflare Blog

aimingoo的专栏

Check Point Blog

博客园 - 聂微东

Google DeepMind News

Java Code Geeks

Full Disclosure

阮一峰的网络日志

freeCodeCamp Programming Tutorials: Python, JavaScript, Git & More

The Register - Security

Stack Overflow Blog

Writing - Toby Ord

Broad Timelines — Toby Ord Hazard Rates for AI Agents Decline as a Task Goes On — Toby Ord Are the Costs of AI Agents Also Rising Exponentially? — Toby Ord How Well Does RL Scale? — Toby Ord Evidence that Recent AI Gains are Mostly from Inference-Scaling — Toby Ord Is there a Half-Life for the Success Rates of AI Agents? — Toby Ord Inference Scaling Reshapes AI Governance — Toby Ord Inference Scaling and the Log-x Chart — Toby Ord The Scaling Paradox — Toby Ord The Precipice Revisited — Toby Ord On the Value of Advancing Progress — Toby Ord Robust Longterm Comparisons — Toby Ord The timing of labour aimed at reducing existential risk — Toby Ord A Child's Plaything — Toby Ord Remembering Peter Eckersley — Toby Ord Casting the Decisive Vote — Toby Ord

The Extreme Inefficiency of RL for Frontier Models — Toby Ord

September 19, 2025Toby Ord · 2025-09-19 · via Writing - Toby Ord

此内容由惯性聚合(RSS阅读器)自动聚合整理，仅供阅读参考。原文来自 — 版权归原作者所有。