
























Fusion of frontier models beating Fable, or cheaper models matching Fable performance at half the cost. Great announcement timing.
What is missing in the article is the reasoning/effort levels, so it is not ruled out the results differ just due to different reasoning budgets.
I would also be interested in seeing coding performance on SWE benchmarks.
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。