惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

S
Secure Thoughts
Jina AI
Jina AI
有赞技术团队
有赞技术团队
人人都是产品经理
人人都是产品经理
云风的 BLOG
云风的 BLOG
酷 壳 – CoolShell
酷 壳 – CoolShell
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
MyScale Blog
MyScale Blog
IT之家
IT之家
博客园 - 【当耐特】
The GitHub Blog
The GitHub Blog
腾讯CDC
Scott Helme
Scott Helme
Cisco Talos Blog
Cisco Talos Blog
C
CXSECURITY Database RSS Feed - CXSecurity.com
博客园_首页
H
Hackread – Cybersecurity News, Data Breaches, AI and More
K
Kaspersky official blog
P
Palo Alto Networks Blog
Microsoft Security Blog
Microsoft Security Blog
美团技术团队
T
Tor Project blog
T
Threat Research - Cisco Blogs
V
V2EX
The Cloudflare Blog
MongoDB | Blog
MongoDB | Blog
Know Your Adversary
Know Your Adversary
博客园 - Franky
Last Week in AI
Last Week in AI
T
Tenable Blog
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
The Register - Security
The Register - Security
Spread Privacy
Spread Privacy
L
Lohrmann on Cybersecurity
P
Proofpoint News Feed
S
Schneier on Security
aimingoo的专栏
aimingoo的专栏
雷峰网
雷峰网
P
Privacy & Cybersecurity Law Blog
Latest news
Latest news
博客园 - 三生石上(FineUI控件)
Google DeepMind News
Google DeepMind News
Security Latest
Security Latest
V
Visual Studio Blog
E
Exploit-DB.com RSS Feed
阮一峰的网络日志
阮一峰的网络日志
S
Security Affairs
I
Intezer
爱范儿
爱范儿
AI
AI

The Guardian

New Zealand’s North Island braces for Cyclone Vaianu with thousands ordered to evacuate Artemis II splashdown – in pictures Swalwell denies allegations of sexual assault as calls grow for him to withdraw from California governor race Trump news at a glance: Epstein survivors have words for Melania Trump after surprise statement Multiple people face charges, including murder, in California fireworks blast Rory McIlroy surges into six-shot Masters lead with stunning second-round flourish Roberto De Zerbi targets ‘Ange-ball’ revival to save Spurs from relegation Bath hit back to reach semi-final after stunning Northampton in 11-try epic Australia crash out of BJK Cup after Britain secure upset with doubles win Zebras, wealth and power: Hungary’s election tests Orbán’s grip on power ‘TikTok effect’ brings sellout crowds and younger fans to Grand National meeting King signs up David Beckham to his Chelsea flower show team The war over Omagh’s gold: the £21bn mine plan tearing a community apart Britain’s shadow workforce is paid as little as 65p an hour. Who cares for the carers? Tim Dowling: my wife is on a quest to restore my thinning hair SUVs are making Britain’s potholes worse, say scientists Blind date: ‘She claimed she was usually shy. I wouldn’t have guessed’ I’m a sauna person now: the Becky Barnicoat cartoon ‘I got everything I dreamed of – when I had no ability to handle it’: Lena Dunham on toxic fame, broken friendships and her ‘lost decade’ Six great reads: the man who let snakes bite him, masked heavy metal and the brutal reality for foreign students in the UK Meera Sodha’s recipe for noodles with rose beancurd, spring greens and egg Cuba’s doctors were a lifeline for the world. Now the Caribbean is shamefully complicit in the US drive to expel them An environmental disaster in Moldova has Russia’s fingerprints all over it ‘This is as important as your teeth’: are you skipping this key part of mouth hygiene? Man arrested after four die trying to cross Channel in small boat Ukraine war briefing: doubts linger in Kyiv over Moscow’s promise to uphold Orthodox Easter ceasefire Ichiro Suzuki statue unveiling goes awry as bronze bat snaps during ceremony Arrest of national war hero Ben Roberts-Smith cuts deeply to core of Australian psyche European football: Real Madrid held at home by Girona to extend winless run ‘You come back different’: how rugby players change after motherhood Human rights groups decry US plan for Guantánamo camp for Cuban migrants Potential US host cities for 2031 Women’s World Cup games mull withdrawal over Fifa concerns Arne Slot insists he is ‘aligned’ with Liverpool board and fans as squad is rebuilt Kamala Harris ‘thinking about’ running for president again in 2028 JD Vance warns Iran against trying to ‘play’ the US in peace talks West Ham double up twice to thrash Wolves and put Spurs in relegation zone Trump administration releases new renderings of so-called ‘Arc de Trump’ Bafta apologises for events surrounding John Davidson’s Tourette’s outburst Cocktail of the week: Bar Shrimp’s la rosita – recipe New drug may extend survival in aggressive ovarian cancer, trial shows One dead and 27 injured after bus with British passengers crashes in Canary Islands OpenAI CEO Sam Altman’s home targeted with molotov cocktail Alarm as acting CDC director delays report showing Covid vaccine benefits Argentina just ripped up its pioneering glacier law. What does this mean for millions of people’s drinking water? ‘Illegal’ forest service overhaul risks causing ‘chaos’ across US public lands, union claims Prince Harry sued for defamation by charity he co-founded Anthropic’s new AI tool has implications for us all – whether we can use it or not Concerns raised about motorbike tourist trail after death of British teenager in Vietnam The Guardian view on Trump’s civilisational threats: the words that fuel war must be condemned The Guardian view on dystopias for our times: the American nightmare Weather tracker: Cyclone Maila batters Solomon Islands with 115mph winds Doctors’ leader claims new reduced pay offer killed chances of ending strikes in England Netanyahu-ism has achieved nothing for Israelis – and come at a monstrously high price Deborah Levy: ‘CS Lewis’s White Witch terrified me – but I wanted to meet her’ How I Shop with Michelle Ogundehin: ‘We grownups have enough stuff already’ ‘Butter Birkin’: popcorn plastic It bag in demand by Devil Wears Prada fans Trump’s war and Melania’s Epstein statement, with US editor Betsy Reed – The Latest Orbán and Magyar trade accusations in last days of Hungary election campaign Reckonwrong: How Long Has It Been? review | Safi Bugel's experimental album of the month Martin Rowson on Middle East peace talks – cartoon Fears of UK and EU flight cancellations as airports warn of jet fuel shortages Peers vote to ban pornography depicting sex acts between stepfamily members Week in wildlife: an ostrich on the lam, a tortoise crossing a road and surfing seals ‘There’s no shortage of terrifying technology’: how AI became TV drama’s new go-to villain Texas court overturns sentence for man on death row for nearly 50 years Power up! Could force be the secret to supercharging your fitness? ‘Irresponsible failure’: Google, Meta, Snap and Microsoft slam EU over child sexual abuse law lapse Blank canvas: what to wear with white trousers Critics assemble! Here’s my list of the greatest superhero movies of all time Amazon to finally launch Leo satellite internet in ‘mid-2026’, says CEO Pete Hegseth’s holy war: the militant Christian theology animating the US attack on Iran Toxic putdowns, brutal zingers ... and an unexpected love story – inside the joyful climax to brilliant sitcom Hacks Add to playlist: the beautifully dazed, countrified indie-rock of Tracey Nelson and the week’s best new tracks ‘I’m worried there’s too much of me,’ says a birch: inside the interspecies council giving nature a voice Dolce & Gabbana says co-founder Stefano Gabbana has quit as chair Why is anyone surprised by the US and Israel’s latest war? It’s only what the world allowed them to do in Gaza Super Mario what?! The seven best obscure Mario games Holly Humberstone: Cruel World review – Taylor Swift fave trades gothic melancholy for pop glow-up Thrash review – cursed shark thriller sinks like a stone on Netflix ‘The biggest, baddest, saltiest chick you would ever see’: why no one sang the blues like Big Mama Thornton Go Gentle by Maria Semple review – a joyfully clever New York romcom ‘Tranquil, natural and barely a tourist in sight’: readers’ favourite hidden gems in Spain Benjamina Ebuehi’s sweet and salty chocolate chip cookies recipe ‘I’m not a commercial director – I’m not even a professional film-maker’: Jim Jarmusch on the seven-year journey to make his new film Malcolm in the Middle: Life’s Still Unfair review – the TV magic they’ve created here is absolutely miraculous The Miniature Wife review – Matthew Macfadyen is wasted in this pointless comedy From soups and greens to roots, how to survive the ‘hungry gap’ From fat transplants to LED mittens: how the fear of ‘old lady hands’ mobilised the beauty industry Anna Wintour’s Vogue cover is more than a cameo – it’s a power play ‘They’re gonna make me cry’: I competed at a speed puzzling championship You be the judge: should my girlfriend stop mixing gold and silver jewellery? Maritime and port workers: how is the Middle East conflict affecting you? How games capture the awe and terror of cosmic isolation Why does alcohol make us both happy and miserable – and what else does it do to our minds and bodies? I never text back – and it’s ruining my relationships The pet I’ll never forget: Beau, the labrador who saved my life Life Is Strange: Reunion review – a decade-long story comes to an impassioned close Why is gaming becoming so expensive? The answer is found in AI Sign up for the First Edition newsletter: our free daily news email Sign up for the Feast newsletter: our free Guardian food email
The Anthropic ‘Fable’ saga proves: we have opened the AI Pandora’s box. What now? | Nathan E Sanders and Bruce Schneier
https://www.theguardian.com/profile/bruceschneier · 2026-06-16 · via The Guardian

On 9 June, Anthropic released its Fable generative AI model. Three days later, the US government classified it as a dangerous munition, and used its export-control authority to prohibit any foreign nationals from accessing it. Unable to differentiate between Americans and foreigners, the company shut off access for everyone.

The government’s actions won’t help. The problem isn’t any one particular models; it’s the general trend of increasing AI capabilities. And any real solution requires the sort of collective action that just isn’t possible right now.

Fable is the constrained version of Mythos, the AI model Anthropic announced in April. It only released it to a few selected organizations, because it claimed it was so good at finding and exploiting vulnerabilities in computer code that it releasing it more generally would be dangerous.

It was an obviously self-serving announcement, and because few were able to verify Anthropic’s claims they was met with some skepticism. Those with access used Mythos to find, and patch, many vulnerabilities in their own software. But one UK group found the latest, already public, OpenAI model to be just as powerful.

Fable is just another incremental improvement in the years-long climb of AI capabilities. But just as important as the AI model is the “harness”. This is typically not AI. It’s ordinary computer code that interfaces with the user. It stitches together AI models, decides how and for what purposes they can be used, and gives them useful tools such as web search and the ability to run it’s own computer code.

When Mythos first entered limited release, there was widespread debate whether its power came from the model or the harness. With Mythos demonstrating that it was possible, the open-source community scrambled to build harnesses that could steer other AI models towards similar capabilities.

They largely succeeded. For example, a Prague company was able to replicate Anthropic’s few verifiable cybersecurity capabilities with a much smaller and cheaper model – and a more sophisticated harness. Last week, a group showed that multiple cheaper models harnessed in concert matches Fable’s performance.

The broader community had only a few days with Fable, but that time we learned some about its capabilities. It’s difference is less the new model’s raw analytical and problem solving capabilities, and more that the model doesn’t need that sophisticated harness.

Fable requires much less expertise and detailed prompting from the human user. You can give it a difficult goal and it will figure out novel and unexpected ways to satisfy it, finding loopholes in whatever constraints you or the system have imposed on it.

“Relentlessly proactive” is how AI researcher Simon Willison described it. Another descriptor might be “creative”. Experienced AI developers have had that combination of creativity and proactivity since last year, but Fable puts it within easy reach of everyone.

In the hands of someone with a legitimate problem that needs solving, that can be an incredibly useful capability. But in the hands of someone who wants to do harm, it can be equally dangerous. AIs don’t have a moral compass in the same way that people do. They are agents of the wants and desires of the people who prompt them.

That points to the real problem with relentlessly proactive AI. In language, wants and desires are always underspecified. If I ask you to get me some coffee, you would probably pour me a cup from the coffeepot, or buy one from a nearby coffee shop.

You couldn’t buy me a pound of raw beans, or a coffee plantation. You wouldn’t order a cup of coffee for delivery next month. You wouldn’t find a nearby person, rip a cup of coffee out of their hands, and bring it to me. I wouldn’t have to specify any of the million limitations to my request; you would just know.

Human stories are filled with warnings about underspecified desires. King Midas wished that everything he touch turn to gold, forgetting to add “but not my food, drink, and daughter”. And genies are notorious for granting your wish in a way you wish he hadn’t.

The deeper point is that it’s impossible to list all limitations and restrictions and, like a malicious genie, a creative AI will find the ones you forgot. Block a database you don’t want it to have access to, and it might figure out how to bypass your control. Ask it to book a flight, and it might hack the airline because the website says the flight is sold out. Ask it to save money on your cellphone plan, and it might cancel it altogether – or get someone else to pay for it. As far as we know now AI has not done any of this yet, but you get the idea.

Malicious intent is not required. To an AI model, constraints are just things to get around and not general truisms about the world. They are creative problem solvers and natural rule breakers. They “hack” in the sense that they find and exploit loopholes.

Human systems rely on so many norms that we scarcely recognize the existence of until they are broken. AIs naturally think outside the box, because they don’t have any real conception of what the box is or why it’s there in the first place.

There is no foolproof way to prevent people from using AI models to complete harmful tasks. There is no way to prevent the models from incidentally causing harm while completing benign tasks. AI models are no longer isolated from the real world. They browse the internet and answer emails.

They trade stocks and make purchases. They control physical systems. They are, in effect, robots that affect life and property. We have no technical mechanisms to verify the integrity of an AI system. This level of capability and creativity in the hands of us untrustworthy humans will have both great and terrible results.

The problem is not unique to Anthropic. Mythos/Fable might currently be the most capable rules hacker, but more sophisticated harnesses give other models similar capabilities. And we should assume that the other frontier models are no more than a few months behind, and that open-source models are less than a year behind. At best, any ban only serves to delay the problem for a short while.

That delay might be useful if we – as a society, as a planet – would use that time to come together and figure out what to do. This isn’t a US/China arms race problem; this a species-level problem that requires coordinated action at that scale. Unfortunately, we have no mechanism to do that. I first wrote about this problem five years ago, but it was all too futuristic.

Today, when its right in front of us, there is no world government that can impose constraints on the for-profit corporations currently controlling AI models and research. The US has no appetite to effectively and even-handedly regulate those corporations, even as they do catastrophic damage to the environment, democracy, and – in this case – society in general.

This all makes an AI public option all the more necessary, and urgent. Today’s AIs can be fast, smart, and secure, but only two of the three are possible for any given system. These safety tradeoffs are tightly held secrets of companies racing to beat one another, and they tell us we have to trust them. Instead, the choices and their consequences need to be brought out into the sunlight.

We should be funding open-source harnesses that balance capability and safety – that achieve useful goals without so much power – and open-source AI models whose provenance and biases are public and well understood. We have opened the AI Pandora’s box. Now we have to make the best of it.

  • Bruce Schneier is a security technologist who teaches at the Harvard Kennedy School at Harvard University