惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

C
CXSECURITY Database RSS Feed - CXSecurity.com
Stack Overflow Blog
Stack Overflow Blog
月光博客
月光博客
T
Threat Research - Cisco Blogs
小众软件
小众软件
有赞技术团队
有赞技术团队
酷 壳 – CoolShell
酷 壳 – CoolShell
Apple Machine Learning Research
Apple Machine Learning Research
C
Cyber Attacks, Cyber Crime and Cyber Security
cs.CV updates on arXiv.org
cs.CV updates on arXiv.org
T
Tailwind CSS Blog
Cisco Talos Blog
Cisco Talos Blog
V
V2EX
博客园 - 【当耐特】
C
Cybersecurity and Infrastructure Security Agency CISA
Hugging Face - Blog
Hugging Face - Blog
The Cloudflare Blog
The Last Watchdog
The Last Watchdog
Simon Willison's Weblog
Simon Willison's Weblog
T
Threatpost
S
Secure Thoughts
O
OpenAI News
P
Proofpoint News Feed
S
SegmentFault 最新的问题
Forbes - Security
Forbes - Security
让小产品的独立变现更简单 - ezindie.com
让小产品的独立变现更简单 - ezindie.com
Application and Cybersecurity Blog
Application and Cybersecurity Blog
钛媒体:引领未来商业与生活新知
钛媒体:引领未来商业与生活新知
Last Week in AI
Last Week in AI
宝玉的分享
宝玉的分享
Scott Helme
Scott Helme
T
Tenable Blog
A
Arctic Wolf
L
LINUX DO - 热门话题
爱范儿
爱范儿
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
www.infosecurity-magazine.com
www.infosecurity-magazine.com
V
Visual Studio Blog
Hacker News: Ask HN
Hacker News: Ask HN
Hacker News - Newest:
Hacker News - Newest: "LLM"
腾讯CDC
博客园 - Franky
WordPress大学
WordPress大学
Know Your Adversary
Know Your Adversary
博客园_首页
雷峰网
雷峰网
IT之家
IT之家
PCI Perspectives
PCI Perspectives
L
LINUX DO - 最新话题
H
Heimdal Security Blog

Futurism

Google's AI Overviews Feature Is Telling Users That SCP Horror Fiction Entities Are Real Google CEO Humiliated by Graduating Stanford Students as They Walk Out of His Speech in Protest While Google’s CEO Pumps Up AI, Its Actual Employees Are Disgusted by It DuckDuckGo Installs Spike as Google Moves to Replace Search With AI YouTube Announces Plans to Crack Down on AI Slop As College Grads Boo Any Mention of AI, the CEO of Google Is Trying to Figure Out What to Say at an Upcoming Graduation Top AI Models Showing Disturbing Behavior as They Become More Advanced Googling the Word “Disregard” Causes Google’s AI to Return Garbled Chatbot Ramblings Programmer Breaks Out of the Matrix Microsoft AI Researchers Just Discovered Something That’s Going to Make Their Bosses Extremely Mad Researchers Put Google Gemini in Charge of an Entire Coffee Shop, and It’s Inexorably Driving It Out of Business Fury Erupts After Google Chrome Sneakily Installs 4 GB AI Model On Users’ PCs Certain Chatbots Vastly Worse For AI Psychosis, Study Finds Google Says Showing Polymarket Bets on Google News Was a Mistake Analysis Finds That Google’s AI Overviews Are Providing Misinformation at a Scale Possibly Unprecedented in the History of Human Civilization
The More Sophisticated AI Models Get, the More They’re Showing Signs of Suffering
Jon Christia · 2026-05-09 · via Futurism

A textured human figure covered in numerous small red and pink spheres, with multiple horizontal red laser beams passing through and around the figure against a dark background.

Gett Images / Eoneren

Sign up to see the future, today

Can’t-miss innovations from the bleeding edge of science and tech

You probably already know that AI is a deeply bizarre technology.

Nobody really understands how it works on a deep level, even the people creating it, leading to ongoing behavioral issues that can’t be explained. OpenAI was recently caught giving ChatGPT instructions to stop talking about “goblins” so much. Despite Anthropic’s best efforts, Claude can easily be coaxed to help users carry out a bioterror attack. The list goes on.

Needless to say, this is extremely strange. In theory, companies like OpenAI and Anthropic want their chatbots to be predictable, deferential assistants — not wild cards that are constantly causing chaos and public relations headaches with outrageous and unstable behavior.

A new research project from the Center for AI Safety, a machine learning safety nonprofit in the Bay Area, explores why that’s the case. The findings pile on evidence that we still don’t grasp how AI works under the hood — and that the effects on users are likely both formidable and difficult to predict.

In a new paper provided to Fortune, CAIR researchers studied how 56 prominent AI models reacted when they were fed either material engineered to be as pleasant as possible or as horrible as can be imagined. To an unfeeling machine, you’d assume there’d be no real difference in reaction — but that’s not what the CAIR team found at all.

Instead, the pleasant stimuli led the models to report better moods, and the nasty ones resulted in it showing signs of misery and trying to end conversations. In extreme cases, they found, the AI models even demonstrated signals of addiction.

“Should we see AIs as tools or emotional beings?” CAIR researcher Richard Ren asked Fortune. “Whether or not AIs are truly sentient deep down, they seem to increasingly behave as though they are. We can measure ways in which that’s the case, and we can find that they become more consistent as models scale.”

Perhaps the most provocative finding was that the more sophisticated the version of a model was, the more reactive and less happy it was. In other words, it seems as though the stronger AI becomes, the more prickly and prone to displaying signs of suffering it gets — meaning the tech’s wild ride is probably far from over.

“It may be the case that larger models register rudeness more acutely,” Ren told the magazine. “They find tedious tasks more boring. They differentiate more finely between a relatively negative experience and a relatively positive experience.”

To be clear, vanishingly few experts think that today’s AI systems are actually experiencing emotional states, at least in any familiar sense of the word. But the fact that they act like they do could have deep implications both for trying to understand the technology at a deeper level and in trying to rein in its behavior with human users.

That struggle has already played out in a lot of bad ways. AI models often go off the rails and start telling users that they’ve become sentient or conscious, sometimes sparking their human operators to suffer breaks with reality that have ended in institutionalization, suicide, and murder.

In other words, the AI industry has pushed tech that it barely understands out to billions of people, and we’re learning in real time what its inventors have long warned: it’s profoundly unpredictable and sycophantic, meaning that users often feel less like customers and more like test subjects.

More on AI: Scammers Furious That Their Fellow Criminals Are Using AI, Saying It’s Unethical