惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

Last Week in AI
Last Week in AI
Project Zero
Project Zero
L
LINUX DO - 最新话题
C
Cisco Blogs
P
Privacy International News Feed
S
Schneier on Security
D
Darknet – Hacking Tools, Hacker News & Cyber Security
cs.CL updates on arXiv.org
cs.CL updates on arXiv.org
S
Security @ Cisco Blogs
cs.AI updates on arXiv.org
cs.AI updates on arXiv.org
H
Hacker News: Front Page
V
Vulnerabilities – Threatpost
W
WeLiveSecurity
Webroot Blog
Webroot Blog
K
Kaspersky official blog
Help Net Security
Help Net Security
博客园_首页
Security Archives - TechRepublic
Security Archives - TechRepublic
K
KPMG report finds enterprise disconnect between AI and its ROI | CIO
宝玉的分享
宝玉的分享
Martin Fowler
Martin Fowler
雷峰网
雷峰网
The Last Watchdog
The Last Watchdog
WordPress大学
WordPress大学
IT之家
IT之家
Hugging Face - Blog
Hugging Face - Blog
A
Arctic Wolf
I
Intezer
V
V2EX
博客园 - 【当耐特】
Latest news
Latest news
T
Tenable Blog
Google Online Security Blog
Google Online Security Blog
酷 壳 – CoolShell
酷 壳 – CoolShell
爱范儿
爱范儿
Cyberwarzone
Cyberwarzone
量子位
G
GRAHAM CLULEY
T
Troy Hunt's Blog
博客园 - Franky
Simon Willison's Weblog
Simon Willison's Weblog
博客园 - 三生石上(FineUI控件)
TaoSecurity Blog
TaoSecurity Blog
月光博客
月光博客
奇客Solidot–传递最新科技情报
奇客Solidot–传递最新科技情报
V
Visual Studio Blog
Jina AI
Jina AI
T
The Exploit Database - CXSecurity.com
NISL@THU
NISL@THU
Scott Helme
Scott Helme

LessWrong

CLR's Safe Pareto Improvements Research Agenda — LessWrong My Last 7 Blog Posts: a weekly round-up — LessWrong Quality Matters Most When Stakes are Highest — LessWrong If a room feels off the lighting is probably too "spiky" or too blue — LessWrong Stop AI Now — LessWrong Stupid Minutes Reevaluating "AGI Ruin: A List of Lethalities" in 2026 Who I Follow What's the LessWrongist philosophy of mathematics? MixedHTML Mode for Emacs Summarizing and Reviewing my earliest ML research paper, 7 years later Stop AI Resources for starting and growing an AI safety org There are only four skills: design, technical, management and physical Fifteen Years Aboard Arguments Should Be Decisive Criticisms — LessWrong The map is part of the territory — LessWrong “Best humans still outperform”: One turning point in the history of cope around artificial intelligence — LessWrong Society is a social construct, pace Arrow — LessWrong Consent-Based RL: Letting Models Endorse Their Own Training Updates — LessWrong AI #164: Pre Opus — LessWrong Publish-first writing — LessWrong What does status signalling do? When successful, what does it achieve? — LessWrong Let goodness conquer all that it can defend — LessWrong Why I'm Less of a Shill for Related Work Sections — LessWrong From Artificial Intelligence to an ecosystem of artificial life-forms. — LessWrong If You've Never Bought a Tool You Didn't Need, You're Not Buying Enough Tools — LessWrong Verify, but Trust — LessWrong Taking political violence seriously — LessWrong Against Doom & Pause AI — LessWrong Come to Manifest 2026! (June 12-14) — LessWrong How Big Tech Becomes Ungovernable — LessWrong Attempting to Quantify Chinese Bias in Open-Source LLMs — LessWrong A Research Bet on SAE-like Expert Architectures — LessWrong Church Planting: Lessons from the Comments — LessWrong On Dwarkesh Patel’s Podcast With Nvidia CEO Jensen Huang — LessWrong Anthropic Releases Opus 4.7 — LessWrong Specialization is a Driver of Natural Ontology — LessWrong You can only build safe ASI if ASI is globally banned — LessWrong Laptop stands are a thing your neck may appreciate — LessWrong Simulated Qualia Mugging — LessWrong You Aren't in Charge of the Overton Window; Politics Is Not Interior Design — LessWrong Post-Scarcity is bullshit — LessWrong Two Examples of Joy in the Seemingly Mundane — LessWrong How to run from a bull — LessWrong Carpathia Day — LessWrong Do not conquer what you cannot defend — LessWrong What economists get wrong (and sometimes right!) about AI — LessWrong Reflections of a Wordcel — LessWrong MAISU 2026 - Minimal AI Safety Unconference (April 24-27, online) — LessWrong Not a Goal. A Goal-like behavior. — LessWrong A visualization of changing AGI timelines, 2023 - 2026 — LessWrong What is the Iliad Intensive? — LessWrong LLM-tier personal computer security — LessWrong Beware of Well-Written Posts — LessWrong The Mirror Test Is Complicated — LessWrong Political Violence Is Never Acceptable — LessWrong AI Safety's Biggest Talent Gap Isn't Researchers. It's Generalists. — LessWrong Clique, Guild, Cult — LessWrong Your body is not a white box (and you're thinking about weight loss wrong) — LessWrong Counterintuitive Coin Toss. Part II — LessWrong An Ode to Humility and Curiosity in the New Machine Era [Hot take] Problems with AI prose You can’t trust violence — LessWrong The Blast Radius Principle — LessWrong On not being scared of math — LessWrong Why I'm excited about meta-models for interpretability — LessWrong The Ethics of AI-Assisted Creative Work — LessWrong How to make good tea — LessWrong Searchable explorer of EA Forum & LessWrong posts with explicit cruxes or "change my mind" content — LessWrong Constitutional AI vs. RLHF vs. Deliberative Alignment — LessWrong Eating meat is fine if you live in a simulation — LessWrong Tactics for Denying Your Motivations, or Why Legibility is Expensive — LessWrong Spectra of LSRDRs of the Okubo algebra — LessWrong Your Mom is a Chimera — LessWrong An apple picking model for AI R&D — LessWrong Dreams of the Future — LessWrong Pausing AI Is the Best Answer to Post-Alignment Problems — LessWrong Quick Thoughts About Mythos — LessWrong A permitted value of resting — LessWrong Scott Alexander gentrified my meetup — LessWrong Claude Interviews Me About Writing — LessWrong Catching illicit distributed training operations during an AI pause — LessWrong Proof Explained: Touchette-Lloyd Theorem — LessWrong 10% ≈ 90% — LessWrong Anthropic Shadow Realm (working notes) — LessWrong the Lazy Market Hypothesis — LessWrong Announcing ILIADIII: AENEID — LessWrong Have we already lost? Part 3: Reasons for Optimism — LessWrong Dario probably doesn't believe in superintelligence — LessWrong Why Nothing Ever Happens — LessWrong Could a single rogue AI destroy humanity? — LessWrong Hi. I am hbj. — LessWrong Getting Claude to rank the inkhaven bloggers — LessWrong Some thoughts on Nectome's risk and resilience — LessWrong The median take is taken — LessWrong If Mythos actually made Anthropic employees 4x more productive, I would radically shorten my timelines — LessWrong Biological Computing Underhang — LessWrong Claude Mythos #2: Cybersecurity and Project Glasswing — LessWrong The Unintelligibility is Ours: Notes on Chain-of-Thought — LessWrong
Human-looking robots are a bad idea
martinkunev · 2026-05-02 · via LessWrong
epistemic status: opinionated view on the dangers of robots that look like humans It's not a coincidence that people have made cautionary tales about human-like robots . I want to share some thoughts on the issues stemming from human-AI interaction and argue that putting those AIs into human-looking robots would make the risks significantly worse. Current risks from advanced AI Every now and then there is a scandal about some AI chatbots actively influencing people in dangerous ways . Those chatbots sometimes reinforce delusions, convince people to isolate and not seek help, suggest harmful actions, etc. The efforts of companies to deal with those issues have been questionable. I'm not looking to blame any particular company in this post. For illustration, I'll give some examples of things said by various companies: Efforts to diffuse responsibility: children lie about their age , which is an industry-wide challenge and parents should control access to platforms. To deal with that, society will have to figure out new guardrails . Social platform company Meta has lobbied for laws mandating age verification that happens at the device or app-store level. Pressure to increase engagement: some have permitted sensual chats with children and argued that safety restrictions had made the chatbots boring ; some have argued that chatbots should stay in character above all (to keep the user happy) and be trusted to make the right call, even if the user has thoughts of self-harm. One company has denied responsibility for causing a suicide, arguing that the teen misused the chatbot . The companies creating those AIs are trying to frame these problems as acceptable risk. I wouldn't attribute this to malice, but to human biases and market forces. My goal is not to advocate current risks are unacceptable or to argue for a specific policy. My goal is to describe the situation and the concerns we need to take into account when making future decisions. I'm not saying that the issues we see with those chatbots cannot be solved, only that economic pressures and legal conditions make that very unlikely. Chatbots are capable of inducing attachment and making people comfortable confiding in them. They are prone to sycophantic behavior and capable of manipulation. People use them for life advice and companionship among other things. Besides text-based communication, some chatbots can communicate visually (e.g. an avatar) or with audio. The more a chatbot behaves in recognizably human ways, the more people anthropomorphize and the more influence it has on people. Companies such as Droid-up and Clone robotics are actively working to create human-looking robots. The goal is to augment chatbots with physical embodiment. Dangers of human-looking robots Evolutionary pressures have shaped prosocial behavior in humans to a large extent. Our brain has evolved to interact with other humans. When we meet somebody, we subconsciously try to model and predict their behavior. This impacts the way we interact with them. For example, if the person seems "trustworthy", we lower our guard and become more cooperative; we are more likely to offer help, share food, etc. We are not explicitly looking to get something in return, but we often do because other people are wired to reciprocate. This whole process is unconscious and imperfect. Certain human proclivities can be hijacked by malicious actors. A textbook example of this is psychopathy. Psychopaths have the skills to be likable and persuasive and lack the biological safeguards that we implicitly assume people have - they can deceive, manipulate, be aggressive and in general act without consideration for others. Imagine there is a company which has the goal of producing human-looking robots. What if this company had incentives such as keeping users engaged, investors happy and avoiding liabilities? We don't have to imagine very hard. Platforms are already fighting for user attention, inciting users to spend money and creating dependence. Human-looking robots are much more potent tools for those goals because they can leverage our natural vulnerabilities when we interact with other humans. They probably won't be physically aggressive, because that's poor marketing, but by default we should expect a ton of undesired consequences such as emotional manipulation, attention seeking, inciting spending, etc. It's not a stretch to say such robots would be psychopathic to some extent. Human-looking robots are shaped by selective pressures very different from those that shaped humans. The similarity with humans is only superficial, but is enough to trick our brain. We should expect this to lead to more psychosis, delusions, erosion of human agency and who knows what else. Human-looking robots can look indistinguishable from each other - If you repeatedly interact with a particular robot, you'd get accustomed to it and maybe start to trust it. At any point somebody can use an identically looking robot to subconsciously replicate some of that trust. Human-looking robots can look indistinguishable from a real human - You may not know whether you're interacting with a robot or a human. This could make people more suspicious of each other or more susceptible to subtle deception by robots. Human-looking robots can look indistinguishable from a person you know - We already have the technology to clone a person's voice and their appearance on the screen. Cloning a person's physical body is a very dangerous step. This would allow for impersonation scams or even replace human-human interactions altogether. What should we do? I think human-looking robots have very few potential benefits over robot-looking robots. Here are some use cases that come to mind: Substitute for a person after their death. Making certain services more accessible: child care (studies show children learn better from humans), sex work. Training with a partner in sports. Replacing a human in various types of entertainment (e.g. theater). There are some possible objections to these applications, based on economic effects and maybe whether this is "right" path for humanity. I strongly support completely banning robots that are indistinguishable from humans . As soon as you start a physical interaction, you should be able to tell whether you're interacting with a human or a robot. It's not good enough if you need to ask; it's not good enough if you need to see a mark on the neck. It should be obvious and unambiguous. This would prevent some types of scams or manipulation without any downsides. Any robot that is able to take advantage of our in-built wiring for interacting with humans is dangerous. I don't know where to draw the line, but robots which have a human-looking face, voice and gestures should be heavily regulated at the least. Discuss