The Alignment Target Problem: Divergent Moral Judgments of Humans, AI Systems, and Their Designers
Benjamin Minhao Chen, Xinyu Xie · 2026-04-27 · via cs.HC updates on arXiv.org

The project of aligning machine behavior with human values raises a basic problem: whose moral expectations should guide AI decision-making? Much alignment research assumes that the appropriate benchmark is how humans themselves would act in a given situation. Studies of agent-type value forks challenge this assumption by showing that people do not always judge humans and AI systems identically. This paper extends that challenge by examining two further possibilities: first, that evaluations of AI behavior change when its human origins are made visible; and second, that people judge the humans who program AI systems differently from either the machines or the human actors they are compared against. An experiment with 1,002 U.S. adults measured moral judgments in a runaway mine train scenario, varying the subject of evaluation across four conditions: a repairman, a repair robot, a repair robot programmed by company engineers, and company engineers programming a repair robot. We find no significant difference between evaluations of the repairman and the robot. However, judgments shifted substantially when the robot's actions were described as the product of human design. Participants exhibited markedly more deontological, rule-based reasoning when evaluating either the programmed robot or the engineers who programmed it, suggesting that rendering human agency visible activates heightened moral constraints. These findings indicate that people may evaluate humans, AI systems acting in the same situation, and the humans who design those systems in meaningfully different ways. Because these evaluations do not necessarily converge, they give rise to the alignment target problem: which normative target should guide the development of artificial moral agents in high-stakes domains, and can these plural judgments be reconciled within a coherent account of value alignment?