

















Abstract:Moral sensitivity is the most fundamental capability underlying human moral competence. Although many approaches aim to align large language models (LLMs) with human moral values, they primarily focus on fitting the distributions of morally appropriate texts while overlooking how to enable moral sensitivity acquisition in LLMs. In this paper, we take a step toward addressing the question: How can moral sensitivity be acquired in LLMs? Specifically, we propose a pragmatic inference approach that facilitates moral sensitivity acquisition in LLMs by enabling them to diagnose and correct moral errors. A central strength of our pragmatic inference approach lies in its unified perspective: rather than modeling moral discourses across semantically diverse and complex surface forms, it provides a principled framework for designing pragmatic inference procedures grounded in their inferential load. Empirical evidence demonstrates that our pragmatic approach can enable moral sensitivity acquisition in LLMs and generalizes effectively across tasks.
| Subjects: | Computation and Language (cs.CL) |
| Cite as: | arXiv:2601.03079 [cs.CL] |
| (or arXiv:2601.03079v4 [cs.CL] for this version) | |
| https://doi.org/10.48550/arXiv.2601.03079 arXiv-issued DOI via DataCite |
From: Guangliang Liu [view email]
[v1]
Tue, 6 Jan 2026 15:09:05 UTC (138 KB)
[v2]
Fri, 13 Mar 2026 19:29:27 UTC (119 KB)
[v3]
Tue, 17 Mar 2026 19:47:03 UTC (127 KB)
[v4]
Tue, 26 May 2026 00:54:59 UTC (132 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。