

















Abstract:Large Language Models (LLMs) are increasingly deployed in diverse cultural contexts, yet their ability to master aesthetic stylistics, i.e., the strategic use of language to evoke cultural resonance, remains underexplored. We curate C4STYLI, a benchmark of highly stylized translated movie titles and advertising slogans from Hong Kong and the Chinese Mainland, to evaluate LLMs via the lens of behavioral recognition and productive competence. Extensive evaluations show that LLMs differ from humans in stylistic recognition, and this recognition ability varies across text domains. In addition, stylistic recognition and generation performance in LLMs are not consistently aligned. To further examine whether LLMs genuinely capture stylistic information in stylistic recognition, we conduct structural ablation with logistic regression probes. We find that, in the Hong Kong setting, stylistic recognition in LLMs relies primarily on surface-level linguistic information rather than stylistic structure. This suggests limited sensitivity to Hong Kong-specific stylistic structure.
| Comments: | IJCAI 2026 Human-Centred AI track |
| Subjects: | Computation and Language (cs.CL) |
| Cite as: | arXiv:2605.27296 [cs.CL] |
| (or arXiv:2605.27296v1 [cs.CL] for this version) | |
| https://doi.org/10.48550/arXiv.2605.27296 arXiv-issued DOI via DataCite (pending registration) |
From: Jiashuo Wang [view email]
[v1]
Tue, 26 May 2026 17:08:46 UTC (898 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。