Abstract
Despite their strong generative capabilities, large language models frequently exhibit hallucinations, particularly due to outside-boundary confidence where incorrect assertions are produced with high statistical certainty. Existing approaches commonly use output probability as a proxy for truthfulness; however, this signal is confounded by epistemic uncertainty and cannot reliably distinguish genuine uncertainty from fabricated content. We argue that effective hallucination detection requires integrating surface-level confidence with signals that reflect the model’s underlying epistemic state. To this end, we propose Answer-level Intrinsic Cognition (AIC), a model-agnostic metric that captures epistemic boundary deviations by measuring answer-level stability across multiple stochastic forward passes. By coupling AIC with conventional output uncertainty, we derive a composite metric that disentangles within-boundary uncertainty from outside-boundary confidence. Across three public question-answering benchmarks and diverse model scales, the two-dimensional score consistently outperforms strong uncertainty-only baselines, with larger gains on adversarially constructed hallucination sets. The code is available at: https://github.com/HXYfighter/AIC-ACL2026.
- Anthology ID:
- 2026.findings-acl.674
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 13796–13806
- Language:
- URL:
- https://aclanthology.org/2026.findings-acl.674/
- DOI:
- Bibkey:
- Cite (ACL):
- Jieran Li, Xiuyuan Hu, Yang Zhao, Dongbiao Sun, and Hao Zhang. 2026. Beyond Output Confidence: Epistemic-Aware Hallucination Detection with Answer-Level Signals. In Findings of the Association for Computational Linguistics: ACL 2026, pages 13796–13806, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Beyond Output Confidence: Epistemic-Aware Hallucination Detection with Answer-Level Signals (Li et al., Findings 2026)
- Copy Citation:
- PDF:
- https://aclanthology.org/2026.findings-acl.674.pdf
- Checklist:
- 2026.findings-acl.674.checklist.pdf























