惯性聚合 高效追踪和阅读你感兴趣的博客、新闻、科技资讯
阅读原文 在惯性聚合中打开

推荐订阅源

F
Full Disclosure
Recorded Future
Recorded Future
T
Tenable Blog
S
Securelist
C
CERT Recently Published Vulnerability Notes
T
Threatpost
S
Schneier on Security
A
Arctic Wolf
The Hacker News
The Hacker News
C
CXSECURITY Database RSS Feed - CXSecurity.com
Know Your Adversary
Know Your Adversary
P
Privacy International News Feed
Threat Intelligence Blog | Flashpoint
Threat Intelligence Blog | Flashpoint
The Register - Security
The Register - Security
Cisco Talos Blog
Cisco Talos Blog
AWS News Blog
AWS News Blog
K
Kaspersky official blog
T
True Tiger Recordings
T
Threat Research - Cisco Blogs
V
Vulnerabilities – Threatpost
P
Palo Alto Networks Blog
T
The Exploit Database - CXSecurity.com
小众软件
小众软件
B
Blog
Cyber Security Advisories - MS-ISAC
Cyber Security Advisories - MS-ISAC
Microsoft Azure Blog
Microsoft Azure Blog
Cyberwarzone
Cyberwarzone
C
Cybersecurity and Infrastructure Security Agency CISA
T
Tor Project blog
Spread Privacy
Spread Privacy
Malwarebytes
Malwarebytes
P
Proofpoint News Feed
F
Fox-IT International blog
F
Fortinet All Blogs
P
Privacy & Cybersecurity Law Blog
G
GRAHAM CLULEY
量子位
Latest news
Latest news
OSCHINA 社区最新新闻
OSCHINA 社区最新新闻
博客园 - 叶小钗
Project Zero
Project Zero
T
Tailwind CSS Blog
N
Netflix TechBlog - Medium
Martin Fowler
Martin Fowler
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
IntelliJ IDEA : IntelliJ IDEA – the Leading IDE for Professional Development in Java and Kotlin | The JetBrains Blog
I
Intezer
博客园_首页
腾讯CDC
H
Hackread – Cybersecurity News, Data Breaches, AI and More
D
Darknet – Hacking Tools, Hacker News & Cyber Security

Paper Index on ACL Anthology

Can Large Language Models Replace Statistical Software? Robust Language Identification for Romansh Varieties Graph-Augmented LLMs for Swiss MP Ideology Prediction Data Augmentation for Historical NER: A Systematic Comparison of Lexical and LLM-based Approaches Text vs. Phoneme Intermediates for Low-Resource Swiss German RUMLEM: A Dictionary-Based Lemmatizer for Romansh Code-Switching Detection in Multilingual Child Speech with SwissBERT Optimizing Large Language Models for Robust Domain-Specific Text-to-SQL: From Prompting to Preference Alignment How Good is AI on Swiss Voting Booklets? A Multilingual OCR and Alignment Benchmark Call Support Copilot: A Reproducible Multimodal System for Speech Emotion Recognition, Intent Understanding, and Agent Assistance Reinforcement Learning for Latent-Space Thinking in LLMs The Same Email, Signed Differently: Testing Negotiation Bias and Recommendation Stability in LLMs Skill Extraction from Resumes and Job Offers across Six Languages Proceedings of the 11th Edition of the Swiss Text Analytics Conference Enhancing Retrieval via Cognitively Motivated Document Expansion Which Skills Debate Reaches the Public? Comparing Scientific Literature and Media Coverage of AI and LLM Skill Impacts (2022–2025) Concept Extraction and Webb’s Depth of Knowledge: Comparing LLM Question Generation Pipelines for Educational Assessment Automated German Alt Text Generation for News Charts Extracting Article-Level Legal Dependencies from Swiss Federal Law using LLMs Extending the Contact Hypothesis: Cross-Linguistic Evaluation of Religion and Nationality Bias When Prompting LLMs in German and Icelandic An Efficient Approach for Answering Not Readily Attainable Questions for RAG-based Applications A Dataset of Latin Etymologies Extracted from Wiktionary A Bounded Coordination-Support Capability for Multi-Party Settings: Task-State Monitoring in Firefighter Incident Command Implicit and Indirect: Detecting Face-threatening and Paired Actions in Asynchronous Online Conversations Controlling Language and Style of Multi-lingual Generative Language Models with Control Vectors Northern European Journal of Language Technology, Volume 11 Hybrid Human-LLM Corpus Construction and LLM Evaluation for the Caused-Motion Construction Calling things by their names: Towards a unified account for name-informing and mixed quotation Prior Lessons of Incremental Dialogue and Robot Action Management for the Age of Language Models German Demonstrative Pronouns in Contrast Embodied Conversational Systems in Human–Robot Interaction: Introduction to the Special Issue When to Say What and How: Adapting the Elaborateness and Indirectness of Spoken Dialogue Systems Strategic Dialogue Assessment: The Crooked Path to Innocence Common Ground inconsistencies in dialogue systems: conflict patterns implied by polar question forms Lexical Alignment to Non-native Speakers How People Structure Representations of Discourse Characterizing the Response Space of Questions: data and theory Repair of claimed non-understanding of word meaning in online discussion forum interaction The (Possible) Use of AI Tools for Processing Texts in Journalism in Bulgarian Computational Linguistics in Bulgaria Processing of discourse anaphors by L2 speakers of English Exploring the Sensitivity to Alternative Signals of Coherence Relations Enhancing Long-term RAG Chatbots with Psychological Models of Memory Importance and Forgetting Bullshit, Pragmatic Deception, and Natural Language Processing German Modal Particles as Discourse Signals Multi-modal Anaphora and Broadcasting of Information by Gestural Post-holds Lexical and contextual cue effects in discourse expectations: Experimenting with German ’zwar...aber’ and English ’true/sure...but’ GailBot: An automatic transcription system for Conversation Analysis Demonstrative Pronouns as Anti-Logophoric Pronouns: An Experimental Investigation The effect of domain knowledge and implicitation on discourse relation inferences Studying Alignment in a Collaborative Learning Activity via Automatic Methods: The Link Between What We Say and Do Pragmatic uses of I don’t know, boosters, and hedges in text and talk Why ellipsis? Interactional function predicts choice of syntactic form in conversation User Satisfaction Reward Estimation Across Domains: Domain-independent Dialogue Policy Learning Scoring Coreference Chains with Split-Antecedent Anaphors Cognitive and social delays in the initiation of conversational repair Beyond semantics: the challenges of annotating pragmatic and discourse phenomena Discourse Relations and Connectives in Higher Text Structure Few Shades of Supervision for Discourse Segmentation Event and Entity Coreference Across Five Languages: Effects of Context and Referring Expression Digging Communicative Intentions: The Case of Crises Events Automatic Essay Scoring Systems Are Both Overstable And Oversensitive: Explaining Why And Proposing Defenses Investigating Proactivity in Task-Oriented Dialogues Graph-to-Text Approach to Knowledge-Grounded Response Generation in Human–Robot Interaction From Discursive Practice to Logic? Remarks on Logical Expressivism Laughter use by virtual agents increases task success Narrative Elements in Expository Texts Modelling Structures for Situated Discourse It matters how you combine your clauses: Effects of syntactic subordination, connectives, and typographic and prosodic boundaries on the prominence of referents Form and Function of Connectives in Chinese Conversational Speech Attribution and the discourse structure of reports Referential Communication Between Friends and Strangers in the Wild Please, Please, Just Tell Me: The Linguistic Features of Humorous Deception Signaling of Causal Relations in Spanish: Variety, Functionality, and Specificity The timing of prominence information during the resolution of German personal and demonstrative pronouns The Conversational Discourse Unit: Identification and Its Role in Conversational Turn-taking Management Self-Repair in Tigrinya: Trouble Sources, Mechanisms and Solutions Perspective-Taking and Protagonist Prominence Does ChatGPT Adapt Itself to the Language Used and the Audience It Implies? Automatic Detection of the Bulgarian Evidential Renarrative User Impressions of System Questions to Acquire Lexical Knowledge during Dialogues Light Verb Constructions in ELEXIS-WSD – Annotation, Comparisons and Issues Journal Computational Linguistics in Bulgaria The Use of Perspective Markers and Connectives in Expressing Subjectivity: Evidence from Collocational Analyses Opinion Piece: Can we Fix the Scope for Coreference? An Analysis of Japanese Sentence-final Particle Yone: Compare Yone and Ne in Response A Neural Approach to Discourse Relation Signal Detection A Lexicon-Grammar of Brazilian Portuguese Predicative Adjectives Agent Orchestration - LLM for Legal Metadata Extraction: A Comparative Analysis of Efficiency and Precision ALBA: A European Portuguese Benchmark for Evaluating Language and Linguistic Dimensions in Generative LLMs Bridging Cultural Gaps in Automated Translation of Brazilian Expressions: A Study on Cultural Adaptation Bruna: A Real-Time Multimodal Voice Agent with Hybrid Reasoning Caracterização lexical e sintática de notícias falsas em português produzidas por humanos e por máquinas Cartas Indígenas ao Brasil: Classificação Multi-Rótulo Evaluating Automated Scoring Models on Official ENEM Essays Evaluating FrameNet-Based Semantic Modeling for Gender-Based Violence Detection in Clinical Records Evaluating Reference-Free Summarization Quality Metrics for Portuguese: A Study with Human Judgments in Financial News Evaluating Small Language Models for English-to-Portuguese Translation: Impact of Model Scale and Quantization Evolução de Padrões Linguísticos na Escrita Científica em Português: Uma Análise com NILC-Metrix Experimental Evaluation of Topic Modeling Methods for Categorizing Irregularities in Health-related news
A modular architecture for creating multimodal embodied agents with an episodic Knowledge Graph as an explainable and controllable long-term memory
2026-04-20 · via Paper Index on ACL Anthology

Abstract

How can flexibility and control over the interpretation of multimodal signals by embodied agents be balanced? Flexibility means that agents respond fluently in any context, whereas control means that responses are transparent and faithful to goals and principles that are explicitly defined. This paper describes a modular platform to create multimodal interactive agents using an event bus on which signals and interpretations are posted as a sequence in time, but also provides control options to drive the interaction given specific intentions and goals. Different sensors and interpretation components can be integrated by defining their input and output topics in the event bus, which results in an open multimodal sequence-driven workflow for further interpretations. In addition, our platform allows us to define higher-level intents that control sequence patterns to achieve a goal. A key component is an episodic Knowledge Graph (eKG) that acts as a long-term symbolic memory to aggregate and connect these interpretations. This eKG establishes coherence and continuity across different interactions. Intents and the eKG make it possible to define different (embodied) agents and compare their behavior without having to implement complex software components for multimodal sensor data and design the control over their dependencies. In this paper, we explain the broad range of components that we developed and integrated into various interactive agents. We also explain how the interaction is recorded as multimodal data and how it results in an aggregated memory in the eKG. By analyzing the recorded interaction, we can compare agents and agent components and study their interactive behavior with people and other agents.

Anthology ID:
2025.dnd-16.11
Volume:
Dialogue & Discourse Volume 16
Month:
December
Year:
2025
Address:
Chicago, Illinois, USA
Editors:
Amir Zeldes, Manfred Stede, Patrick G.T. Healey, and Hendrik Buschmeier
Venue:
DND
SIG:
SIGDIAL
Publisher:
University of Illinois Chicago
Note:
Pages:
25–59
Language:
URL:
https://aclanthology.org/2025.dnd-16.11/
DOI:
10.5210/dad.2025.303
Bibkey:
Cite (ACL):
Thomas Baier, Selene Báez Santamaría, and Piek Vossen. 2025. A modular architecture for creating multimodal embodied agents with an episodic Knowledge Graph as an explainable and controllable long-term memory. Dialogue & Discourse, 16:25–59.
Cite (Informal):
A modular architecture for creating multimodal embodied agents with an episodic Knowledge Graph as an explainable and controllable long-term memory (Baier et al., DND 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.dnd-16.11.pdf