






















Authors:Yifei Dong (1), Mingen Zheng (1), Linquan Wu (2), Jeff Z. Pan (3), Jiaxin Bai (4) ((1) Hong Kong University of Science and Technology, (2) City University of Hong Kong, (3) University of Edinburgh, (4) Hong Kong Baptist University)
Abstract:World-model synthesis aims to turn interaction experience into an internal model of environment dynamics. Existing symbolic approaches often fit observed transitions or mixtures of local rules, but they do not produce a complete executable program that can run independently of the real environment. We present Mind-Studio, a framework that synthesizes executable pygame-style world models from state-action-next-state trajectories using large language models. Mind-Studio combines entropy-selected traces with a lightweight game skill file containing object, action, and static scene information extracted from screenshots. We evaluate synthesis quality with a K-step lookahead fidelity protocol that compares generated world-model rollouts against Real-ALE rollouts from the same state. On Montezuma's Revenge, Mind-Studio improves chosen-action next-state prediction from 0.3% for PoE-World to 48.7% while verifying 5 of 8 subgoals; across Alien, Assault, and Skiing, it achieves stronger branch-level fidelity than prior learned lookahead sources.
From: Yifei Dong [view email]
[v1]
Sun, 14 Jun 2026 23:53:49 UTC (740 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。