





















Abstract:Diffusion models excel at generating diverse and multimodal trajectories for robotic planning, yet their iterative denoising process introduces latency that is incompatible with high-frequency closed-loop control. To address this problem, we propose Dynamic Neural Koopman Distillation, a framework that distills multistep diffusion inference into a single forward pass while retaining the multimodal expressivity of the teacher model. Specifically, we introduce a Factorized Dynamic Koopman layer that models the denoising process through a factorized latent transition with state-dependent modal gains. We evaluate the proposed method on standard D4RL MuJoCo locomotion benchmarks and a physical Kinova manipulator, comparing against one-step baselines. The results show that our method significantly outperforms existing one-step distillation approaches on the reported locomotion tasks, and reduces the inference latency to the millisecond regime compared with the teacher policy. Hardware experiments further demonstrate that our method enables smooth and fast closed-loop execution while maintaining task success and comparable accuracy. A project page is available at this https URL.
| Comments: | 8 pages, 5 figures |
| Subjects: | Robotics (cs.RO) |
| Cite as: | arXiv:2605.24924 [cs.RO] |
| (or arXiv:2605.24924v1 [cs.RO] for this version) | |
| https://doi.org/10.48550/arXiv.2605.24924 arXiv-issued DOI via DataCite (pending registration) |
From: Lei Zheng [view email]
[v1]
Sun, 24 May 2026 08:03:49 UTC (6,483 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。