




















Abstract:Model Predictive Path Integral (MPPI) control is a widely used sampling-based method for trajectory optimization, yet its convergence properties remain only partially understood. This paper provides a direct convergence analysis using variational optimization. By lifting constrained trajectory optimization to a Kullback-Leibler (KL) regularized problem over decision distributions, we derive a reduced free-energy objective defined over a parametric sampling family. For general parametric families, we derive gradient and Hessian representations of this reduced objective and analyze preconditioned gradient descent on the sampling-distribution parameters. In the fixed-covariance Gaussian case, the classical MPPI update is recovered exactly as a unit-step preconditioned gradient update. We prove descent and stationarity guarantees for the exact expectation-based iteration when the Hessian of the reduced objective is bounded in the metric induced by the preconditioner. For the Gaussian family, we further show that the preconditioned Hessian is governed by the covariance of the Gibbs-tilted distribution relative to the covariance of the sampling distribution, yielding a covariance-dependent sufficient condition for the descent of exact unit-step MPPI. Numerical experiments illustrate the theory and the effect of key hyperparameters.
| Subjects: | Optimization and Control (math.OC); Systems and Control (eess.SY) |
| Cite as: | arXiv:2603.24489 [math.OC] |
| (or arXiv:2603.24489v2 [math.OC] for this version) | |
| https://doi.org/10.48550/arXiv.2603.24489 arXiv-issued DOI via DataCite |
From: Sina Sharifi [view email]
[v1]
Wed, 25 Mar 2026 16:30:05 UTC (1,454 KB)
[v2]
Fri, 22 May 2026 16:53:08 UTC (884 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。