
























Abstract:The training algorithms for AI systems all introduce far-from-equilibrium dynamical processes, and understanding the irreversibility of these algorithms is a fundamental step towards understanding the learning dynamics of modern AI systems. In this work, we establish a general framework for defining and analyzing the irreversibility of training algorithms. We show that four different ways to characterize the irreversibility of dynamical processes are equivalent to leading order in the step size $\eta$: numerical backward error $\phi_{\rm DE}$, time-renormalized correction $\phi_{\rm TR}$, microscopic time reversal asymmetry $\phi_{\rm TA}$, and the (regularized) stochastic-thermodynamic entropy production $\phi_{\rm ST}$. The irreversibility gives rise to a time-reversal-symmetry-breaking emergent force that generically breaks non-isometric continuous reparametrization symmetries, preserves orthogonal symmetries, and leads to a universal preference for those learning trajectories that minimize the entropy production rate.
| Comments: | preprint |
| Subjects: | Statistical Mechanics (cond-mat.stat-mech); Artificial Intelligence (cs.AI); Machine Learning (cs.LG) |
| Cite as: | arXiv:2605.21933 [cond-mat.stat-mech] |
| (or arXiv:2605.21933v1 [cond-mat.stat-mech] for this version) | |
| https://doi.org/10.48550/arXiv.2605.21933 arXiv-issued DOI via DataCite (pending registration) |
From: Liu Ziyin [view email]
[v1]
Thu, 21 May 2026 03:04:44 UTC (619 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。