
























Abstract:Parameter-efficient fine-tuning enables fast personalization of text-to-image diffusion models, but composing multiple custom concepts remains challenging due to representation interference. Existing modular methods either rely on expensive post-hoc fusion or freeze adaptation subspaces, which limit expressiveness and concept fidelity. To address this trade-off, we propose Sequential regularized LoRA (SeqLoRA), a constrained continual learning framework that jointly optimizes both LoRA factors via bilevel optimization. Theoretically, we establish strong convergence guarantees for our algorithm and model the residual layer activations as a matrix sub-Gaussian process to derive high-probability bounds on catastrophic forgetting. We further prove that learning the LoRA basis from data minimizes residual interference energy more effectively than frozen-basis methods. Experiments on multi-concept image generation demonstrate that SeqLoRA improves identity preservation and scalability across up to 101 concepts, while avoiding costly fusion and reducing attribute interference in composed generations.
| Subjects: | Machine Learning (cs.LG) |
| Cite as: | arXiv:2605.22743 [cs.LG] |
| (or arXiv:2605.22743v1 [cs.LG] for this version) | |
| https://doi.org/10.48550/arXiv.2605.22743 arXiv-issued DOI via DataCite (pending registration) |
From: Javad Parsa [view email]
[v1]
Thu, 21 May 2026 17:13:49 UTC (17,207 KB)
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。