Deltatensors – store model fine-tunes as compressed weight deltas
AaravGaur
·
2026-06-24
·
via HN's home page
 | |
Near-lossless delta compression for fine-tuned neural network models. Compressed a Qwen-2.5-0.5b wikitext fine-tune by 3.2x. Instead of storing 50 fine-tunes of the same base model, store one base and 50 small .wdelta delta files. deltatensors compresses the delta between a base and fine-tuned model, and reconstructs with sub-1% perplexity difference. docs: https://deltatensors.readthedocs.io/en/latest/ |
此内容由惯性聚合(RSS阅读器)自动聚合整理,仅供阅读参考。 原文来自 — 版权归原作者所有。