DTL: Parameter- and Memory-Efficient Disentangled Vision Learning
- PMID: 41032539
- DOI: 10.1109/TPAMI.2025.3616318
Abstract
The cost of fine-tuning a pretrained model on downstream tasks steadily increases as models grow larger. Parameter-efficient transfer learning (PETL) reduces this cost by updating only a tiny subset of trainable parameters. However, PETL does not effectively reduce the GPU memory footprint during training, because the trainable parameters of these methods are tightly entangled with the backbone, so many intermediate states must be stored for backpropagation. To alleviate this issue, we introduce Disentangled Transfer Learning (DTL), which disentangles the trainable parameters from the backbone using a lightweight Compact Side Network (CSN). By progressively extracting task-specific information with a few low-rank linear mappings and adding this information back to the backbone at appropriate points, the CSN effectively realizes knowledge transfer in various downstream recognition tasks. We further extend DTL to more challenging tasks such as object detection and semantic segmentation by employing a sparser architectural design. Extensive experiments validate the effectiveness of DTL, which not only reduces GPU memory usage and the number of trainable parameters substantially, but also outperforms existing PETL methods in accuracy by a significant margin.
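To make the disentanglement idea concrete, the following is a minimal PyTorch sketch of a low-rank side branch kept outside a frozen backbone, in the spirit of the CSN described above. All names (CompactSideNetwork, the rank, and where the side output is added back) are illustrative assumptions, not the paper's actual implementation.

```python
# Hypothetical sketch: a compact side network trained alongside a frozen backbone,
# so that backbone activations need not be cached for backpropagation.
import torch
import torch.nn as nn


class CompactSideNetwork(nn.Module):
    """Low-rank side branch kept outside the frozen backbone (illustrative)."""

    def __init__(self, dim: int, num_stages: int, rank: int = 8):
        super().__init__()
        # One rank-r down-projection per backbone stage, shared up-projection.
        self.down = nn.ModuleList(nn.Linear(dim, rank) for _ in range(num_stages))
        self.up = nn.Linear(rank, dim)

    def forward(self, intermediate_feats):
        # intermediate_feats: list of detached backbone features, one per stage.
        h = 0.0
        for proj, feat in zip(self.down, intermediate_feats):
            h = h + proj(feat)   # progressively extract task-specific information
        return self.up(h)        # map back to the backbone dimension


# Usage sketch: run the frozen backbone once, collect detached per-block features,
# then add the side output to the final representation before the task head.
if __name__ == "__main__":
    dim, num_blocks, tokens = 768, 12, 197
    backbone_blocks = nn.ModuleList(nn.Linear(dim, dim) for _ in range(num_blocks))
    for p in backbone_blocks.parameters():
        p.requires_grad_(False)              # backbone stays frozen

    csn = CompactSideNetwork(dim, num_blocks)
    x = torch.randn(2, tokens, dim)
    feats = []
    with torch.no_grad():                    # no backbone activations stored for backprop
        for blk in backbone_blocks:
            x = blk(x)
            feats.append(x)
    out = x + csn([f.detach() for f in feats])   # add task-specific information back
    out.sum().backward()                     # gradients reach only the side network
```

Because gradients flow only through the side branch, both the trainable parameter count and the activation memory scale with the small rank rather than with the backbone, which is the memory-saving behavior the abstract attributes to DTL.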