Front Neurosci. 2023 Jun 20;17:1202143. doi: 10.3389/fnins.2023.1202143. eCollection 2023.

Linear fine-tuning: a linear transformation based transfer strategy for deep MRI reconstruction

Wanqing Bi et al.

Abstract

Introduction: Fine-tuning (FT) is a generally adopted transfer learning method for deep learning-based magnetic resonance imaging (MRI) reconstruction. In this approach, the reconstruction model is initialized with pre-trained weights derived from a source domain with ample data and subsequently updated with limited data from the target domain. However, the direct full-weight update strategy can pose the risk of "catastrophic forgetting" and overfitting, hindering its effectiveness. The goal of this study is to develop a zero-weight update transfer strategy to preserve pre-trained generic knowledge and reduce overfitting.

Methods: Based on the commonality between the source and target domains, we assume that the optimal model weights of the target domain can be obtained from those of the source domain by a linear transformation. Accordingly, we propose a novel transfer strategy, linear fine-tuning (LFT), which introduces scaling and shifting (SS) factors into the pre-trained model. In contrast to FT, LFT updates only the SS factors in the transfer phase, while the pre-trained weights remain fixed.
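As an illustration of the scaling-and-shifting idea, the following is a minimal sketch in PyTorch, assuming per-filter factors applied to a frozen convolution; the module name SSConv2d and the parameter names phi_sw and phi_sb are hypothetical, and the authors' SSConv may differ in detail.

import torch
import torch.nn as nn
import torch.nn.functional as F

class SSConv2d(nn.Module):
    """Wrap a pre-trained Conv2d: the pre-trained weight W and bias b stay frozen;
    only the scaling factor phi_sw and shifting factor phi_sb are trainable."""

    def __init__(self, pretrained_conv: nn.Conv2d):
        super().__init__()
        self.conv = pretrained_conv
        for p in self.conv.parameters():        # freeze pre-trained W and b
            p.requires_grad = False
        out_ch = self.conv.out_channels
        self.phi_sw = nn.Parameter(torch.ones(out_ch, 1, 1, 1))  # scaling, one per filter
        self.phi_sb = nn.Parameter(torch.zeros(out_ch))          # shifting, one per filter

    def forward(self, x):
        # Linear transformation of the frozen weights: W' = phi_sw * W, b' = b + phi_sb
        w = self.conv.weight * self.phi_sw
        b = (self.conv.bias + self.phi_sb) if self.conv.bias is not None else self.phi_sb
        return F.conv2d(x, w, b, stride=self.conv.stride, padding=self.conv.padding,
                        dilation=self.conv.dilation, groups=self.conv.groups)

In the transfer phase, conventional FT would pass model.parameters() to the optimizer, whereas LFT would pass only the SS factors, e.g. [p for n, p in model.named_parameters() if n.endswith(("phi_sw", "phi_sb"))], so the number of trainable parameters per convolution drops from out_ch x in_ch x k x k + out_ch to 2 x out_ch.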

Results: To evaluate the proposed LFT, we designed three different transfer scenarios and conducted a comparative analysis of FT, LFT, and other methods at various sampling rates and data volumes. In the transfer scenario between different contrasts, LFT outperforms typical transfer strategies at various sampling rates and considerably reduces artifacts in the reconstructed images. In transfer scenarios between different slice directions or anatomical structures, LFT surpasses the FT method, particularly when the target domain contains fewer training images, with a maximum improvement of up to 2.06 dB (5.89%) in peak signal-to-noise ratio.

Discussion: The LFT strategy shows great potential to address the issues of "catastrophic forgetting" and overfitting in transfer scenarios for MRI reconstruction, while reducing reliance on the amount of data in the target domain. Linear fine-tuning is expected to shorten the development cycle of reconstruction models for adapting to complicated clinical scenarios, thereby enhancing the clinical applicability of deep MRI reconstruction.

Keywords: deep learning; fine-tuning; magnetic resonance imaging reconstruction; transfer learning; transfer strategy.

Conflict of interest statement

XH was employed by Fuqing Medical Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Exemplary basic and advanced features. Vb and Va represent the outputs of each kernel and each filter, respectively. Va shows two types of advanced features including in-contour artifacts (top) and out-of-contour artifacts (bottom).
Figure 2
Visualization of the difference between FT and LFT in the transfer phase. Taking a 3 × 3 filter with four channels as an example, the structure of SSConv is composed of an SS block and a regular convolution. The 3 × 3 and 1 × 1 squares represent the weight W and bias b of the kernel, respectively, and the different colors within them denote different channels. (A) FT updates all the pre-trained W and b. (B) The proposed LFT optimizes only the scaling factor ΦSW and shifting factor ΦSb.
Figure 3
Overview of the SSGAN architecture. The generator G of SSGAN is composed of two residual U-Nets with two encoder (pink box) and two decoder (blue box) blocks. The architecture of the discriminator D is the same as the encoding path of G. The inputs of G are the ZF image (i) and ZF image (k), which come from different collections.
Figure 4
Three transfer scenarios of the pre-trained SSGAN. The gray part shows SSGAN was pre-trained on the IXI dataset, then transferred to reconstruct images with different contrasts (blue part, scenario 1: T1-weighted to T2-weighted), slicing directions (purple part, scenario 2: axial planes to sagittal planes), and anatomical structures (green part, scenario 3: brain to knee).
Figure 5
Comparison of advanced features extracted by filters of the same channel in four cases: advanced features obtained by (A) feeding the source data into the PT model; (B) feeding the target data into the PT model; (C) feeding the target data into the FT model; (D) feeding the target data into the LFT model. The source data are from the IXI dataset, while the target data are from private sagittal brain dataset I, private axial brain dataset II, and the FastMRI knee dataset.
Figure 6
Typical reconstructions for sagittal T2-weighted brains from private dataset I by different methods. The last three models were trained with 160 images at a 50% sampling rate. From left to right: ground truth, zero-filled model, pre-trained model, directly trained model, fine-tuning model, and linear fine-tuning model, as well as their 10× magnified error maps.
Figure 7
Typical reconstructions for axial T1-weighted brain images from private dataset II by different models (the last three networks were trained with 200 images at a 50% sampling rate). From left to right: ground truth, zero-filled model, pre-trained model, directly trained model, fine-tuning model, and linear fine-tuning model.
Figure 8
Typical reconstructions for knee images from the FastMRI dataset by different networks (the last three networks were trained with 200 images at a 50% sampling rate). From left to right: ground truth, zero-filled model, pre-trained model, directly trained model, fine-tuning model, and linear fine-tuning model.
