Unsupervised Image-to-Image Translation: A Review
- PMID: 36366238
- PMCID: PMC9654990
- DOI: 10.3390/s22218540
Unsupervised Image-to-Image Translation: A Review
Abstract
Supervised image-to-image translation has been proven to generate realistic images with sharp details and to have good quantitative performance. Such methods are trained on a paired dataset, where an image from the source domain already has a corresponding translated image in the target domain. However, this paired dataset requirement imposes a huge practical constraint, requires domain knowledge or is even impossible to obtain in certain cases. Due to these problems, unsupervised image-to-image translation has been proposed, which does not require domain expertise and can take advantage of a large unlabeled dataset. Although such models perform well, they are hard to train due to the major constraints induced in their loss functions, which make training unstable. Since CycleGAN has been released, numerous methods have been proposed which try to address various problems from different perspectives. In this review, we firstly describe the general image-to-image translation framework and discuss the datasets and metrics involved in the topic. Furthermore, we revise the current state-of-the-art with a classification of existing works. This part is followed by a small quantitative evaluation, for which results were taken from papers.
Keywords: computer vision; deep learning; generative adversarial networks; machine learning; review; unsupervised image-to-image translation.
Conflict of interest statement
The authors declare no conflict of interest.
Figures







Similar articles
-
C2 -GAN: Content-consistent generative adversarial networks for unsupervised domain adaptation in medical image segmentation.Med Phys. 2022 Oct;49(10):6491-6504. doi: 10.1002/mp.15944. Epub 2022 Aug 27. Med Phys. 2022. PMID: 35981348
-
S-CUDA: Self-cleansing unsupervised domain adaptation for medical image segmentation.Med Image Anal. 2021 Dec;74:102214. doi: 10.1016/j.media.2021.102214. Epub 2021 Aug 12. Med Image Anal. 2021. PMID: 34464837
-
A two-step method to improve image quality of CBCT with phantom-based supervised and patient-based unsupervised learning strategies.Phys Med Biol. 2022 Apr 11;67(8). doi: 10.1088/1361-6560/ac6289. Phys Med Biol. 2022. PMID: 35354124
-
Temporally consistent sequence-to-sequence translation of cataract surgeries.Int J Comput Assist Radiol Surg. 2023 Jul;18(7):1217-1224. doi: 10.1007/s11548-023-02925-y. Epub 2023 May 23. Int J Comput Assist Radiol Surg. 2023. PMID: 37219806 Free PMC article. Review.
-
A Review of Single-Source Deep Unsupervised Visual Domain Adaptation.IEEE Trans Neural Netw Learn Syst. 2022 Feb;33(2):473-493. doi: 10.1109/TNNLS.2020.3028503. Epub 2022 Feb 3. IEEE Trans Neural Netw Learn Syst. 2022. PMID: 33095718 Review.
Cited by
-
Virtual NBI image synthesis using stable diffusion for enhanced recognition of early gastric cancer: a technical validation study.Ann Med. 2025 Dec;57(1):2523565. doi: 10.1080/07853890.2025.2523565. Epub 2025 Jun 28. Ann Med. 2025. PMID: 40580098 Free PMC article.
-
Deep learning-based organ-wise dosimetry of 64Cu-DOTA-rituximab through only one scanning.Sci Rep. 2025 Feb 15;15(1):5627. doi: 10.1038/s41598-025-88498-z. Sci Rep. 2025. PMID: 39955298 Free PMC article.
-
Chronic Mild Sleep Restriction Does Not Lead to Marked Neuronal Alterations Compared With Maintained Adequate Sleep in Adults.J Nutr. 2024 Feb;154(2):446-454. doi: 10.1016/j.tjnut.2023.12.016. Epub 2023 Dec 16. J Nutr. 2024. PMID: 38104943 Free PMC article. Clinical Trial.
-
Enhancing Green Fraction Estimation in Rice and Wheat Crops: A Self-Supervised Deep Learning Semantic Segmentation Approach.Plant Phenomics. 2023 Jul 18;5:0064. doi: 10.34133/plantphenomics.0064. eCollection 2023. Plant Phenomics. 2023. PMID: 37469555 Free PMC article.
-
Diagnostic performance of deep-learning-based virtual chromoendoscopy in gastric neoplasms.Gastric Cancer. 2024 May;27(3):539-547. doi: 10.1007/s10120-024-01469-7. Epub 2024 Jan 19. Gastric Cancer. 2024. PMID: 38240891
References
-
- Isola P., Zhu J.Y., Zhou T., Efros A.A. Image-to-Image Translation with Conditional Adversarial Networks; Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Honolulu, HI, USA. 21–26 July 2017; pp. 5967–5976. - DOI
-
- Wang T.C., Liu M.Y., Zhu J.Y., Tao A., Kautz J., Catanzaro B. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs. [(accessed on 31 December 2018)]; Available online: http://xxx.lanl.gov/abs/1711.11585.
-
- Bousmalis K., Silberman N., Dohan D., Erhan D., Krishnan D. Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks; Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Honolulu, HI, USA. 21–26 July 2017; pp. 95–104. - DOI
-
- Liu M.Y., Breuel T., Kautz J. Unsupervised Image-to-Image Translation Networks. arXiv. 20171703.00848
-
- Taigman Y., Polyak A., Wolf L. Unsupervised Cross-Domain Image Generation. [(accessed on 31 December 2017)]; Available online: http://xxx.lanl.gov/abs/1611.02200[cs]
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources