Machine Learning for Multimedia Communications
- PMID: 35161566
- PMCID: PMC8840624
- DOI: 10.3390/s22030819
Machine Learning for Multimedia Communications
Abstract
Machine learning is revolutionizing the way multimedia information is processed and transmitted to users. After intensive and powerful training, some impressive efficiency/accuracy improvements have been made all over the transmission pipeline. For example, the high model capacity of the learning-based architectures enables us to accurately model the image and video behavior such that tremendous compression gains can be achieved. Similarly, error concealment, streaming strategy or even user perception modeling have widely benefited from the recent learning-oriented developments. However, learning-based algorithms often imply drastic changes to the way data are represented or consumed, meaning that the overall pipeline can be affected even though a subpart of it is optimized. In this paper, we review the recent major advances that have been proposed all across the transmission chain, and we discuss their potential impact and the research challenges that they raise.
Keywords: QoE assessment; caching; channel coding; content consumption; error concealment; image coding; machine learning; multimedia communications; video coding; video streaming.
Conflict of interest statement
The authors declare no conflict of interest.
Figures








Similar articles
-
Joint source-channel coding for wireless object-based video communications utilizing data hiding.IEEE Trans Image Process. 2006 Aug;15(8):2158-69. doi: 10.1109/tip.2006.875194. IEEE Trans Image Process. 2006. PMID: 16900673
-
Error concealment for shape in MPEG-4 object-based video coding.IEEE Trans Image Process. 2005 Apr;14(4):389-96. doi: 10.1109/tip.2004.841197. IEEE Trans Image Process. 2005. PMID: 15825475
-
Joint source/channel coding for image transmission with JPEG2000 over memoryless channels.IEEE Trans Image Process. 2005 Aug;14(8):1020-32. doi: 10.1109/tip.2005.851681. IEEE Trans Image Process. 2005. PMID: 16121451
-
Video traffic characteristics of modern encoding standards: H.264/AVC with SVC and MVC extensions and H.265/HEVC.ScientificWorldJournal. 2014 Feb 20;2014:189481. doi: 10.1155/2014/189481. eCollection 2014. ScientificWorldJournal. 2014. PMID: 24701145 Free PMC article. Review.
-
Exploring the Role of 6G Technology in Enhancing Quality of Experience for m-Health Multimedia Applications: A Comprehensive Survey.Sensors (Basel). 2023 Jun 25;23(13):5882. doi: 10.3390/s23135882. Sensors (Basel). 2023. PMID: 37447735 Free PMC article. Review.
References
-
- Kountouris M., Pappas N. Semantics-Empowered Communication for Networked Intelligent Systems. IEEE Commun. Mag. 2021;59:96–102. doi: 10.1109/MCOM.001.2000604. - DOI
-
- AI, J. ISO/IEC JTC 1/SC29/WG1 N91014, REQ “JPEG AI Use Cases and Requirements”. 2021.
-
- MPEG Activity: Video Coding for Machines. [(accessed on 7 January 2021)]. Available online: https://mpeg.chiariglione.org/standards/exploration/video-coding-machines.
-
- Moving Picture, Audio and Data Coding by Artificial Intelligence. [(accessed on 7 January 2021)]. Available online: https://mpai.community/
-
- Hussain A.J., Al-Fayadh A., Radi N. Image compression techniques: A survey in lossless and lossy algorithms. Neurocomputing. 2018;300:44–69. doi: 10.1016/j.neucom.2018.02.094. - DOI
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources