Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology
- PMID: 35821601
- DOI: 10.1111/tpj.15905
Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology
Abstract
Advances in high-throughput omics technologies are leading plant biology research into the era of big data. Machine learning (ML) performs an important role in plant systems biology because of its excellent performance and wide application in the analysis of big data. However, to achieve ideal performance, supervised ML algorithms require large numbers of labeled samples as training data. In some cases, it is impossible or prohibitively expensive to obtain enough labeled training data; here, the paradigms of unsupervised learning (UL) and semi-supervised learning (SSL) play an indispensable role. In this review, we first introduce the basic concepts of ML techniques, as well as some representative UL and SSL algorithms, including clustering, dimensionality reduction, self-supervised learning (self-SL), positive-unlabeled (PU) learning and transfer learning. We then review recent advances and applications of UL and SSL paradigms in both plant systems biology and plant phenotyping research. Finally, we discuss the limitations and highlight the significance and challenges of UL and SSL strategies in plant systems biology.
Keywords: deep learning; machine learning; plant systems biology; semi-supervised learning; unsupervised learning.
© 2022 Society for Experimental Biology and John Wiley & Sons Ltd.
Similar articles
-
Semi-Supervised and Unsupervised Deep Visual Learning: A Survey.IEEE Trans Pattern Anal Mach Intell. 2024 Mar;46(3):1327-1347. doi: 10.1109/TPAMI.2022.3201576. Epub 2024 Feb 6. IEEE Trans Pattern Anal Mach Intell. 2024. PMID: 36006881
-
Small Data Challenges in Big Data Era: A Survey of Recent Progress on Unsupervised and Semi-Supervised Methods.IEEE Trans Pattern Anal Mach Intell. 2022 Apr;44(4):2168-2187. doi: 10.1109/TPAMI.2020.3031898. Epub 2022 Mar 4. IEEE Trans Pattern Anal Mach Intell. 2022. PMID: 33074801
-
ℓ1-norm based safe semi-supervised learning.Math Biosci Eng. 2021 Sep 7;18(6):7727-7742. doi: 10.3934/mbe.2021383. Math Biosci Eng. 2021. PMID: 34814272
-
The Utility of Unsupervised Machine Learning in Anatomic Pathology.Am J Clin Pathol. 2022 Jan 6;157(1):5-14. doi: 10.1093/ajcp/aqab085. Am J Clin Pathol. 2022. PMID: 34302331 Review.
-
Machine learning: its challenges and opportunities in plant system biology.Appl Microbiol Biotechnol. 2022 May;106(9-10):3507-3530. doi: 10.1007/s00253-022-11963-6. Epub 2022 May 16. Appl Microbiol Biotechnol. 2022. PMID: 35575915 Review.
Cited by
-
PhytoCluster: a generative deep learning model for clustering plant single-cell RNA-seq data.aBIOTECH. 2025 Feb 20;6(2):189-201. doi: 10.1007/s42994-025-00196-6. eCollection 2025 Jun. aBIOTECH. 2025. PMID: 40641652 Free PMC article.
-
A classification method of stress in plants using unsupervised learning algorithm and chlorophyll fluorescence technology.Front Plant Sci. 2023 Oct 23;14:1202092. doi: 10.3389/fpls.2023.1202092. eCollection 2023. Front Plant Sci. 2023. PMID: 37936937 Free PMC article.
-
Enhancing ransomware defense: deep learning-based detection and family-wise classification of evolving threats.PeerJ Comput Sci. 2024 Nov 29;10:e2546. doi: 10.7717/peerj-cs.2546. eCollection 2024. PeerJ Comput Sci. 2024. PMID: 39678277 Free PMC article.
-
Machine Learning for AI Breeding in Plants.Genomics Proteomics Bioinformatics. 2024 Sep 13;22(4):qzae051. doi: 10.1093/gpbjnl/qzae051. Genomics Proteomics Bioinformatics. 2024. PMID: 38954837 Free PMC article. No abstract available.
-
Artificial Intelligence in Evaluation of Permanent Impairment: New Operational Frontiers.Healthcare (Basel). 2023 Jul 8;11(14):1979. doi: 10.3390/healthcare11141979. Healthcare (Basel). 2023. PMID: 37510420 Free PMC article.
References
REFERENCES
-
- Abdalla, A., Cen, H., Wan, L., Rashid, R., Weng, H., Zhou, W. et al. (2019) Fine-tuning convolutional neural network with transfer learning for semantic segmentation of ground-level oilseed rape images in a field with high weed pressure. Computers and Electronics in Agriculture, 167, 105091.
-
- Amodio, M., van Dijk, D., Srinivasan, K., Chen, W.S., Mohsen, H., Moon, K.R. et al. (2019) Exploring single-cell data with deep multitasking neural networks. Nature Methods, 16, 1139-1145.
-
- Anowar, F., Sadaoui, S. & Selim, B. (2021) Conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE). Computer Science Review, 40, 100378.
-
- Aromolaran, O., Aromolaran, D., Isewon, I. & Oyelade, J. (2021) Machine learning approach to gene essentiality prediction: a review. Briefings in Bioinformatics, 22. https://doi.org/10.1093/bib/bbab128
-
- Azodi, C.B., Tang, J. & Shiu, S.H. (2020) Opening the black box: interpretable machine learning for geneticists. Trends in Genetics, 36, 442-455.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources