Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology
- PMID: 35821601
- DOI: 10.1111/tpj.15905
Abstract
Advances in high-throughput omics technologies are leading plant biology research into the era of big data. Machine learning (ML) plays an important role in plant systems biology because of its excellent performance and wide application in the analysis of big data. However, to achieve ideal performance, supervised ML algorithms require large numbers of labeled samples as training data. In some cases, it is impossible or prohibitively expensive to obtain enough labeled training data; here, the paradigms of unsupervised learning (UL) and semi-supervised learning (SSL) play an indispensable role. In this review, we first introduce the basic concepts of ML techniques, as well as some representative UL and SSL algorithms, including clustering, dimensionality reduction, self-supervised learning (self-SL), positive-unlabeled (PU) learning and transfer learning. We then review recent advances and applications of UL and SSL paradigms in both plant systems biology and plant phenotyping research. Finally, we discuss the limitations and highlight the significance and challenges of UL and SSL strategies in plant systems biology.
Keywords: deep learning; machine learning; plant systems biology; semi-supervised learning; unsupervised learning.
© 2022 Society for Experimental Biology and John Wiley & Sons Ltd.
