Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Sep;111(6):1527-1538.
doi: 10.1111/tpj.15905. Epub 2022 Jul 27.

Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology

Affiliations
Free article
Review

Unsupervised and semi-supervised learning: the next frontier in machine learning for plant systems biology

Jun Yan et al. Plant J. 2022 Sep.
Free article

Abstract

Advances in high-throughput omics technologies are leading plant biology research into the era of big data. Machine learning (ML) performs an important role in plant systems biology because of its excellent performance and wide application in the analysis of big data. However, to achieve ideal performance, supervised ML algorithms require large numbers of labeled samples as training data. In some cases, it is impossible or prohibitively expensive to obtain enough labeled training data; here, the paradigms of unsupervised learning (UL) and semi-supervised learning (SSL) play an indispensable role. In this review, we first introduce the basic concepts of ML techniques, as well as some representative UL and SSL algorithms, including clustering, dimensionality reduction, self-supervised learning (self-SL), positive-unlabeled (PU) learning and transfer learning. We then review recent advances and applications of UL and SSL paradigms in both plant systems biology and plant phenotyping research. Finally, we discuss the limitations and highlight the significance and challenges of UL and SSL strategies in plant systems biology.

Keywords: deep learning; machine learning; plant systems biology; semi-supervised learning; unsupervised learning.

PubMed Disclaimer

Similar articles

Cited by

References

REFERENCES

    1. Abdalla, A., Cen, H., Wan, L., Rashid, R., Weng, H., Zhou, W. et al. (2019) Fine-tuning convolutional neural network with transfer learning for semantic segmentation of ground-level oilseed rape images in a field with high weed pressure. Computers and Electronics in Agriculture, 167, 105091.
    1. Amodio, M., van Dijk, D., Srinivasan, K., Chen, W.S., Mohsen, H., Moon, K.R. et al. (2019) Exploring single-cell data with deep multitasking neural networks. Nature Methods, 16, 1139-1145.
    1. Anowar, F., Sadaoui, S. & Selim, B. (2021) Conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE). Computer Science Review, 40, 100378.
    1. Aromolaran, O., Aromolaran, D., Isewon, I. & Oyelade, J. (2021) Machine learning approach to gene essentiality prediction: a review. Briefings in Bioinformatics, 22. https://doi.org/10.1093/bib/bbab128
    1. Azodi, C.B., Tang, J. & Shiu, S.H. (2020) Opening the black box: interpretable machine learning for geneticists. Trends in Genetics, 36, 442-455.

Publication types

LinkOut - more resources