Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 9;11(11):3005.
doi: 10.3390/biomedicines11113005.

Morphological Signal Processing for Phenotype Recognition of Human Pluripotent Stem Cells Using Machine Learning Methods

Affiliations

Morphological Signal Processing for Phenotype Recognition of Human Pluripotent Stem Cells Using Machine Learning Methods

Ekaterina Vedeneeva et al. Biomedicines. .

Abstract

Human pluripotent stem cells have the potential for unlimited proliferation and controlled differentiation into various somatic cells, making them a unique tool for regenerative and personalized medicine. Determining the best clone selection is a challenging problem in this field and requires new sensing instruments and methods able to automatically assess the state of a growing colony ('phenotype') and make decisions about its destiny. One possible solution for such label-free, non-invasive assessment is to make phase-contrast images and/or videos of growing stem cell colonies, process the morphological parameters ('morphological portrait', or signal), link this information to the colony phenotype, and initiate an automated protocol for the colony selection. As a step in implementing this strategy, we used machine learning methods to find an effective model for classifying the human pluripotent stem cell colonies of three lines according to their morphological phenotype ('good' or 'bad'), using morphological parameters from the previously published data as predictors. We found that the model using cellular morphological parameters as predictors and artificial neural networks as the classification method produced the best average accuracy of phenotype prediction (67%). When morphological parameters of colonies were used as predictors, logistic regression was the most effective classification method (75% average accuracy). Combining the morphological parameters of cells and colonies resulted in the most effective model, with a 99% average accuracy of phenotype prediction. Random forest was the most efficient classification method for the combined data. We applied feature selection methods and showed that different morphological parameters were important for phenotype recognition via either cellular or colonial parameters. Our results indicate a necessity for retaining both cellular and colonial morphological information for predicting the phenotype and provide an optimal choice for the machine learning method. The classification models reported in this study could be used as a basis for developing and/or improving automated solutions to control the quality of human pluripotent stem cells for medical purposes.

Keywords: best clone; human embryonic stem cells; human pluripotent stem cells; machine learning; morphological phenotype.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Figures

Figure 1
Figure 1
Schematic representation of a device designed for the automated best clone selection. Red arrows indicate the direction of information processing within the device.
Figure 2
Figure 2
Performance of classification models trained on the combined cellular and colonial data: (a) Box plots for the nested cross-validation accuracy. Orange lines show the median accuracy values, boxes represent the interval between the lower and upper quartiles, whiskers mark the minimum and maximum accuracy values, and circles are outliers; (b) ROC-curves. The dashed line represents a random classifier, which assigns phenotypes randomly. NB: naïve Bayes; KNN: k-nearest neighbors; LR: logistic regression; SVM: support vector machines; RF: random forest; NN: neural networks.
Figure 3
Figure 3
Mean SHAP values representing the importance of the morphological features in the best classification models based on (a) cell data, (b) colony data, and (c) combined data. Names of cellular parameters start with ‘Cell’, and names of colonial ones start with ‘Col’. A: Area; P: Perimeter; MA: Minor axis; FD: Feret’s diameter D; MFD: Minimal Feret’s diameter D; SF: Shape factor; AIS: Area of intercellular space (only for colonies).

Similar articles

Cited by

References

    1. Mousavikhamene Z., Sykora D.J., Mrksich M., Bagheri N. Morphological Features of Single Cells Enable Accurate Automated Classification of Cancer from Non-Cancer Cell Lines. Sci. Rep. 2021;11:24375. doi: 10.1038/s41598-021-03813-8. - DOI - PMC - PubMed
    1. Gosnell M.E., Anwer A.G., Mahbub S.B., Menon Perinchery S., Inglis D.W., Adhikary P.P., Jazayeri J.A., Cahill M.A., Saad S., Pollock C.A., et al. Quantitative Non-Invasive Cell Characterisation and Discrimination Based on Multispectral Autofluorescence Features. Sci. Rep. 2016;6:23453. doi: 10.1038/srep23453. - DOI - PMC - PubMed
    1. Basu S., Kolouri S., Rohde G.K. Detecting and Visualizing Cell Phenotype Differences from Microscopy Images Using Transport-Based Morphometry. Proc. Natl. Acad. Sci. USA. 2014;111:3448–3453. doi: 10.1073/pnas.1319779111. - DOI - PMC - PubMed
    1. Di Z., Klop M.J.D., Rogkoti V.-M., Le Dévédec S.E., van de Water B., Verbeek F.J., Price L.S., Meerman J.H.N. Ultra High Content Image Analysis and Phenotype Profiling of 3D Cultured Micro-Tissues. PLoS ONE. 2014;9:e109688. doi: 10.1371/journal.pone.0109688. - DOI - PMC - PubMed
    1. Stanley N., Stelzer I.A., Tsai A.S., Fallahzadeh R., Ganio E., Becker M., Phongpreecha T., Nassar H., Ghaemi S., Maric I., et al. VoPo Leverages Cellular Heterogeneity for Predictive Modeling of Single-Cell Data. Nat. Commun. 2020;11:3738. doi: 10.1038/s41467-020-17569-8. - DOI - PMC - PubMed