Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 May 21:51:414-430.
doi: 10.1016/j.bioactmat.2025.03.023. eCollection 2025 Sep.

Quantum machine learning-based electrokinetic mining for the identification of nanoparticles and exosomes with minimal training data

Affiliations

Quantum machine learning-based electrokinetic mining for the identification of nanoparticles and exosomes with minimal training data

Abhimanyu Thakur et al. Bioact Mater. .

Abstract

Synthetic and naturally occurring particles, such as nanoparticles (NPs) and exosomes; a type of extracellular vesicles (EVs), have garnered widespread attention across various fields, including biomaterials, oncology, and delivery systems for drugs and vaccines. Traditional methods for identifying NPs and EVs, such as transmission electron microscopy, are often prohibitively expensive and labor-intensive. As an alternative, the assessment of electrokinetic attributes such as zeta potential or electrophoretic mobility, conductance, and mean count rate, offers a more cost-effective, rapid, and reliable means of characterizing these particles. In this context, we introduce the first application of a quantum machine learning (QML)-based electrokinetic mining for the identification of green-synthesized iron- and cobalt-based NPs, as well as exosomes derived from human embryonic stem cells (hESC), human lung cancer (A549) cells, and colorectal cancer (CRC) cells, based solely on their electrokinetic attributes. Comparative analyses involving cross-validation, train-test splits, confusion matrices, and Receiver Operating Characteristic (ROC) curves revealed that classical ML techniques could accurately identify the types of NPs and EVs. Notably, QML demonstrated proficiency in differentiating between various NPs and EVs, including the distinction of EVs in the plasma of CRC patients versus those of healthy individuals. Furthermore, QML's application has been extended to the identification of NPs along with EVs in the plasma of CRC patients and experimental mice, achieving higher prediction performance even with a minimal training dataset, demonstrating that QML based electrokinetic mining could identify NPs or EVs with minimal training data, thereby facilitating novel clinical development in the realm of liquid biopsies.

Keywords: Electrokinetic analysis; Extracellular vesicle; Liquid biopsy; Machine learning; Nanoparticle; Quantum machine learning.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Image 1
Graphical abstract
Fig. 1
Fig. 1
Characterization and Electrokinetic analysis of iron- and cobalt-nanoparticles. (A, B) Representative transmission electron microscopy (TEM) image of (A) iron NPs, and (B) cobalt NPs. Scale bar: 20 nm. Both NP populations are mostly spherical in shape. (CD) Representative graphs showing distribution of apparent ZP vs total count of (C) iron NPs, and (D) cobalt NPs for six samples of NPs. Each color represents a sample of NPs. (EG) Bar graphs showing the mean - (E) zeta potential (ZP), (F) electrophoretic mobility (Mob), and (G) conductivity for iron (black) and cobalt (gray) NPs. Error bars display the standard error of the mean (N = 3). (H, I) Time-dependent phase of (H) iron NPs, and (I) cobalt NPs. Student's t-test was applied for comparison: iron NPs vs. cobalt NPs; Statistical significance was set at ∗P < 0.05, ∗∗P < 0.01.
Fig. 2
Fig. 2
Visualization of the data distribution of the raw datasets using scatter plot analysis. (AD) Pairwise comparisons of electrokinetic features, namely ZP, Mob, Cond, and MCR-for iron (blue) and cobalt (red) NPs (N = 1016). (EH) Pairwise comparisons of electrokinetic features for A549- (blue) and hESC- (red) EVs (N = 1033).
Fig. 3
Fig. 3
Performance evaluation of classical ML models to predict the types of NPs and EVs via confusion matrix and ROC curve. (AH) Confusion matrices, binary classification metrics (precision, recall, F1 score), ROC, and three-dimensional scatter plots for each ML model trained for the identification of NPs. (IP) Confusion matrices, binary classification metrics (precision, recall, F1 score), ROC, and three-dimensional scatter plots for each ML model trained for the identification of EVs. (A, E, I, M) The confusion matrix is a table that summarizes the performance of a classification model. The rows of the confusion matrix represent the true NP classes, while the columns represent the predicted NP classes by the ML models. (B, F, J, N) Precision measures the proportion of correctly predicted positive instances out of all predicted positive instances, while recall measures the proportion of correctly predicted positive instances out of all actual positive instances. The F1 score is the harmonic mean of precision and recall, and it is a measure of a model's overall performance. (C, G, K, O) The AUC-ROC is a measure of the overall performance of the model. A perfect model would have an AUC of 1, while a random model would have an AUC of 0.5. (D, H, L, P) Three-dimension scatter plots showing the distribution of predicted data via LR, and XGB on the held out 30 % test set. (Q, R) SHAP value analysis showing the contribution of major attributes for LR-based identification of (Q) NPs and (R) EVs. For the degree of uncertainty in sample was represented in (C,G,K,O) ROC plots via confidence interval (CI) with probability limit of 95 %.
Fig. 4
Fig. 4
Performance evaluation of quantum ML model and comparative analysis of its prediction performance with classical ML models with extremely small training set for the identification of NPs. Representative (AC) confusion matrix, (DF) ROC curves with confidence interval, and (GI) three-dimensional scatter plots for QML with (A, D, G) 70 % training set, (B, E, H) 50 % of initial training set, and (C, F, I) 10 % of initial training set. The test accuracies were evaluated on the held out 30 % test set. The confusion matrix shows the number of correctly and incorrectly classified NPs for each type of NP. The ROC curve shows the trade-off between the true positive rate (TPR) and the false positive rate (FPR) for different thresholds. As can be seen from the figures, the VQC model can achieve high prediction performance in identifying the types of NPs, even with a smaller training set. (JP) Representative three-dimensional scatter plots showing the NP prediction ability via (J–N) classical ML models: LR, SVM, KNN, DT, and XGB, and (O) VQC, a QML with 0.5 % training set. (P) Bar graph showing that QML outperformed the classical ML models for the identification of NPs, even when trained with 0.5 % training set. For precision estimation, the degree of uncertainty in sample was represented in (D–F) ROC plots via confidence interval (CI) with probability limit of 95 %.
Fig. 5
Fig. 5
QML based electrokinetic mining could detect the NPs in mouse plasma, and plasma from healthy individuals and cancer patients. (A) A schematic diagram showing the experimental plan for the collection of plasma from control- and NPs injected-mice, followed by recording of electrokinetic data, and the ML based training and prediction modules. (BE) ROC and 3D scatter plots, and (F) the quantitative bar graph, showing the prediction performance to identify NPs in the plasma of mice using QML trained with 70 %, 50 %, 10 %, or 1 % data. (G) A schematic diagram showing the experimental plan for the collection of plasma from healthy individuals and CRC patients, and separately spiking NPs in a separate group of plasma from healthy individuals and CRC patients, followed by measurements of electrokinetic data, and the ML based training and prediction modules. (HK) ROC and 3D scatter plots showing the QML based identification of NPs in plasma of (H) healthy and (J) cancer patients, and the corresponding bar graphs, showing the prediction performance of QML trained with 70 %, 50 %, 10 %, and 5 % data, to identify NPs in the plasma of healthy and cancer patients. (F, I, K) Bar graphs showed that QML could maintain high prediction performance, even with massive reductions in training set size. For precision estimation, the degree of uncertainty in sample was represented in ROC plots via CI (showed with confidence shape) with probability limit of 95 %.
Fig. 6
Fig. 6
QML based electrokinetic mining with fewer observations identifies EVs from the plasma of healthy and cancer patients, and EVs mixed with NPs. (A) A schematic diagram showing the experimental plan for the collection of plasma from healthy and cancer patients, followed by the isolation of EVs and their spiking with NPs. Then, electrokinetic recording was conducted on the plasma derived EVs without or with NPs. (BD) ROC curves and confusion matrices showing the classification of (B) healthy EVs vs cancer EVs, (C) healthy EVs vs healthy EVs spiked with NPs, and (D) cancer EVs vs cancer EVs spiked with NPs by QML trained with 70 % and 5 % data. (E) Quantitative bar graphs showing the prediction performance by QML at training set of 70 % and 5 %, inferring that QML could sustain the percentage of prediction performance, even with massive decrease in training set. For precision estimation, the degree of uncertainty in sample was represented in ROC plots via CI (showed with confidence shape) with probability limit of 95 %. In Fig. 6E, the statistical significance was determined via multiple t-tests, followed by the Holm-Sidak method, with P value = 0.05.

Similar articles

References

    1. Patra J.K., Das G., Fraceto L.F., Campos E.V.R., Rodriguez-Torres M. del P., Acosta-Torres L.S., Diaz-Torres L.A., Grillo R., Swamy M.K., Sharma S., Habtemariam S., Shin H.-S. Nano based drug delivery systems: recent developments and future prospects. J. Nanobiotechnol. 2018;16:71. doi: 10.1186/s12951-018-0392-8. - DOI - PMC - PubMed
    1. Fathi-Achachelouei M., Knopf-Marques H., Ribeiro da Silva C.E., Barthès J., Bat E., Tezcaner A., Vrana N.E. Use of nanoparticles in tissue engineering and regenerative medicine. Front. Bioeng. Biotechnol. 2019;7 doi: 10.3389/fbioe.2019.00113. - DOI - PMC - PubMed
    1. Malekzad H., Sahandi Zangabad P., Mirshekari H., Karimi M., Hamblin M.R. Noble metal nanoparticles in biosensors: recent studies and applications. Nanotechnol. Rev. 2017;6 doi: 10.1515/ntrev-2016-0014. - DOI - PMC - PubMed
    1. Khan I., Saeed K., Khan I. Nanoparticles: properties, applications and toxicities. Arab. J. Chem. 2017 doi: 10.1016/j.arabjc.2017.05.011. - DOI
    1. Thakur A., Ke X., Chen Y.-W., Motallebnejad P., Zhang K., Lian Q., Chen H.J. The mini player with diverse functions: extracellular vesicles in cell biology, disease, and therapeutics. Protein Cell. 2022;13:631–654. doi: 10.1007/s13238-021-00863-6. - DOI - PMC - PubMed

LinkOut - more resources