Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 17;15(1):2307.
doi: 10.1038/s41598-025-85930-2.

Exploration of linear and interpretable models for quantification of cell parameters via contactless short-wave infrared hyperspectral sensing

Affiliations

Exploration of linear and interpretable models for quantification of cell parameters via contactless short-wave infrared hyperspectral sensing

Anjana Hevaganinge et al. Sci Rep. .

Abstract

The development of optical sensors for label-free quantification of cell parameters has numerous uses in the biomedical arena. However, using current optical probes requires the laborious collection of sufficiently large datasets that can be used to calibrate optical probe signals to true metabolite concentrations. Further, most practitioners find it difficult to confidently adapt black box chemometric models that are difficult to troubleshoot in high-stakes applications such as biopharmaceutical manufacturing. Replacing optical probes with contactless short-wave infrared (SWIR) hyperspectral cameras allows efficient collection of thousands of absorption signals in a handful of images. This high repetition allows for effective denoising of each spectrum, so interpretable linear models can quantify metabolites. To illustrate, an interpretable linear model called L-SLR is trained using small datasets obtained with a SWIR HSI camera to quantify fructose, viable cell density (VCD), glucose, and lactate. The performance of this model is also compared to other existing linear models, namely Partial Least Squares (PLS) and Non-negative Matrix Factorization (NMF). Using only 50% of the dataset for training, reasonable test performance of mean absolute error (MAE) and correlations (r2) are achieved for glucose (r2 = 0.88, MAE = 37 mg/dL), lactate (r2 = 0.93, MAE = 15.08 mg/dL), and VCD (r2 = 0.81, MAE = 8.6 × 105 cells/mL). Further, these models are also able to handle quantification of a metabolite like fructose in the presence of high background concentration of similar metabolite with almost identical chemical interactions in water like glucose. The model achieves reasonable quantification performance for large fructose level (100-1000 mg/dL) quantification (r2 = 0.92, MAE = 25.1 mg/dL) and small fructose level (< 60 mg/dL) concentrations (r2 = 0.85, MAE = 4.97 mg/dL) in complex media like Fetal Bovine Serum (FBS). Finally, the model provides sparse interpretable weight matrices that hint at the underlying solution changes that correlate to each cell parameter prediction.

Keywords: Contactless bio-sensor; Machine learning; Near infrared (NIR); Short wave infrared (SWIR).

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
An overview of the contactless, label free rapid quantification system using a short-wave infrared (SWIR) hyperspectral camera. (A) Collection of dataset of hyperspectral images and corresponding labels. (B) Collection of hyperspectral images in SWIR range for (C) training an encoder and sparse linear regression model for metabolite prediction.
Fig. 2
Fig. 2
Architecture of L-SLR. First, absorbance signal is converted to a polynomial basis (A), then masked based on weights from pretrained sparse linear regression model (C). Next, masked signal is passed into pretrained LDA model (E). LDA forms compressed latent space which is used to generate metabolite predictions via linear regression (F). Coefficients from LDA and SLR are then used to form weight matrix visuals (G). Models used in (C) and (E) are pretrained using a labeled dataset consisting of integer labels computed via Hilbert curve (B) that represent unique metabolite combinations.
Fig. 3
Fig. 3
Correlation Visualization between ground truth fructose and fructose predicted by PLS-R (A, C) and L-SLR (B, D) for test data collected in dataset conditions 2 and 4 (A, B) and conditions 5 and 7 (C, D). r2 and MAE are listed in the top left corner of each graph.
Fig. 4
Fig. 4
Correlation Visualization between ground truth fructose and fructose predicted by PLS-R (A, C) and L-SLR (B, D) for test data collected in dataset conditions 9 and 10 (A, B) and condition 8 (C, D). r2 and MAE are listed in the top left corner of each graph.
Fig. 5
Fig. 5
Correlation visualization between ground truth VCD and VCD predicted by PLS-R (A) and L-SLR (B) respectively in spent cell media are shown for randomly selected test fold. r2 and MAE are listed in the top left corner of each graph.
Fig. 6
Fig. 6
Correlation visualization between ground truth glucose and glucose predicted by PLS-R (A) and L-SLR (B) respectively in spent cell media are shown for randomly selected test fold. r2 and MAE are listed in the top left corner of each graph.
Fig. 7
Fig. 7
Correlation visualization between ground truth lactate and lactate predicted by PLS-R (A) and L-SLR (B) respectively in spent cell media are shown for randomly selected test fold. r2 and MAE are listed in the top left corner of each graph.
Fig. 8
Fig. 8
Cumulative sum weight matrices produced by L-SLR (AD) and PLS-R (EH) for bands and cross terms across all models evaluated for fructose quantification via datasets 9–11 (A, E), dataset 8 (B, F), datasets 2–4 (C, G) and datasets 5–7 (D, H). All weights are normalized by the respective maximum of each cumulative matrix to facilitate comparison of weight matrices across models and datasets alike.
Fig. 9
Fig. 9
Cumulative sum weight matrices produced by L-SLR (AC) and PLS-R (DF) for bands and cross terms across all models evaluated for quantification of VCD (A, D), glucose (B, E), and lactate (C, F) in spent cell media. All weights are normalized by the respective maximum of each cumulative matrix to facilitate comparison of weight matrices across models and datasets alike.

References

    1. Costariol, E. et al. Demonstrating the manufacture of human CAR-T cells in an automated stirred-tank bioreactor. Biotechnol. J.15, 2000177 (2020). - PubMed
    1. Miller, M., Drucker, W. R., Owens, J. E., Craig, J. W. & Woodward, H. Metabolism of intravenous fructose and glucose in normal and diabetic subjects. J. Clin. Invest.31, 115–125 (1952). - PMC - PubMed
    1. Kruszynska, Y. T., Meyer-Alber, A., Wollen, N. & McIntyre, N. Energy expenditure and substrate metabolism after oral fructose in patients with cirrhosis. J. Hepatol.19, 241–251 (1993). - PubMed
    1. Schwarz, H., Mäkinen, M. E., Castan, A. & Chotteau, V. Monitoring of amino acids and antibody N-glycosylation in high cell density perfusion culture based on Raman spectroscopy. Biochem. Eng. J.182, 108426 (2022).
    1. Ishigaki, M. et al. Development of an amino acid sequence-dependent analytical method for peptides using near-infrared spectroscopy. Analyst147, 3634–3642 (2022). - PubMed

Publication types

LinkOut - more resources