Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Feb 19;15(1):6067.
doi: 10.1038/s41598-025-90892-6.

Using visible and NIR hyperspectral imaging and machine learning for nondestructive detection of nutrient contents in sorghum

Affiliations

Using visible and NIR hyperspectral imaging and machine learning for nondestructive detection of nutrient contents in sorghum

Kai Wu et al. Sci Rep. .

Abstract

Nondestructive, rapid, and accurate detection of nutritional compositions in sorghum is crucial for agricultural and food industries. In our study, the crude protein, tannin, and crude fat contents of sorghum variety samples were taken as the research object. The visible near-infrared (VIS-NIR) hyperspectral of sorghum were measured by the indoor mobile scanning platform. The nutritional components were determined using chemical methods to analyze the differences in nutritional composition among different varieties. After preprocessing the original spectral, the competitive adaptive reweighted sampling (CARS) and bootstrapping soft shrinkage (BOSS) algorithms were used to coarsely extract the key variables. Subsequently, the iteratively retains informative variables (IRIV) was employed to assess the importance of these key variables, resulting in explanatory wavelength sets for crude protein, tannin, and crude fat. Finally, the partial least squares (PLS), back propagation (BP) and extreme learning machine (ELM) were utilized to establish detection models. The results indicated that the optimal wavelength variable sets for crude protein, tannin, and crude fat contained 41, 38, and 22 wavelength variables, respectively. The CARS-IRIV-PLS, BOSS-IRIV-PLS and BOSS-IRIV-ELM were suitable for detecting crude protein, tannin and crude fat, respectively. Meanwhile, the Rp2, RMSEp and RPDp values of the model were 0.69, 0.80% and 1.80, 0.88, 0.22% and 2.84, 0.61, 0.32% and 1.61, respectively. These detection models can be used for the effective estimation of the nutritional compositions in sorghum with VIS-NIR spectral data, and can provide an important basis for the application of food nutrition assessment.

Keywords: Detection model; Hyperspectral; Machine learning; Nondestructive; Nutrient contents; Sorghum.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The chemical analysis results of crude protein, tannin, and crude fat content in 93 sorghum varieties. (a) The box plots and normal distribution of crude protein, tannin, and crude fat in 279 sorghum samples. (b) The differences in crude protein, tannin, and crude fat content among 93 varieties of sorghum grains.
Fig. 2
Fig. 2
The pretreatment of spectral data for sorghum samples; (a) Raw spectra. (b) Spectral pre-processing by SNV; (c) Spectral pre-processing by SNV and Detrending; (d) Spectral pre-processing by SNV, Detrending and MSC.
Fig. 3
Fig. 3
The change of RMSECV value with the increase of iterations during the key wavelength extraction process of crude protein, tannin, and crude fat using the CARS algorithm. (a) Crude protein; (b) Tannin; (c) Crude fat.
Fig. 4
Fig. 4
The change of RMSECV value with the increase of iterations during the key wavelength extraction process of crude protein, tannin, and crude fat using the BOSS algorithm. (a) Crude protein; (b) Tannin; (c) Crude fat.
Fig. 5
Fig. 5
Distribution of the strong information, weak information, no information and interference information wavelengths of key wavelengths of the crude protein, tannin and crude fat in grain spectra of sorghum. (a) Crude protein, (b) Tannin, (c) Crude fat.
Fig. 6
Fig. 6
Fitting results of the calibration set and prediction set for each index in the optimal simultaneous detection model. (a) Crude protein; (b) Tannin; (c) Crude fat.

Similar articles

References

    1. Khoddami, A. et al. Sorghum in foods: Functionality and potential in innovative products. Crit. Rev. Food Sci. Nutr.63 (9), 1170–1186. 10.1080/10408398.2021.1960793 (2023). - PubMed
    1. Carcedo, A. J. et al. Environment characterization in Sorghum (Sorghum bicolor L.) by modeling water-deficit and heat patterns in the Great Plains Region, United States. Fronit. Plant. Sci.13, 768610. 10.3389/fpls.2022.768610 (2022). - PMC - PubMed
    1. Bakari, H. et al. Sorghum (Sorghum bicolor L. Moench) and its main parts (by-products) as promising sustainable sources of value-added ingredients. Waste Biomass Valoriz.14 (4), 1023–1044. 10.1007/s12649-022-01992-7 (2023).
    1. Wang, H. et al. Regulation of density and fertilization on crude protein synthesis in forage maize in a semiarid rain-Fed area. Agriculture13 (3), 715. 10.3390/agriculture13030715 (2023).
    1. Zeng, X., Jiang, W., Du, Z. & Kokini, J. L. Encapsulation of tannins and tannin-rich plant extracts by complex coacervation to improve their physicochemical properties and biological activities: A review. Crit. Rev. Food Sci. Nutr.63, 3005–3018. 10.1080/10408398.2022.2075313 (2023). - PubMed

LinkOut - more resources