Dataset of near-infrared (NIR) spectral data for prediction of organic matter and total carbon in agricultural soil using homemade NIR spectrometer
- PMID: 40677255
- PMCID: PMC12269513
- DOI: 10.1016/j.dib.2025.111840
Dataset of near-infrared (NIR) spectral data for prediction of organic matter and total carbon in agricultural soil using homemade NIR spectrometer
Abstract
The paper presents the spectroscopic data obtained from a homemade NIR spectrometer developed for agricultural quality analysis, along with the calibration and validation of a model database for predicting agricultural soil properties. We collected NIR spectral data from 190 soil samples taken at a depth of 0-20 cm from agricultural areas in northern Thailand, including vegetable farms, orchards, and field crops. The acquisition process started by air-drying the soil and sieving it through 2.0 mm and 0.5 mm mesh. Six preprocessing techniques, including Savitzky-Golay smoothing, multiplicative scatter correction (MSC), standard normal variate (SNV), first derivative, second derivative, and mean centering, were used with partial least squares (PLS) regression to create the prediction model for soil organic matter and total carbon. Seventy percent of the sample was divided into calibration and the remaining thirty percent was validation. The most suitable model for assessing soil organic matter (SOM) and total carbon is Savitzky-Golay smoothing through the PLSR model, with a coefficient of determination (R2) of 0.79 and 0.78, a root mean square error (RMSE) of 0.701% and 0.382% for validation samples, respectively. Thus, the NIR dataset spanning 900-1,700 nm proved to be an ideal wavelength range for developing a portable/handheld NIR spectrometer, with potential for further accuracy improvements through model refinement.
Keywords: Chemometric; Model development; Pre-processing technique; Soil fertility; Soil spectroscopy.
© 2025 The Author(s).
Figures




Similar articles
-
Rapid Determination of Polysaccharides in Cistanche Tubulosa Using Near-Infrared Spectroscopy Combined with Machine Learning.J AOAC Int. 2023 Jul 17;106(4):1118-1125. doi: 10.1093/jaoacint/qsac144. J AOAC Int. 2023. PMID: 36355447
-
PCA- and PLSR-Based Machine Learning Model for Prediction of Urea-N Content in Heterogeneous Soils Using Near-Infrared Spectroscopy.Sensors (Basel). 2025 Jul 4;25(13):4176. doi: 10.3390/s25134176. Sensors (Basel). 2025. PMID: 40648429 Free PMC article.
-
Building a spectral soil library in a soil routine analysis laboratory to determine soil organic carbon using compact near-infrared spectrophotometers: Performance of global and local models.Spectrochim Acta A Mol Biomol Spectrosc. 2026 Jan 5;344(Pt 1):126707. doi: 10.1016/j.saa.2025.126707. Epub 2025 Jul 16. Spectrochim Acta A Mol Biomol Spectrosc. 2026. PMID: 40695064
-
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340. Health Technol Assess. 2006. PMID: 16959170
-
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780. Cochrane Database Syst Rev. 2024. PMID: 39679851 Free PMC article.
References
-
- Padhiary M., Saha D., Kuma R., Sethi L.N., Kumar A. Enhancing precision agriculture: a comprehensive review of machine learning and AI vision applications in all-terrain vehicle for farm automation. Smart Agric. Technol. 2024;8 doi: 10.1016/j.atech.2024.100483. - DOI
-
- Soriano-Disla J.M., Janik L.J., Viscarra Rossel R.A., Macdonald L.M., McLaughlin M.J. The performance of visible, near-, and mid-infrared reflectance spectroscopy for prediction of soil physical, chemical, and biological properties. Appl. Spectrosc. Rev. 2014;49(2):139–186. doi: 10.1080/05704928.2013.811081. - DOI
-
- Rossel R.A.V., Walvoort D.J.J., McBratney A.B., Janik L.J., Skjemstad J.O. Visible, near infrared, mid infrared or combined diffuse reflectance spectroscopy for simultaneous assessment of various soil properties. Geoderma. 2006;131:59–75. doi: 10.1016/j.geoderma.2005.03.007. - DOI
LinkOut - more resources
Full Text Sources
Miscellaneous