Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan;30(3):7851-7873.
doi: 10.1007/s11356-022-22601-z. Epub 2022 Sep 1.

The development of dissolved oxygen forecast model using hybrid machine learning algorithm with hydro-meteorological variables

Affiliations

The development of dissolved oxygen forecast model using hybrid machine learning algorithm with hydro-meteorological variables

Abul Abrar Masrur Ahmed et al. Environ Sci Pollut Res Int. 2023 Jan.

Abstract

Dissolved oxygen (DO) forecasting is essential for aquatic managers responsible for maintaining ecosystem health and the management of water bodies affected by water quality parameters. This paper aims to forecast dissolved oxygen (DO) concentration using a multivariate adaptive regression spline (MARS) hybrid model coupled with maximum overlap discrete wavelet transformation (MODWT) as a feature decomposition approach for Surma River water using a set of water quality hydro-meteorological variables. The proposed hybrid model is compared with numerous machine learning methods, namely Bayesian ridge regression (BNR), k-nearest neighbourhood (KNN), kernel ridge regression (KRR), random forest (RF), and support vector regression (SVR). The investigational results show that the proposed model of MODWT-MARS has a better prediction than the comparing benchmark models and individual standalone counter parts. The result shows that the hybrid algorithms (i.e. MODWT-MARS) outperformed the other models (r = 0.981, WI = 0.990, RMAE = 2.47%, and MAE = 0.089). This hybrid method may serve to forecast water quality variables with fewer predictor variables.

Keywords: Bangladesh; Dissolved oxygen; Forecasting; Hybrid model; MARS; MODWT; Surma River.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The study region showing the Keane Bridge station of Surma River, Sylhet, Bangladesh
Fig. 2
Fig. 2
The study's workflow details the steps in the model designing phase and the proposed hybrid CEEMDAN-MARS predictive models. Note: IMF = Intrinsic Mode Function, CCF = Cross-Correlation Functions, PACF = partial autocorrelation function, CEEMDAN = complete ensemble empirical mode decomposition with adaptive noise and DO = Dissolved Oxygen (mg/l)
Fig. 3
Fig. 3
Time series of the a maximum overlap discrete wavelet coefficient (MODWC) of Dissolved Oxygen using MODWT, and intrinsic mode functions (IMFs) and the residual components after decomposing the DO in the training period using b CEEMDAN and c EEMD. The time series of the actual DO is plotted at the top of the figure
Fig. 4
Fig. 4
Partial autocorrelation function (PACF) plot of the DO time series exploring the antecedent behaviour in terms of the lag of daily DO. The blue line in the figures indicates the ± 95% confidence level
Fig. 5
Fig. 5
An analysis of the statistically significant cross-correlation function plots of a actual variables vs DO, b IMF1 of all variables vs IMF1 of DO, c IMF2 of all variables vs IMF2 of DO, d IMF3 of all variables vs IMF3 of DO, e IMF4 of all variables vs IMF4 of DO, f) residuals of all variables vs residuals of DO
Fig. 5
Fig. 5
An analysis of the statistically significant cross-correlation function plots of a actual variables vs DO, b IMF1 of all variables vs IMF1 of DO, c IMF2 of all variables vs IMF2 of DO, d IMF3 of all variables vs IMF3 of DO, e IMF4 of all variables vs IMF4 of DO, f) residuals of all variables vs residuals of DO
Fig. 6
Fig. 6
Box plots of hybrid models (MODWT-MARS) and their respective standalone counterparts (i.e. MARS, BNR, KRR, KNN, RNN, and SVR) in forecasting DO compare to the observed DO of Surma River
Fig. 7
Fig. 7
Empirical cumulative distribution function (CDF) of forecasted error |FE| of DO generated by the proposed MODWT-MARS and comparing models
Fig. 8
Fig. 8
Comparison of the forecasting skill of proposed models in terms of RRMSE (%) and MAPE (%) in the testing period
Fig. 9
Fig. 9
Tylor diagram representing correlation coefficient and the standard deviation difference for proposed hybrid models vs benchmark models
Fig. 10
Fig. 10
Scatter plot of forecasted vs observed DO, using a Bayesian ridge regression (BNR) and b multiple adaptive regression splines model using MODWT and CEEMDAN decomposition. A least square regression line and coefficient of determination (R2) with a linear fit equation are shown in each sub-panel
Fig. 11
Fig. 11
Comparison between forecasted DO and observed DO during model testing using MODWT-MARS and Standalone MARS model
Fig. 12
Fig. 12
Effect of a EMD, b EEMD, c CEEMDAN, d MODWT, and e DWT of the performance of six models based on r, LM, and APB

References

    1. Adamowski J, Fung Chan H, Prasher SO, Ozga-Zielinski B, Sliusarieva A (2012) Comparison of multiple linear and nonlinear regression, autoregressive integrated moving average, artificial neural network, and wavelet artificial neural network methods for urban water demand forecasting in Montreal, Canada. Water Resour Res 48(1)
    1. Agbinya JI (1996) Discrete wavelet transform techniques in speech processing, Proceedings of Digital Processing Applications (TENCON’96). IEEE 2:514–519
    1. Ahmed AAM. Prediction of dissolved oxygen in Surma River by biochemical oxygen demand and chemical oxygen demand using the artificial neural networks (ANNs) J King Saud Univ Eng Sci. 2017;29(2):151–158.
    1. Ahmed MH, Lin L-S. Dissolved oxygen concentration predictions for running waters with different land use land cover using a quantile regression forest machine learning technique. J Hydrol. 2021;597:126213.
    1. Ahmed AAM, Shah SMA. Application of adaptive neuro-fuzzy inference system (ANFIS) to estimate the biochemical oxygen demand (BOD) of Surma River. J King Saud Univ Eng Sci. 2017;29(3):237–243.

LinkOut - more resources