Environ Sci Pollut Res Int. 2018 Apr;25(12):12139-12149.
doi: 10.1007/s11356-018-1438-z. Epub 2018 Feb 17.

Analysing the accuracy of machine learning techniques to develop an integrated influent time series model: case study of a sewage treatment plant, Malaysia

Mozafar Ansari et al. Environ Sci Pollut Res Int. 2018 Apr.

Abstract

The function of a sewage treatment plant is to treat sewage to acceptable standards before it is discharged into the receiving waters. To design and operate such plants, it is necessary to measure and predict the influent flow rate. In this research, the influent flow rate of a sewage treatment plant (STP) was modelled and predicted by autoregressive integrated moving average (ARIMA), nonlinear autoregressive network (NAR) and support vector machine (SVM) regression time series algorithms. To evaluate the models' accuracy, the root mean square error (RMSE) and coefficient of determination (R²) were calculated as initial assessment measures, while relative error (RE), peak flow criterion (PFC) and low flow criterion (LFC) were calculated as final evaluation measures to demonstrate the detailed accuracy of the selected models. An integrated model was developed based on the individual models' prediction ability for low, average and peak flow. An initial assessment of the results showed that the ARIMA model was the least accurate and the NAR model was the most accurate. The RE results also showed that the SVM model produced errors above 10% or below -10% more frequently than the NAR model. The influent was also forecast up to 44 weeks ahead by both models. The graphical results indicate that the NAR model made better predictions than the SVM model. The final evaluation of NAR and SVM demonstrated that SVM made better predictions at peak flow, while NAR fit well for the low and average inflow ranges. The integrated model therefore uses the NAR model for low and average influent and the SVM model for peak inflow.
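As an illustration of the evaluation measures named in the abstract, the following is a minimal Python sketch of RMSE, R² and RE, plus one possible way to combine two forecasts by flow regime. It is not the authors' implementation. The function names, the sign convention for RE (predicted minus observed), the choice of R² as 1 - SS_res/SS_tot, and the caller-supplied `is_peak` mask are all assumptions made here; the abstract does not state how the flow regime is identified, and PFC and LFC are omitted because their formulas are not given.

```python
import numpy as np


def rmse(observed, predicted):
    """Root mean square error between observed and predicted flows."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((observed - predicted) ** 2)))


def r_squared(observed, predicted):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    ss_res = np.sum((observed - predicted) ** 2)
    ss_tot = np.sum((observed - observed.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)


def relative_error_percent(observed, predicted):
    """Per-sample relative error in percent (assumed sign: predicted - observed)."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return 100.0 * (predicted - observed) / observed


def share_outside_band(re_percent, band=10.0):
    """Fraction of samples whose relative error is above +band or below -band."""
    return float(np.mean(np.abs(np.asarray(re_percent, dtype=float)) > band))


def integrated_forecast(nar_pred, svm_pred, is_peak):
    """Use the SVM forecast where is_peak is True and the NAR forecast elsewhere.

    is_peak is a hypothetical boolean mask supplied by the caller.
    """
    return np.where(np.asarray(is_peak, dtype=bool),
                    np.asarray(svm_pred, dtype=float),
                    np.asarray(nar_pred, dtype=float))


if __name__ == "__main__":
    # Made-up numbers for demonstration only; they are not data from the study.
    obs = [100.0, 200.0, 400.0]
    pred = [110.0, 190.0, 400.0]
    re = relative_error_percent(obs, pred)
    print("RMSE:", rmse(obs, pred))
    print("R2:", r_squared(obs, pred))
    print("Share of |RE| > 10%:", share_outside_band(re))
    print("Integrated:", integrated_forecast([1.0, 2.0, 3.0], [10.0, 20.0, 30.0],
                                             [False, False, True]))
```

Comparing `share_outside_band` across two models mirrors the abstract's comparison of how often the SVM and NAR errors fall outside the ±10% band.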

Keywords: ARIMA; Influent; Integrated SVM-NAR model; Recurrent neural network; Support vector machine; Time series model.
