Application of Stacking-Based Ensemble Learning Model for Water Quality Prediction
Asian Research Journal of Mathematics,
Water is the source of life, and the growth of animals and plants cannot leave the water source. The quality of water will directly affect the life and health of humans, animals and plants. In order to predict the concentration and changing trend of various pollutants in water bodies and promote the comprehensive management of water resources, this paper proposes a new integrated model based on the idea of Stacking integrated learning. The model is based on XGBoost, support vector regression, and multi-layer perceptron. The model is constructed with ridge regression as the meta-model. The model was applied to the pH and total nitrogen content of water quality, and the mean absolute percentage error was used to quantitatively evaluate the prediction results, and the results of the ensemble learning model were compared with the prediction results of a single base model. The results show that the stacking ensemble learning idea can effectively improve the prediction ability and generalization performance of the base model. The proposed ensemble learning model has very good prediction ability and generalization ability, and has great potential in other prediction fields such as water quality.
- Ensemble learning
- water quality prediction
- multi-step prediction
- machine learning
How to Cite
Hounslow, Arthur W. Water quality data: analysis and interpretation. CRC Press; 2018.
Lu Hongfang, Xin Ma. Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere. 2020;249:126169.
Panaskar DB, et al. Evaluating groundwater suitability for the domestic, irrigation, and industrial purposes in Nanded Tehsil, Maharashtra, India, using GIS and statistics. Arabian Journal of Geosciences. 2016;9(13):1-16.
Kulisz Monika, et al. Forecasting water quality index in groundwater using artificial neural network. Energies. 2021;14(18):5875.
Barzegar, Rahim, et al. Multi-step water quality forecasting using a boosting ensemble multi-wavelet extreme learning machine model. Stochastic Environmental Research and Risk Assessment. 2018;32(3):799-813.
Khullar, Sakshi, Nanhey Singh. Water quality assessment of a river using deep learning Bi-LSTM methodology: forecasting and validation. Environmental Science and Pollution Research. 2021;1-15.
Abobakr Yahya, Abobakr Saeed, et al. Water quality prediction model based support vector machine model for ungauged river catchment under dual scenarios. Water. 2019;11(6):1231.
Barzegar, Rahim, Mohammad Taghi Aalami, Jan Adamowski. Short-term water quality variable prediction using a hybrid CNN–LSTM deep learning model. Stochastic Environmental Research and Risk Assessment. 2020;34(2):415-433.
Ahmed, Ali Najah, et al. Machine learning methods for better water quality prediction. Journal of Hydrology. 2019;578:124084.
Wagena Moges B, et al. Comparison of short-term streamflow forecasting using stochastic time series, neural networks, process-based, and Bayesian models. Environmental Modelling & Software. 2020;126:104669.
Gu J, Liu S, Zhou Z, Chalov SR, Zhuang Q. A Stacking Ensemble Learning Model for Monthly Rainfall Prediction in the Taihu Basin, China. Water. 2022;14:492.
Zhang L, Yang Y, Deng Y, Kang H. Forecasting of Road Traffic Flow Based on Harris Hawk Optimization and XGBoost. Journal of Advances in Mathematics and Computer Science. 2022;37(2):21-29.
Chen T, Guestrin C. Xgboost: A scalable tree boosting system; proceedings of the Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, F; 2016 [C].
Friedman Jerome H. Greedy function approximation: A gradient boosting machine. Annals of Statistics. 2001;1189-1232.
Raj Pethuru, Preetha Evangeline David. The Digital Twin Paradigm for Smarter Systems and Environments: The Industry Use Cases. Academic Press; 2020.
Liu Bing-Chun, et al. Urban air quality forecasting based on multi-dimensional collaborative Support Vector Regression (SVR): A case study of Beijing-Tianjin-Shijiazhuang. PloS One. 2017;12(7):e0179763.
Abstract View: 48 times
PDF Download: 19 times