Application of Stacking-Based Ensemble Learning Model for Water Quality Prediction
Longfeng Zhang *
School of Science, Southwest University of Science and Technology, Mianyang 621010, China.
Yiqi Yang
School of Science, Southwest University of Science and Technology, Mianyang 621010, China.
Yanqiao Deng
School of Science, Southwest University of Science and Technology, Mianyang 621010, China.
Hao Kang
School of Science, Southwest University of Science and Technology, Mianyang 621010, China.
Tingting Hua-ng
School of Science, Southwest University of Science and Technology, Mianyang 621010, China.
*Author to whom correspondence should be addressed.
Abstract
Water is the source of life, and the growth of animals and plants cannot leave the water source. The quality of water will directly affect the life and health of humans, animals and plants. In order to predict the concentration and changing trend of various pollutants in water bodies and promote the comprehensive management of water resources, this paper proposes a new integrated model based on the idea of Stacking integrated learning. The model is based on XGBoost, support vector regression, and multi-layer perceptron. The model is constructed with ridge regression as the meta-model. The model was applied to the pH and total nitrogen content of water quality, and the mean absolute percentage error was used to quantitatively evaluate the prediction results, and the results of the ensemble learning model were compared with the prediction results of a single base model. The results show that the stacking ensemble learning idea can effectively improve the prediction ability and generalization performance of the base model. The proposed ensemble learning model has very good prediction ability and generalization ability, and has great potential in other prediction fields such as water quality.
Keywords: Ensemble learning, water quality prediction, multi-step prediction, machine learning