Uncertainty analysis of artificial neural network (ANN) and support vector machine (SVM) models in predicting monthly river flow (Case study: Ghezelozan River)

Document Type : Research/Original/Regular Article

Authors

1 Expert, Regional Water Company of Zanjan, Zanjan, Iran

2 M.Sc. Student, Department of Water Engineering, Faculty of Agriculture, University of Tabriz, Tabriz, Iran

Abstract

Introduction
River flow forecasting has been one of the important challenges of water resources management in recent decades, so many researchers have proposed different methods to improve the performance of forecasting models. In the last decade, artificial intelligence methods have been most widely used in the simulation of various processes, including hydrological processes, due to their flexibility and high accuracy in modeling. However, the results of these methods have always been associated with uncertainty due to several factors such as structure, algorithm, input data, and the type of method chosen for data calibration. One of the methods that can somewhat solve this problem is the uncertainty analysis of the predictions made by these models.
 
Materials and Methods
In this study, the uncertainty of the results of artificial neural network (ANN) and support vector machine (SVM) models in predicting the monthly flow of the river has been evaluated. In this research, the time series of the monthly flow of the Ghezelozan River using the data of the Bianlu-Yasaul Stream gauging station in 39 years from 1976 to 2014 was used, where 75% and 25% of the data was used for training and testing the models, respectively. In these models, to estimate the monthly flow of the Ghezelozan River, six different input combinations including the flow of one, two, and three months before and the number of months of the flow were used. Then, the accuracy and performance of the models were compared using the coefficient of determination (R) and root mean square of errors (RMSE). Finally, the uncertainty of the results of ANN and SVM models in predicting the monthly flow of the river was evaluated by the Monte-Carlo method using d-factor and 95PPU values.
 
Results and Discussion
The evaluation of the performance of the ANN model shows that the best performance is related to the combination where the flow of the previous two months and the number of the month of the flow are the inputs of the model so that R and RMSE indexes were obtained as 0.757 and 9.45, respectively. In the SVM model for the monthly river flow series, the best performance is related to the combination where the flow of one, two, and three months ago and the number of months of the flow were the inputs of the model, and the R and RMSE indexes for this input pattern were 0.729 and 10.946, respectively. After studying the uncertainty of the models, the results showed that the ANN model has more uncertainty in the output values compared to the SVM model, and this is while the d-factor of the ANN model, both in the training and test phase, it was more than the SVM model. The comparison of the uncertainty analysis of the results of the ANN and SVM models showed that the SVM model with d-factor and 95PPU values equal to 0.155 and 17.241, respectively, compared to the ANN model with d-factor and 95PPU values equal to 0.993 and 85.470, respectively, has less uncertainty in the output values. So the number of observation data placed in the 95% confidence range (95PPU) of the ANN model compared to the SVM model has increased significantly in both the training and testing phases. Also, the results showed that both models have more uncertainty in the months with a large volume of water, which can be due to the complexity of the process and the involvement of uncertain factors in these months, as well as the effect of factors that are not considered in the structure of predictive models.
Conclusion
The results of ANN and SVM models in predicting the monthly flow of the Ghezelozan River showed that although the ANN model with R-value equal to 0.757 and RMSE value equal to 9.45 has a good performance compared to the SVM model with R-value equal to 0.729 and RMSE value equal to 10.946 in predicting the river flow, the results of this model are associated with high uncertainty. The comparison of the uncertainty analysis of the results of ANN and SVM models by Monte-Carlo method showed that the SVM model with d-factor and 95PPU values equal to 0.155 and 17.241, respectively, compared to the ANN model with d-factor and 95PPU values equal to 0.993 and 85.470, respectively, has less uncertainty in predicting the monthly flow of the Ghezelozan River and it is better than ANN model. According to the results of this research, taking into account the fact that advanced artificial intelligence models also have uncertainty, it is necessary to apply these methods in the fields of risk management and future planning to obtain the best performance.

Keywords

Main Subjects


References
Abbaspour, K.C., Yang, J., Maximov, I., Siber, R., Bogner, K., Mieleitner, J., Zobrist, J., & Srinivasan, R. (2007). Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT. Journal of Hydrology, 333(2-4), 413-430. doi:10.1016/j.jhydrol.2006.09.014
Adamowski, J. (2013). Using support vector regression to predict direct runoff, base flow and total flow in a mountainous watershed with limited data in Uttaranchal, India. Annals of Warsaw University of Life Sciences-SGGW. Land Reclamation, 45(1), 71-83. doi: 10.2478/sggw-2013-0007
Ahmadi, F., Radmanesh, F., & Mirabbasi Najafabadi, R. (2016). Comparing the performance of support vector machines and bayesian networks in predicting daily river flow (Case study: Baranduz Chai River). Journal of Water and Soil Conservation, 22(6), 171-186 dor: 20.1001.1.23222069.1394.22.6.10.2. [In Persian]
Akbarzadeh, A., Noori, R., Farokhnia, A., Khakpour, A., & Sabahi, M.S. (2010). Accuracy and uncertainty analysis of intelligent techniques for predicting the longitudinal dispersion coefficient in rivers. Journal of Water and Wastewater, 21(3), 99-107. [In Persian]
Allahverdipour, P., & Sattari, M.T. (2023). Comparing the performance of the multiple linear regression classic method and modern data mining methods in annual rainfall modeling (Case study: Ahvaz city). Water and Soil Management and Modeling, 3(2), 125-142 doi:10.22098/mmws.2022.11337.1120. [In Persian]
Aqil, M., Kita, I., Yano, A., & Nishiyama, S. (2007). Analysis and prediction of flow from local source in a river basin using a Neuro-fuzzy modeling tool. Journal of Environmental Management, 85(1), 215-223. doi:10.1016/j.jenvman.2006.09.009
Asefa, T., Kemblowski, M., McKee, M., & Khalil, A. (2006). Multi-time scale stream flow predictions: The support vector machines approach. Journal of Hydrology, 318(1-4), 7-16. doi:10.1016/j.jhydrol.2005.06.001
Dibike, Y.B., Velickov, S., Solomatine, D., & Abbott, M.B. (2001). Model induction with support vector machines: introduction and applications. Journal of Computing in Civil Engineering, 15(3), 208-216. doi: 10.1061/(ASCE)0887-3801(2001)15:3(208)
Eckhardt, K., Breuer, L., & Frede, H.G. (2003). Parameter uncertainty and the significance of simulated land use change effects. Journal of Hydrology, 273(1-4), 164-176. doi: 10.1016/S0022-1694(02)00395-5
Eskandari, A., Noori, R., Vesali Naseh, M.R., & Saeedi, F. (2019). Uncertainty evaluation of ANN and ANFIS Models in Inflow forecasting into the Raees-Ali Delvari Dam. Journal of Environmental Science and Technology, 21(7), 31-47. doi:10.22034/jest.2020.20068.2909 [In Persian]
Etminan, S., Jalali, V.R., Mamodabadi, M., Khashei-siuki, A., Pourreza-bilondi, M. (2023). Assessing the hydraulic parameter’s uncertainty of the HYDRUS model using DREAM method. Water and Soil Management and Modeling, 3(4), 1-15. doi:10.22098/mmws.2022.11659.1152 [In Persian]
Farokhnia, A., & Morid, S. (2010). Uncertainty analysis of artificial neural networks and neuro-fuzzy models in river flow forecasting. Iran-Water Resources Research, 5(3), 14-27. [In Persian]
Ghorbani, M.A., Zadeh, H.A., Isazadeh, M., & Terzi, O. (2016). A comparative study of artificial neural network (MLP, RBF) and support vector machine models for river flow prediction. Environmental Earth Sciences, 75(6), 1-14. doi: 10.1007/s12665-015-5096-x
Jain, S.K., Das, A., & Srivastava, D.K. (1999). Application of ANN for reservoir inflow prediction and operation. Journal of Water Resources Planning and Management, 125(5), 263-271. doi:10.1061/(ASCE)0733-9496(1999)125:5(263)
Kavzoglu, T., & Colkesen, I. (2009). A kernel functions analysis for support vector machines for land cover classification. International Journal of Applied Earth Observation and Geoinformation, 11(5), 352-359. doi:10.1016/j.jag.2009.06.002
Luk, K.C., Ball, J.E., & Sharma, A. (2000). A study of optimal model lag and spatial inputs to artificial neural network for rainfall forecasting. Journal of Hydrology, 227(1-4), 56-65. doi: 10.1016/S0022-1694(99)00165-1
Marcé, R., Comerma, M., García, J.C., & Armengol, J. (2004). A neuro‐fuzzy modeling tool to estimate fluvial nutrient loads in watersheds under time‐varying human impact. Limnology and Oceanography: Methods, 2(11), 342-355. doi:10.4319/lom.2004.2.342
Mehdizadeh Zare Anari, Y., Najafzadeh, M., & Anvari, S. (2023). The Uncertainty Analysis of Artificial Intelligence Models in Forecasting River Flow (Case Study: Karun River). Amirkabir Journal of Civil Engineering, 54(11), 4383-4396. doi:10.22060/ceej.2022.21208.7653 [In Persian]
Noori, R., Yeh, H.D., Abbasi, M., Kachoosangi, F. T., & Moazami, S. (2015). Uncertainty analysis of support vector machine for online prediction of five-day biochemical oxygen demand. Journal of Hydrology, 527, 833-843. doi:10.1016/j.jhydrol.2015.05.046
Pai, P.F., & Hong, W.C. (2007). A recurrent support vector regression model in rainfall forecasting. Hydrological Processes, 21(6), 819-827. doi:10.1002/hyp.6323
Rezaei Moghadam, M.H., Sarvati, M.R., & Asghari Sareskanroodi, S. (2011). Compared Investigation Meandering Pattern by Fractal GeometricalAnalysis and Central Angles and Sinuosity Ratio Indices. Journal of Watershed Management, 2(3), 1-18. [In Persian]
Riahi, H., & Ayyoubzadeh, S.A. (2010). Uncertainty analysis of ANN and ANFIS techniques in comparison with regime equations for determining regime channel geometry. In Fifth International Conference on Water Resources and Environment Research, Quebec City, Canada.
Seefi, A., Mirlatifi, M., & Reahi, H. (2013). Introduction and application of least square support vector machine (LSSVM) for simlulation of reference evapotranspiration and uncertainty analysis of results, A case study of the Kerman City. Irrigation and Water Engineering, 4(1), 67-79. [In Persian]
Efron, B., & Tibshirani, R.J. (1994). An Introduction to the Bootstrap. 1st Edition: Chapman and Hall/CRC, 456 pages. doi:10.1201/9780429246593
Vapnik, V.N. (1995). The Nature of Statistical Learning Theory. 1st Edition: Springer, New York, NY, 188 pages. doi:10.1007/978-1-4757-2440-0
Wu, J.S., Han, J., Annambhotla, S., & Bryant, S. (2005). Artificial neural networks for forecasting watershed runoff and stream flows. Journal of Hydrologic Engineering, 10(3), 216-222. doi:10.1061/(ASCE)1084-0699(2005)10:3(216)