Evaluating the performance of the flexible discriminant analysis model in predicting the flooding potential of the Zarrineh-Rood Watershed

Document Type : Research/Original/Regular Article

Authors

1 Assistant Professor, Soil Conservation and Watershed Management Research Department, Kurdistan Agricultural and Natural Resources Research and Education Center, AREEO, Sanandaj, Iran

2 Assistant Professor, Research Department of Natural Resources, Golestan Agricultural and Natural Resources Research and Education Center, AREEO, Gorgan, Iran

3 Assistant Professor, Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran

4 Assistant Professor, Research Institute of Forests and Rangelands, AREEO, Tehran, Iran

5 Professor, Soil Conservation and Watershed Management Research Department, Kurdistan Agricultural and Natural Resources Research and Education Center, AREEO, Sanandaj, Iran

Abstract

Introduction
Floods cause financial losses and countless lives in the world every year. Identifying flood-prone areas is one of the basic steps in flood management. In the past, careful observation and note-taking of the mechanism of occurrence and the natural course of the cause and effect of the effective factors that led to the final occurrence of the phenomenon in a chain manner helped to understand the pattern and process of the occurrence to some extent. Because the processes of creating floods are numerous and affected by various factors and the flood phenomenon is multidimensional and dynamic, all-natural, human, and organizational-management factors affect the occurrence, intensity, extent, and continuity of floods. So far, many efforts have been made to use data-mining models and artificial intelligence in the spatial prediction of floods. The models try to better and more accurately estimate the distribution of the flood phenomenon by examining the relationships between each flood event (dependent factor) and the set of underlying and stimulating factors (independent factors) and fitting them to educational evidence. Since the applicability of the flexible discriminant analysis model has not been fully investigated in the field of flood susceptibility prediction, this research quantitatively evaluated its performance using real-world flood data.
 
Materials and Methods
Based on the availability of periodic history of flood events, the Zarrineh-Rood Watershed of Kurdistan Province, Iran, was chosen as the study area. It was tried to select the factors based on different criteria such as familiarity with the process of flood inundation, ease of data preparation, having the most spatial variability at the regional level (not uniform), and containing the most information for the model should be selected to separate areas with different levels of flood susceptibility. Thirteen diverse geo-environmental factors including elevation, aspect, slope percent, land use, drainage density, lithology, plan curvature, profile curvature, mean annual precipitation, soil texture, stream power index, distance from the stream, and topographic wetness index were used as independent variables. The maps of elevation, aspect, slope percent, plan and profile curvatures, stream power index, and topographic wetness index were produced using a digital elevation model. Hydrological layers including distance from the stream and drainage density were produced using the stream network layer. The location of the flooding events was also collected as the dependent variable. The spatial data of flooding were randomly divided into two groups of training and validation with a ratio of 70:30. After running the model (i.e., Flexible Discriminant Analysis) based on the training group, the flood susceptibility map was produced. The validation of the model results was conducted using the area under the receiver operating characteristic curve (AUROC) and the true skill statistic (TSS) metrics.
 
Results and Discussion
The results indicated that the FDA model with the value of AUROC= 0.96 and TSS= 0.86 efficiently and accurately produced the flood susceptibility map. The flexible nature of the model in the selection of regression equations, as well as the possibility of weighting, and determining the priority of the evidence of presence over the evidence of absence, are among the special capabilities of the FDA model, which many machine learning models lack. Using probability distribution estimation algorithms in the model is very important and can not only extract the hidden spatial pattern of occurrence from a set of data but also help to predict flood-prone areas in data-scarce Watersheds. Based on the results, about 14% (62 thousand ha) of the study area was categorized in the high and very high flood susceptibility zones, which include the northern, northwestern, and southeastern areas. Spatial analysis of the flood susceptibility map showed that in total 25897 ha (18.12%) of agricultural lands, 343 ha (50.91%) of garden lands, and 2126 ha (39.93%) of residential areas located in high and very high susceptible zones. Considering the successfulness of the FDA model in goodness-of-fit and validation phases, the flood susceptibility map can be used as a basis for planning flood control and management measures.
 
Conclusion
The findings of this study proved that the flexible discriminant analysis model provides the possibility of processing diverse and big geo-environmental data to predict the flood susceptibility of Watersheds and it had a high efficiency in this context. There is a lot of spatial correspondence between the vegetation status map and the flood susceptibility map; in such a way that the places that had a high flood susceptibility degree, their upstream areas were generally destroyed in terms of vegetation. The results of this research showed that a significant area of the Zarineh-Rood Watershed had a high and very high flood potential, which was characterized by the interaction of low slope and flat areas, formations and soils with low penetration and dense drainage network, and more importantly, flood-prone areas located in the northern, northwestern and southeastern parts of the Watershed. The situation of the flood probability of the Zarineh-Rood Watershed has been determined and managers and decision-makers must put the critical areas in the priority of flood management programs. More flood-driver factors are suggested to be used as predictor variables in flood susceptibility modeling in future studies. On the other hand, it is very important to determine the role of predictor variables in the flood susceptibility degree at the Watershed scale, which can be investigated in future research.

Keywords

Main Subjects


Abedi, R., Costache, R., Shafizadeh-Moghadam, H., & Pham, Q.B. (2022). Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees. Geocarto International37(19), 5479-5496. doi:10.1080/10106049.2021.1920636
Allouche, O., Tsoar, A., & Kadmon, R. (2006). Assessing the accuracy of species distribution models: prevalence, kappa and the true skill statistic (TSS). Journal of Applied Ecology43(6), 1223-1232. doi:10.1111/j.1365-2664.2006.01214.x
Arabameri, A., Seyed Danesh, A., Santosh, M., Cerda, A., Chandra Pal, S., Ghorbanzadeh, O., & Chowdhuri, I. (2022). Flood susceptibility mapping using meta-heuristic algorithms. Geomatics, Natural Hazards and Risk13(1), 949-974. doi:10.1080/19475705.2022.2060138
Arora, A., Pandey, M., Siddiqui, M.A., Hong, H., & Mishra, V.N. (2021). Spatial flood susceptibility prediction in Middle Ganga Plain: comparison of frequency ratio and Shannon’s entropy models. Geocarto International36(18), 2085-2116. doi:10.1080/10106049.2019.1687594
Barati, Gh., Bodagh Jamali, J., & Maleki, N. (2012). Anticyclones and heavy rainfalls over Western Iran. Physical Geography Research Quarterly, 44(2), 85-98. doi:10.22059/jphgr.2012.29208. [In Persian]
Chapi, K., Singh, V.P., Shirzadi, A., Shahabi, H., Tien Bui, D., Pham, B.T., & Khosravi. K. (2017). A novel hybrid artificial intelligence approach for flood susceptibility assessment. Environmental Modelling & Software95, 229-245. doi:10.1016/j.envsoft.2017.06.012
Derex, M. (2022). Human cumulative culture and the exploitation of natural phenomena. Philosophical Transactions of the Royal Society B377(1843), 20200311. doi:10.1098/rstb.2020.0311
Habibi, A., Delavar, M.R., Sadeghian, M.S., & Nazari, B. (2023). Flood susceptibility mapping and assessment using regularized random forest and NAÏVE bayes algorithms. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences10, 241-248. doi:10.5194/isprs-annals-X-4-W1-2022-241-2023
Hallgren, W., Santana, F., Low-Choy, S., Zhao, Y., & Mackey, B. (2019). Species distribution models can be highly sensitive to algorithm configuration. Ecological Modelling408, 108719. doi:10.1016/j.ecolmodel.2019.108719
Hastie, T., Tibshirani, R., Friedman, J.H., & Friedman, J.H. (2009). The elements of statistical learning: data mining, inference, and prediction (Vol. 2, pp. 1-758). New York: springer. doi:10.1007/978-0-387-21606-5
Hong, H., Tsangaratos, P., Ilia, I., Liu, J., Zhu, A.X., & Chen, W. (2018). Application of fuzzy weight of evidence and data mining techniques in construction of flood susceptibility map of Poyang County, China. Science of the Total Environment625, 575-588. doi:10.1016/j.scitotenv.2017.12.256
Karim, F., Armin, M.A., Ahmedt-Aristizabal, D., Tychsen-Smith, L., & Petersson, L. (2023). A review of hydrodynamic and machine learning approaches for flood inundation modeling. Water15(3), 566. doi:10.3390/w15030566
Kazemi, M., & Jafarpoor, A. (2022). Identifying the threshold of variables affecting flood zone using machine learning technique (Case study: Karun basin). Water and Soil Management and Modeling,  doi:10.22098/mmws.2023.12285.1220. [In Persian]
Moazzam, M.F.U., Lee, B.G., Rahman, A.U., Farid, N., & Rahman, G. (2020). Spatio-statistical analysis of flood susceptibility assessment using bivariate model in the floodplain of river swat, district charsadda, Pakistan. Journal of Geoscience and Environment Protection8(5), 159. doi:10.4236/gep.2020.85010
Newson, M., Lewin, J., & Raven, P. (2022). River science and flood risk management policy in England. Progress in Physical Geography: Earth and Environment46(1), 105-123. doi:0.1177/03091333211036384
Pontius Jr, R.G., & Schneider, L.C. (2001). Land-cover change model validation by an ROC method for the Ipswich watershed, Massachusetts, USA. Agriculture, Ecosystems & Environment85(1-3), 239-248. doi:10.1016/S0167-8809(01)00187-6
Pourghasemi, H.R., Pouyan, S., Bordbar, M., Golkar, F., & Clague, J.J. (2023). Flood, landslides, forest fire, and earthquake susceptibility maps using machine learning techniques and their combination. Natural Hazards, 1-20. doi:10.1007/s11069-023-05836-y
Prasad, P., Loveson, V.J., Das, B., & Kotha, M. (2022). Novel ensemble machine learning models in flood susceptibility mapping. Geocarto International37(16), 4571-4593. doi:10.1080/10106049.2021.1892209
Rahmati, O., Pourghasemi, H.R., & Zeinivand, H. (2016). Flood susceptibility mapping using frequency ratio and weights-of-evidence models in the Golastan Province, Iran. Geocarto International31(1), 42-70. doi:10.1080/10106049.2015.1041559
Rahmati, O., Tahmasebipour, N., Haghizadeh, A., Pourghasemi, H.R., & Feizizadeh, B. (2017). Evaluation of different machine learning models for predicting and mapping the susceptibility of gully erosion. Geomorphology, 298, 118-137. doi: 10.1016/j.geomorph.2017.09.006
Rajabizadeh, Y., Ayyoubzadeh, S.A., & Zahiri, A. (2019). Flood survey of Golestan Province in 2018-2019 and providing solutions for its control and management in the future. Ecohydrology, 6(4), 921-942 doi:10.22059/ije.2019.283004.1137. [in Persian]
Samanta, S., Pal, D.K., & Palsamanta, B. (2018). Flood susceptibility analysis through remote sensing, GIS and frequency ratio model. Applied Water Science8(2), 66. doi:10.1007/s13201-018-0710-1
Seleem, O., Ayzel, G., de Souza, A.C.T., Bronstert, A., & Heistermann, M. (2022). Towards urban flood susceptibility mapping using data-driven models in Berlin, Germany. Geomatics, Natural Hazards and Risk13(1), 1640-1662. doi:10.1080/19475705.2022.2097131
Shafapour Tehrany, M., & Kumar, L. (2018). The application of a Dempster–Shafer-based evidential belief function in flood susceptibility mapping and comparison with frequency ratio and logistic regression methods. Environmental Earth Sciences77, 1-24. doi:10.1007/s12665-018-7667-0
Shafapour Tehrany, M., Shabani, F., Neamah Jebur, M., Hong, H., Chen, W., & Xie, X. (2017). GIS-based spatial prediction of flood prone areas using standalone frequency ratio, logistic regression, weight of evidence and their ensemble techniques. Geomatics, Natural Hazards and Risk8(2), 1538-1561. doi:10.1080/19475705.2017.1362038
Tajbakhsh, S.M., & Chezgi, J. (2022). Prioritization of flooding sub-basins in the north of the Birjand Plain using morphometric factors and VIKOR model. Water and Soil Management and Modeling doi:10.22098/mmws.2022.11855.1179. [In Persian]
Vafaei, M., Dastorani, M.T., & Rostami Khalaj, M. (2023). Flood risk assessment in campus of Ferdowsi University of Mashhad and presentation management scenarios using HEC-RAS model. Water and Soil Management and Modeling, 3(3), 225-239. doi:10.22098/mmws.2022.11815.1173. [In Persian]
Wubalem, A., Tesfaw, G., Dawit, Z., Getahun, B., Mekuria, T., & Jothimani, M. (2020). Comparison of statistical and analytical hierarchy process methods on flood susceptibility mapping: in a case study of Tana sub-basin in northwestern Ethiopia. Natural Hazards and Earth System Sciences Discussions, 1-43. doi:10.1515/geo-2020-0329
Youssef, A.M., Pourghasemi, H.R., & El-Haddad, B. A. (2022). Advanced machine learning algorithms for flood susceptibility modeling—performance comparison: Red Sea, Egypt. Environmental Science and Pollution Research29(44), 66768-66792. doi: 10.1007/s11356-022-20213-1
Youssef, A.M., Pourghasemi, H.R., Mahdi, A.M., & Matar, S.S. (2023). Flood vulnerability mapping and urban sprawl suitability using FR, LR, and SVM models. Environmental Science and Pollution Research30(6), 16081-16105. doi:10.1007/s11356-022-23140-3