Evaluating the performance of flexible discriminant analysis model in predicting the flooding potential of the Zarrineh-Rood watershed, Iran

Document Type : Research/Original/Regular Article

Authors

1 Assistant Professor, Soil Conservation and Watershed Management Research Department, Kurdistan Agricultural and Natural Resources Research and Education Center, AREEO, Sanandaj, Iran

2 Assistant Professor, Research Department of Natural Resources, Golestan Agricultural and Natural Resources Research and Education Center, AREEO, Gorgan, Iran

3 Assistant Professor, Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran

4 Assistant Professor, Research Institute of Forests and Rangelands, AREEO, Tehran, Iran

5 Professor, Soil Conservation and Watershed Management Research Department, Kurdistan Agricultural and Natural Resources Research and Education Center, AREEO, Sanandaj, Iran

Abstract

Introduction

Floods cause financial losses and countless lives in the world every year. Identifying flood-prone areas is one of the basic steps in flood management. In the past, careful observation and note-taking of the mechanism of occurrence and the natural course of the cause and effect of the effective factors that led to the final occurrence of the phenomenon in a chain manner helped to understand the pattern and process of the occurrence to some extent. Because the processes of creating floods are numerous and affected by various factors and the flood phenomenon is multidimensional and dynamic, all-natural, human, and organizational-management factors together affect the occurrence, intensity, extent, and continuity of floods. So far, many efforts have been made to use data-mining models and artificial intelligence in the field of spatial prediction of floods. The models try to estimate the distribution of the flood phenomenon better and more accurately by examining the relationships between each flood event (dependent factor) and the set of underlying and stimulating factors (independent factors) and fitting them to educational evidences. Since the applicability of the flexible discriminant analysis model has not been fully investigated in the field of flood susceptibility prediction, this research quantitatively evaluated its performance using real-world flood data.

Materials and Methods

Based on the availability of periodic history of flood events, the Zarrineh-Rood watershed of Kurdistan province, Iran, was chosen as the study area. It was tried to select the factors based on different criteria such as familiarity with the process of flood inundation, ease of data preparation, having the most spatial variability at the regional level (not uniform), and containing the most information for the model should be selected to separate areas with different levels of flood susceptibility. Thirteen diverse geo-environmental factors including elevation, aspect, slope percent, land use, drainage density, lithology, plan curvature, profile curvature, mean annual precipitation, soil texture, stream power index, distance from stream, and topographic wetness index were used as independent variables. The maps of elevation, aspect, slope percent, plan and profile curvatures, stream power index and topographic wetness index were produced using a digital elevation model. The datasets of precipitation were obtained from the Weather Organization of Kurdistan province, and subsequently, the mean annual precipitation map was created using interpolation methods. The soil texture and land use maps were obtained from Natural Resources and Watershed Management Organization and their validity was tested during field surveys. Hydrological layers including distance from stream and drainage density were produced using the stream network layer. The location of the flooding events was also collected as the dependent variable. The spatial data of flooding were randomly divided into two groups of training and validation with a ratio of 70:30. After running the model (i.e., Flexible Discriminant Analysis) based on the training group, the flood susceptibility map was produced. The validation of the model results was conducted using the area under the receiver operating characteristic curve (AUROC) and the true skill statistic (TSS) metrics.

Results and Discussion

The results indicated that the FDA model with the value of AUROC = 0.96 and TSS = 0.86 efficiently and accurately produced the flood susceptibility map. The flexible nature of the model in the selection of regression equations, as well as the possibility of weighting, and determining the priority of the evidence of presence over the evidence of absence, are among the special capabilities of the FDA model, which many machine learning models lack. Using probability distribution estimation algorithms in the model is very important and can not only extract the hidden spatial pattern of occurrence from a set of data but also help to predict flood-prone areas in data-scarce watersheds. Based on the results, about 14% (62 thousand ha) of the study area was categorized in the high and very high flood susceptibility zones, which include the northern, northwestern, and southeastern areas. Spatial analysis of the flood susceptibility map showed that in total 25897 ha (18.12%) of agricultural lands, 343 ha (50.91%) of garden lands, and 2126 ha (39.93%) of residential areas located in high and very high susceptible zones. The flood susceptibility map can be used as a basis for planning flood control and management measures.

Conclusion

The findings of this study proved that the flexible discriminant analysis model provides the possibility of processing diverse and big geo-environmental data to predict the flood susceptibility of watersheds and it had a high efficiency in this context. The results of this research showed that a significant area of the Zarineh-Rood watershed had a high and very high flood potential, which was characterized by the interaction of low slope and flat areas, formations and soils with low penetration and dense drainage network, and more importantly, flood-prone areas located in the northern, northwestern and southeastern parts of the watershed. The situation of the flood probability of the Zarineh-Rood watershed has been determined and managers and decision-makers must put the critical areas in the priority of flood management programs.

Keywords

Main Subjects



Articles in Press, Accepted Manuscript
Available Online from 23 June 2023
  • Receive Date: 09 June 2023
  • Revise Date: 22 June 2023
  • Accept Date: 23 June 2023