Identifying the threshold of variables affecting flood zones using machine learning technique (Case study: the downstream region of the Karun River)

Document Type : Research/Original/Regular Article

Authors

1 Assistant Professor/, Department of Hormoz Studies and Research Center, University of Hormozgan, Bandarabaas, Iran

2 Ph.D./, Department of Watershed Management Engineering, Faculty of Natural Resources and Marine Sciences, Tarbiat Modares University, Noor, Iran

Abstract

Abstract
Introduction
Machine learning is a new artificial intelligence method that seeks to write a program with the best performance by using learning experience. Machine learning models with different algorithms can be predictive or descriptive or have both properties and be used in different fields. On the other hand, for better management of flood risk reduction, it is necessary to know the effective factors in each region and flood sensitivity analysis. Since so far, few researchers have analyzed the threshold of influence of variables affecting the occurrence of floods using machine learning methods, the current research is new in this respect. Based on this, the current research has been conducted to identify the threshold of variables affecting the zoning of flooded areas using machine learning and remote sensing data in the Karun Basin area. The results can be put on the agenda of the relevant managers in identifying the influence limits of different variables on the occurrence of floods and the management of flood-sensitive areas by relying on the effective limits of the variables in the study area.
 
Materials and Methods
Landsat OLI 8 images on April 8, 2019 were used to identify flooded areas. In this regard, to identify groundwater, the corresponding image of the previous year of the region was used to separate and identify groundwater zones. Then, the remaining pixels of the study area as whole samples and flooded areas were entered into the modelling process as target samples. Therefore, flooded areas with a code of one and other areas with a code of zero entered the modelling process as dependent variables. Also, the variables that were entered as independent variables in the machine learning process include actual evaporation and transpiration, land use, soil density mass, soil clay percentage, soil water deficit, DEM, NDVI, land cover index, Palmer drought severity index, potential evaporation and transpiration, precipitation. cumulative, soil sand percentage, soil texture, soil moisture, minimum and maximum temperature. Next, by entering these variables and performing the machine learning process, the models were evaluated and TreeNet was selected as the best model. Then the threshold of each of the studied components on flood zones was obtained from machine learning. Also, in the present study, learning and test data were used in a ratio of 70% to 30% and completely randomly. It is worth noting that the number of 200 trees with at least six nodes was set for modelling.
 
Results and Discussion
Different components have certain thresholds at the beginning of land flooding so regarding vegetation as the most important effective factor in flood zoning, it shows that the lack of vegetation causes flooding, and the higher the level of vegetation, the more it prevents flooding. Also, the cumulative precipitation threshold for flooding the studied area was 15 mm of rainfall, and less than that, the incoming rainfall did not pose a risk of flooding the studied lands. The amount of 15.5 mm of rainfall was the turning point and the threshold of the beginning of the flooding in the study area. Regarding the soil moisture deficiency index, it shows that the threshold of flooding based on this index was 144, in other words when the soil moisture profile is more than the mentioned value, the incoming precipitation must compensate for the soil moisture deficiency, and as a result, floods will be prevented. On the other hand, most flooding conditions have existed at a height of 16 m, and as the height increases, the risk of flooding the studied area decreases, so that there is a failure at a height of 19 m, and when the height reaches 22.5 m, the risk Flooding disappears, and at a height higher than 26 m, flooding is restrained and will reach a steady state. The reason for this can be the plainness of the studied area and the widening of the flood zone in the plain.
 
Conclusion
The results showed that the components of the vegetation cover index, cumulative precipitation, soil water deficit, Palmer drought index, height, and surface soil moisture respectively had the greatest effect on the flooding of the studied area. Also, in the studied area, the effect of soil sand percentage, soil clay percentage, soil density, potential evaporation and transpiration, slope direction, maximum daily temperature, and soil texture on flood zoning was insignificant. The evaluation of the efficiency of the model with the indicators of ROC, specificity, sensitivity, and overall accuracy is 0.95, 91.2, 90.43, and 91.12, respectively, which indicates accuracy. The results of flood zoning with the ground reality indicated R2 and MAE equal to 72.8% and 0.27%, which confirms the accuracy of the zoning results with the ground reality relatively well. The analysis of the results shows that there will be an increased risk of flooding in the wetland and swamp areas due to the high humidity and water level. The results of the present research can be used by planners and managers of natural hazards to reduce floods.

Keywords

Main Subjects


References
Abdi, M., Nohtani, M., Dehghani, M., & Khaksefidi, A. (2023). Determining the changes in flood potential caused by drought periods in the Dehak watershed of south Khorasan province. Water and Soil Management and Modeling. 3(1), 149-164. doi:10.22098/MMWS.2022.11296.1118. [In Persian]
Ahlmer, A.K., Cavalli, M., Hansson, K., Koutsouris, A.J., Crema, S., & Kalantari, Z. (2018). Soil moisture remote-sensing applications for identification of flood-prone areas along transport infrastructure. Environmental Earth Sciences77, 1-17.‏ doi:10.1007/s12665-018-7704-z
Al-Areeq, A.M., Abba, S.I., Yassin, M.A., Benaafi, M., Ghaleb, M., & Aljundi, I.H. (2022). Computational machine learning approach for flood susceptibility assessment integrated with remote sensing and GIS techniques from Jeddah, Saudi Arabia. Remote Sensing14(21), 5515.‏ doi:10.3390/rs14215515
Avand, M., Janizadeh, S., Bui, D.T., Pham, V.H., Ngo, T.T.,  & Nhu, V. (2020a). A tree-based intelligence ensemble approach for spatial prediction of potential groundwater. International Journal Digital Earth, 13(12), 1–22. doi:10.1080/17538947.2020.1718785
Avand, M.T., Moradi, H.R., Ramazanzadeh, M. )2020b(. Flood susceptibility mapping using random forest machine learning and generalized bayesian linear model. Environment and Water Engineering. 6(1), 83-95. doi:10.22034/
jewe.2020.220593.1351
. [In Persian]
Chakrabortty, R., Chandra Pal, S., Rezaie, F., Arabameri, A., Lee, S., Roy, P., Saha, A., Chowdhuri, I., & Moayedi, H. (2022). Flash-flood hazard susceptibility mapping in Kangsabati River Basin, India. Geocarto International37(23), 6713-6735.‏ doi:10.1080/10106049.2021.1953618
Congalton, R.G. & Green, K. (2008). Assessing the accuracy of remotely sensed data: principles and practices. CRC Press. 364 p. doi:10.1201/9780429052729
Dodangeh, E., Choubin, B., Eigdir, A.N., Nabipour, N., Panahi, M., Shamshirband, S., & Mosavi, A. (2020). Integrated machine learning methods with resampling algorithms for flood susceptibility prediction. Science of the Total Environment705, 135983.‏ doi:10.1016/j.scitotenv.2019.135983
Elish, M.O. (2009). Improved estimation of software project effort using multiple additive regression trees. Expert Systems with Applications36(7), 10774-10778.‏ doi:10.1016/j.eswa.2009.02.013
Faramarzi, H., Hosseini, S.M., Pourghasemi, H.R., & Farneghi, M. (2020). Flood reduction in urban basins using LID-BMPs in SWMM model and selecting the best option with AHP-TOPSIS (Case study: Golestan area in Semnan). Ecohydrology. 6(4), 1003-1013. doi:10.22059/IJE.2019.285430.1163. [In Persian]
Friedman, J.H. (2002). Stochastic gradient boosting. Computational Statistics & Data Analysis38(4), 367-378.‏ doi:10.1016/S0167-9473(01)00065-2
Friedman, J.H., & Meulman, J.J. (2003). Multiple additive regression trees with application in epidemiology. Statistics in Medicine, 22(9), 1365-1381. doi:10.1002/sim.1501
Ha, M.C., Vu, P.L., Nguyen, H.D., Hoang, T.P., Dang, D.D., Dinh, T.B.H., Serban, G., Rus, I., & Brețcan, P. (2022). Machine learning and remote sensing application for extreme climate evaluation: Example of flood susceptibility in the Hue Province, Central Vietnam region. Water14(10), 1617.‏ doi:10.3390/w14101617
Hong, J., Tamakloe, R., & Park, D. (2020). Application of association rules mining algorithm for hazardous materials transportation crashes on expressway. Accident Analysis & Prevention142, 105497.‏ doi:10.1016/j.aap.2020.105497
Ighile, E.H., Shirakawa, H., & Tanikawa, H. (2022). Application of GIS and machine learning to predict flood areas in Nigeria. Sustainability14(9), 5039.‏ doi:10.3390/su14095039
Iqbal, U., Riaz, M.Z.B., Zhao, J., Barthelemy, J., & Perez, P. (2023). Drones for flood monitoring, mapping and detection: a bibliometric review. Drones7(1), 32.‏ doi:10.3390/drones7010032
Kavianpour, A.H., Jafarian, Z., Ismali, A., & Kavian, A.A. (2014). Effect of vegetation covers on decreasing runoff and soil loss using rainfall simultion in Nesho rangeland, Mazandaran province. Geography and Environmental Planning, 26(2), 179-190. dor: 20.1001.1.20085362.1394.26.2.12.3. [In Persian]
Kazemi, H., Mansouri, N., & Jozi, S.A. (2022).
Flood risk zoning in Nowshahr city using machine learning models
. Journal of Housing and Rural Environment, 40(176), 71-86.‏ doi:10.22034/40.176.7. [In Persian]
Khaldi, L., Elabed, A., & El Khanchoufi, A. (2023). Quantitative assessment of the relative impacts of different factors on flood susceptibility modelling: case study of Fez-Meknes region in Morocco. E3S Web of Conferences, ‏ 364(4), 02005. doi:10.1051/e3sconf/202336402005
Khosravi, K., Nohani, E., Maroufinia,  E., & Pourghasemi, H.R. (2016). A GIS-based flood susceptibility assessment and its mapping in Iran: a comparison between frequency ratio and weights-of-evidence bivariate statistical models with multi-criteria decision-making technique. Natural Hazards, 83(2), 947-987. doi:10.1007/s11069-016-2357-2
Kia, M.B., Pirasteh, S., Pradhan, B., Mahmud, A.R., Sulaiman, W.N.A., Moradi, A., Nor, W., Sulaiman, A., & Moradi, A. (2012). An artificial neural network model for flood simulation using GIS: Johor River Basin, Malaysia. Environmental Earth Sciences, 67, 251-264. doi:10.1007/s12665-011-1504-z
Mo, K.C., & Chelliah, M. (2006). The modified Palmer drought severity index based on the NCEP North American Regional Reanalysis. Journal of Applied Meteorology and Climatology45(10), 1362-1375.‏ doi:10.1175/JAM2402.1
Monteiro, D.S., & Antonio, J.F. (2002). Multiple additive regression trees a methodology for predictive data mining for fraud detection. Naval Postgraduate School Monterey Ca.93PP
Mosavi, A., Ozturk, P., & Chau, K.W. (2018). Flood prediction using machine learning models: Literature review. Water10(11), 1536.‏ doi:10.3390/w10111536
Pierdicca, N., Pulvirenti, L., Chini, M., Guerriero, L., & Ferrazzoli, P. (2010). A fuzzy-logic-based approach for flood detection from Cosmo-SkyMed data. In 2010 IEEE International Geoscience and Remote Sensing Symposium,  Pp. 4796-4798.‏ doi:10.1109/IGARSS.2010.5650903
Reis, A.A., Carvalho, M.C., de Mello, J.M., Gomide, L.R., Ferraz Filho, A.C., & Acerbi Junior, F.W. (2018). Spatial prediction of basal area and volume in Eucalyptus stands using Landsat TM data: an assessment of prediction methods. New Zealand Journal of Forestry Science48(1), 1-17. doi:10.1186/s40490-017-0108-0
Saberi Tanasvan, M.,Ganji Noroozi;, Z., Delghandi, M., Dorostkar, V. (2020). Sensitivity analysis of flood parameters to roughness variation. Journal of Irrigation and Water Engineering, 10(4), 167-180. doi: 10.22125/IWE.2020.110081. [In Persian]
Sayyad, D., Ghazavi, R., & Omidvar, E., (2022). Appropriate urban infrastructure management strategies against floods from the perspective of passive defense using SWOT and QSPM (Case study: Kashan City). Water and Soil Management and Modeling, 2(1), 45-52. doi:10.22098/MMWS.2022.9651.1055. [In Persian].
Shafizadeh-Moghadam, H., Valavi, R., Shahabi, H., Chapi, K., & Shirzadi, A. (2018). Novel forecasting approaches using combination of machine learning and statistical models for flood susceptibility mapping. Journal of Environmental Management217, 1-11.‏ doi:10.1016/j.jenvman.2018.03.089
Shahbazi, S., Bazrafshan, J., Irannejad, P., & Sohrabi, T. (2014). Palmer drought severity index's calibration under the climatic conditions of arid and semiarid regions of the west and south west of Iran. Journal of Water and Soil Conservation, 22(5), 23-44. doi: 20.1001.1.23222069.1394.22.5.2.2. [In Persian]
Solaimani, K., & Davishi, S. (2020). Zoning and monitoring of spring 2019 flood hazard in Khuzestan using landsat-8 data. Ecohydrology. 7(3), 647-662. doi: 10.22059/IJE.2020.302703.1333. [In Persian]
Tajbakhsh Fakhrabadi, S.M., & Chezgi, J. (2022). Prioritization of flooding sub-basins in the north of the Birjand Plain using morphometric factors and VIKOR model. Water and Soil Management and Modeling, 3(3), 240-255. doi:10.22098/MMWS.2022.11855.1179. [In Persian]
Wang, Y., Fang, Z., Hong, H., & Peng, L. (2020). Flood susceptibility mapping using convolutional neural network frameworks. Journal of. Hydrology, 582, 124–482. doi:10.1016/j.jhydrol.2019.124482
Were, K., Bui, D.T., Dick, Ø.B., & Singh, B.R. (2015). A comparative assessment of support vector regression, artificial neural networks, and random forests for predicting and mapping soil organic carbon stocks across an Afromontane landscape. Ecological Indicators52, 394-403.‏ doi:10.1016/j.ecolind.2014.12.028
Yousefi, H., Yonesi, H., Davoudimoghadam, D., Arshia, A., & Shamsi, Z. (2022). Determination of flood potential using CART, GLM and GAM machine learning models. Journal of Irrigation and Water Engineering. 12(4): 84-105. doi: 10.22125/IWE.2022.15068. [In Persian]