Optimizing machine learning to reduce crop classification uncertainty in semi-arid Canal command areas

نوع مقاله : Special Issue: New Approaches to Water and Soil Management and Modeling

نویسندگان

1 Ph.D. Scholar, Department of Water Resources and Ocean Engineering, National Institute of Technology Karnataka, Surathkal, Mangaluru-575025, Karnataka, India

2 Scientist ‘C’, National Institute of Hydrology, Hard Rock Regional Centre, Visvesvaraya Nagar, Belagavi-590019, Karnataka, India

3 Professor, Department of Water Resources and Ocean Engineering, National Institute of Technology Karnataka, Surathkal, Mangaluru-575025, Karnataka, India

چکیده

Effective water resource management and canal performance analysis in semi-arid regions are fundamentally reliant on precise estimates of crop water demand, which are typically derived from accurate, up-to-date crop inventories. For command areas in semi-arid India, this vital information is a primary input for agro-hydrological models used to assess irrigation efficiency and plan water allocation. However, the inherent complexity of these landscapes characterized by small, fragmented landholdings introduces substantial uncertainty into remote sensing based crop classification, threatening the reliability of subsequent management decisions. This study systematically addresses this input uncertainty by performing a comprehensive, multi-factorial sensitivity analysis using multi-temporal Sentinel-1 (SAR) and Sentinel-2 (optical) data. We investigated the combined effects of four multi-sensor data fusion strategies, six Machine Learning (ML) classifiers, three feature selection techniques, and five training/testing data split ratios. The findings of the study provide crucial operational insights for modelers and managers. The synergistic fusion of Sentinel-1 and Sentinel-2 data was identified as the single most critical factor for achieving high accuracy. Furthermore, classification performance showed high sensitivity to training data volume, with an optimal threshold observed at an 80/20 train/test split. The Extreme Gradient Boosting (XGBoost) classifier, coupled with Backward Elimination feature selection, emerged as the superior strategy, achieving a maximum overall accuracy of 98.4%. By identifying this optimized workflow, this research provides a robust and scalable method for generating highly reliable spatial input data, thereby minimizing uncertainty in crop water requirement calculations and significantly enhancing the predictive capacity and practical utility of agro-hydrological models for sustainable water management.

کلیدواژه‌ها

موضوعات


Abhilash, R., Basappa, V., Chakragiri, S. V., & Patankar, D. B. (2022). Geospatial Approach for Integrated Command Area Management. Journal of Irrigation and Drainage Engineering, 148(4), 1–11. doi:10.1061/(asce)ir.1943-4774.0001659
Ahmed, A. A., Sayed, S., Abdoulhalik, A., Moutari, S., & Oyedele, L. (2024). Applications of machine learning to water resources management: A review of present status and future opportunities. Journal of Cleaner Production, 441, 140715. doi: 10.1016/j.jclepro.2024.140715
Amarasinghe, U. A., Sikka, A., Mandave, V., Panda, R. K., Gorantiware, S., Chandrasekharana, K., & Ambast, S. K. (2021). A re-look at canal irrigation system performance: a pilot study of the Sina irrigation system in Maharashtra, India. Water Policy, 23, 114–129. doi:10.2166/wp.2020.291
Banda, L., Rai, A., Kansal, A., & Vashisth, A. K. (2023). Suitable Crop Prediction based on affecting parameters using Naïve Bayes Classification Machine Learning Technique. 2023 International Conference on Disruptive Technologies, ICDT 2023, 43–46. doi:10.1109/ICDT57929.2023.10150814
Biau, G., & Scornet, E. (2016). A random forest guided tour. Test, 25(2), 197–227. doi:10.1007/s11749-016-0481-7
Chang, Z., Li, H., Chen, D., Liu, Y., Zou, C., Chen, J., Han, W., Liu, S., & Zhang, N. (2023). Crop Type Identification Using High-Resolution Remote Sensing Images Based on an Improved DeepLabV3+ Network. Remote Sensing, 15(21), 5088. doi:10.3390/rs15215088
Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 13-17-August-2016, 785–794. doi:10.1145/2939672.2939785
Congalton, R. G., & Green, K. (2019). Assessing the Accuracy of Remotely Sensed Data. CRC Press. doi:10.1201/9780429052729
Eisfelder, C., Boemke, B., Gessner, U., Sogno, P., Alemu, G., Hailu, R., Mesmer, C., & Huth, J. (2024). Cropland and Crop Type Classification with Sentinel-1 and Sentinel-2 Time Series Using Google Earth Engine for Agricultural Monitoring in Ethiopia. Remote Sensing, 16(5), 866. doi:10.3390/rs16050866
ESA. (2024). Copernicus open access hub. https://scihub.copernicus.eu/
Gao, S., Tang, B. H., Huang, L., & Chen, G. (2023). Identification of tea plantations in typical plateau areas with the combination of Sentinel-1/2 optical and radar remote sensing data based on feature selection algorithm. International Journal of Remote Sensing, 7033–7053. doi:10.1080/01431161.2023.2198655
Gitelson, A. A., Kaufman, Y. J., & Merzlyak, M. N. (1996). Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sensing of Environment, 58(3), 289–298. doi:10.1016/S0034-4257(96)00072-7
GSARS. (2017). Handbook on remote sensing for agricultural statistics. In Handbook on Remote Sensing for Agricultural Statistics. GSARS Handbook. https://openknowledge.fao.org/server/api/core/bitstreams/dcadd248-cccb-43ad-9e3c-7aa0cf2bb577/content
He, S., Peng, P., Chen, Y., & Wang, X. (2022). Multi-Crop Classification Using Feature Selection-Coupled Machine Learning Classifiers Based on Spectral, Textural and Environmental Features. Remote Sensing, 14. doi:10.3390/rs14133153
Hoppe, H., Dietrich, P., Marzahn, P., Weiß, T., Nitzsche, C., Freiherr von Lukas, U., Wengerek, T., & Borg, E. (2024). Transferability of Machine Learning Models for Crop Classification in Remote Sensing Imagery Using a New Test Methodology: A Study on Phenological, Temporal, and Spatial Influences. Remote Sensing, 16(9), 1493. doi:10.3390/rs16091493
Huete, A., Didan, K., Miura, T., Rodriguez, E. P., Gao, X., & Ferreira, L. G. (2002). Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sensing of Environment, 83(1–2), 195–213. doi:10.1016/S0034-4257(02)00096-2
Kabolizadeh, M., Rangzan, K., & Habashi, K. (2023). Improving classification accuracy for separation of area under crops based on feature selection from multi-temporal images and machine learning algorithms. Advances in Space Research, 72(11), 4809–4824. doi: 10.1016/j.asr.2023.09.044
Kumar, M. D., Sahasranaman, M., Verma, M. S., Kumar, S., & Narayanamoorthy, A. (2021). Getting the irrigation statistics right. International Journal of Water Resources Development. doi:10.1080/07900627.2021.1921711
Liu, N., Zhao, Q., Williams, R., & Barrett, B. (2024). Enhanced crop classification through integrated optical and SAR data: a deep learning approach for multi-source image fusion. International Journal of Remote Sensing, 45(19–20), 7605–7633. doi:10.1080/01431161.2023.2232552
Mandal, D., Kumar, V., Ratha, D., Dey, S., Bhattacharya, A., Lopez-Sanchez, J. M., McNairn, H., & Rao, Y. S. (2020). Dual polarimetric radar vegetation index for crop growth monitoring using sentinel-1 SAR data. Remote Sensing of Environment, 247. doi: 10.1016/j.rse.2020.111954
Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to Information Retrieval. Cambridge University Press. doi:10.1017/CBO9780511809071
Maponya, M. G., van Niekerk, A., & Mashimbye, Z. E. (2020). Pre-harvest classification of crop types using a Sentinel-2 time-series and machine learning. Computers and Electronics in Agriculture, 169. doi: 10.1016/j.compag.2019.105164
Maxwell, A. E., Warner, T. A., & Fang, F. (2018). Implementation of machine-learning classification in remote sensing: an applied review. International Journal of Remote Sensing, 39(9), 2784–2817. doi:10.1080/01431161.2018.1433343
Mirzaei, A., Bagheri, H., & Khosravi, I. (2023). Enhancing Crop Classification Accuracy through Synthetic SAR-Optical Data Generation Using Deep Learning. ISPRS International Journal of Geo-Information, 12(11). doi:10.3390/ijgi12110450
Montgomery, D. C., Peck, E. A., & Vining, G. G. (2012). Introduction to Linear Regression Analysis (5th ed.). John Wiley & Sons. https://www.kwcsangli.in/uploads/3--Introduction_to_Linear_Regression_Analysis__5th_ed._Douglas_C._Montgomery__Elizabeth_A._Peck__and_G._.pdf
Nigar, A., Li, Y., Jat Baloch, M. Y., Alrefaei, A. F., & Almutairi, M. H. (2024). Comparison of machine and deep learning algorithms using Google Earth Engine and Python for land classifications. Frontiers in Environmental Science, 12. doi:10.3389/fenvs.2024.1378443
Ok, A. O., Akar, O., & Gungor, O. (2012). Evaluation of random forest method for agricultural crop classification. European Journal of Remote Sensing, 45(1), 421–432. doi:10.5721/EuJRS20124535
Pal, M., & Mather, P. M. (2003). An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sensing of Environment, 86(4), 554–565. doi:10.1016/S0034-4257(03)00132-9
Qader, S. H., Dash, J., Alegana, V. A., Khwarahm, N. R., Tatem, A. J., & Atkinson, P. M. (2021). The Role of Earth Observation in Achieving Sustainable Agricultural Production in Arid and Semi-Arid Regions of the World. Remote Sensing, 13(17), 3382. doi:10.3390/rs13173382
Raja, S. P., Sawicka, B., Stamenkovic, Z., & Mariammal, G. (2022). Crop Prediction Based on Characteristics of the Agricultural Environment Using Various Feature Selection Techniques and Classifiers. IEEE Access, 10, 23625–23641. doi:10.1109/ACCESS.2022.3154350
Rajaput, M., Abhilash, R., & Dodamani, B. M. (2025). A systematic review of performance assessment in canal irrigation systems: Integrating socio-technical, remote sensing, and AI-driven approaches for a climate-resilient future. Water and Soil Management and Modeling, 5(4), 254–276. doi:10.22098/mmws.2025.18343.1683
Ramezan, C. A. (2022). Transferability of Recursive Feature Elimination (RFE)-Derived Feature Sets for Support Vector Machine Land Cover Classification. Remote Sensing, 14. doi:10.3390/rs14246218
Rouse, J. W., Haas, R. H., Schell, J. A., Deering, D. W., & Harlan, J. C. (1974). Monitoring the Vernal Advancement of Retrogradation (Greenwave Effect) of Natural Vegetation. https://ntrs.nasa.gov/api/citations/19750020419/downloads/19750020419.pdf
Rusňák, T., Kasanický, T., Malík, P., Mojžiš, J., Zelenka, J., Sviček, M., Abrahám, D., & Halabuk, A. (2023). Crop Mapping without Labels: Investigating Temporal and Spatial Transferability of Crop Classification Models Using a 5-Year Sentinel-2 Series and Machine Learning. Remote Sensing, 15(13), 3414. doi:10.3390/rs15133414
Saadatfar, H., Khosravi, S., Joloudari, J. H., Mosavi, A., & Shamshirband, S. (2020). A new k-nearest neighbors’ classifier for big data based on efficient data pruning. Mathematics, 8(2). doi:10.3390/math8020286
Schölkopf, B., & Smola, A. J. (2001). Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. The MIT Press. doi:10.7551/mitpress/4175.001.0001
Shao, Z., Ahmad, M. N., & Javed, A. (2024). Comparison of Random Forest and XGBoost Classifiers Using Integrated Optical and SAR Features for Mapping Urban Impervious Surface. Remote Sensing, 16(4). doi:10.3390/rs16040665
Solomatine, D. P., & Ostfeld, A. (2008). Data-driven modelling: some past experiences and new approaches. Journal of Hydroinformatics, 10(1), 3–22. doi:10.2166/hydro.2008.015
Song, Q., Xiang, M., Hovis, C., Zhou, Q., Lu, M., Tang, H., & Wu, W. (2019). Object-based feature selection for crop classification using multi-temporal high-resolution imagery. International Journal of Remote Sensing, 40(5–6), 2053–2068. doi:10.1080/01431161.2018.1475779
Uçar, M. K., Nour, M., Sindi, H., & Polat, K. (2020). The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets. Mathematical Problems in Engineering, 2020. doi:10.1155/2020/2836236
Vizzari, M., Lesti, G., & Acharki, S. (2024). Crop classification in Google Earth Engine: leveraging Sentinel-1, Sentinel-2, European CAP data, and object-based machine-learning approaches. Geo-Spatial Information Science. doi:10.1080/10095020.2024.2341748
Yadav, S., Sharma, S., & Chaudhary, P. (2024). Review on Advancement in Machine Learning and Deep Learning Techniques for Crop Classification. In Advancements in Communication and Systems (pp. 593–610). Soft Computing Research Society. doi:10.56155/978-81-955020-7-3-53
Yao, J., Wu, J., Xiao, C., Zhang, Z., & Li, J. (2022). The Classification Method Study of Crops Remote Sensing with Deep Learning, Machine Learning, and Google Earth Engine. Remote Sensing, 14(12), 2758. doi:10.3390/rs14122758
Zhang, N., Chen, M., Yang, F., Yang, C., Yang, P., Gao, Y., Shang, Y., & Peng, D. (2022). Forest Height Mapping Using Feature Selection and Machine Learning by Integrating Multi-Source Satellite Data in Baoding City, North China. Remote Sensing, 14(18). doi:10.3390/rs14184434