Efficient geospatial mapping of buildings, woodlands, water and roads from aerial imagery using deep learning

PeerJ Comput Sci. 2024 Jun 25:10:e2039. doi: 10.7717/peerj-cs.2039. eCollection 2024.

Abstract

As more aerial imagery becomes readily available, massive volumes of data are being gathered constantly. Several groups can benefit from the data provided by this geographical imagery. However, it is time-consuming to manually analyze each image to gain information on land cover. This research suggests using deep learning methods for precise and rapid pixel-by-pixel classification of aerial imagery for land cover analysis, which would be a significant step forward in resolving this issue. The suggested method has several steps, such as the augmentation and transformation of data, the selection of deep learning models, and the final prediction. The study uses the three most popular deep learning models (Vanilla-UNet, ResNet50 UNet, and DeepLabV3 ResNet50) for the experiments. According to the experimental results, the ResNet50 UNet model achieved an accuracy of 94.37%, the DeepLabV3 ResNet50 model achieved an accuracy of 94.77%, and the Vanilla-UNet model achieved an accuracy of 91.31%. The accuracy, precision, recall, and F1-score of DeepLabV3 and ResNet50 are higher than those of the other two models. The proposed approach is also compared to the existing UNet approach, and the proposed approaches have produced greater probability prediction scores than the conventional UNet model for all classes. Our approach outperforms model DeepLabV3 ResNet50 on aerial image datasets based on the performance.

Keywords: Ariel imagery; Data augmentation; Deep learning; Geospatial information; Land cover.

Grants and funding

This work was supported by the Deanship of Scientific Research at King Khalid University under grant number RGP2/384/45. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.