Remote Sensing Scene Image Classification with Swin Transformer-Based Transfer Learning


Kanca Gülsoy E., Gülsoy T., Baykal Kablan E., Ayas S.

2025 9th International Symposium on Innovative Approaches in Smart Technologies (ISAS), Gaziantep, Türkiye, 27 - 28 Haziran 2025, ss.1-6, (Tam Metin Bildiri)

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Doi Numarası: 10.1109/isas66241.2025.11101758
  • Basıldığı Şehir: Gaziantep
  • Basıldığı Ülke: Türkiye
  • Sayfa Sayıları: ss.1-6
  • Karadeniz Teknik Üniversitesi Adresli: Evet

Özet

In remote sensing classification problems, high visual similarity between scenes reduces the classification performance of traditional methods. Therefore, advanced deep neural network models are preferred over traditional methods for classifying these images. In this study, the classification performance of the Swin Transformer model, a deep learning method that has attracted considerable attention in recent years, is comprehensively evaluated on two datasets widely used in remote sensing and compared with popular studies in the literature. Thanks to the hierarchical and window-based self-attention mechanism of the Swin Transformer model, the model effectively learns both global and local contextual information from the images and improves the scene classification performance of the model. In order to obtain reliable and generalisable results, 5-fold cross-validation method was applied to the dataset. The results showed that the Swin Transformer model achieved impressive results with 96.52% and 95.39% accuracy rates, outperforming advanced models such as classical CNN, Ensemble and GAN-based models.