A Spatial–Spectral Transformer for Hyperspectral Image Classification Based on Global Dependencies of Multi-Scale Features

Ma, Yunxuan and Lan, Yan and Xie, Yakun and Yu, Lanxin and Chen, Chen and Wu, Yusong and Dai, Xiaoai (2024) A Spatial–Spectral Transformer for Hyperspectral Image Classification Based on Global Dependencies of Multi-Scale Features. Remote Sensing, 16 (2). p. 404. ISSN 2072-4292

[thumbnail of remotesensing-16-00404.pdf] Text
remotesensing-16-00404.pdf - Published Version

Download (9MB)

Abstract

Vision transformers (ViTs) are increasingly utilized for HSI classification due to their outstanding performance. However, ViTs encounter challenges in capturing global dependencies among objects of varying sizes, and fail to effectively exploit the spatial–spectral information inherent in HSI. In response to this limitation, we propose a novel solution: the multi-scale spatial–spectral transformer (MSST). Within the MSST framework, we introduce a spatial–spectral token generator (SSTG) and a token fusion self-attention (TFSA) module. Serving as the feature extractor for the MSST, the SSTG incorporates a dual-branch multi-dimensional convolutional structure, enabling the extraction of semantic characteristics that encompass spatial–spectral information from HSI and subsequently tokenizing them. TFSA is a multi-head attention module with the ability to encode attention to features across various scales. We integrated TFSA with cross-covariance attention (CCA) to construct the transformer encoder (TE) for the MSST. Utilizing this TE to perform attention modeling on tokens derived from the SSTG, the network effectively simulates global dependencies among multi-scale features in the data, concurrently making optimal use of spatial–spectral information in HSI. Finally, the output of the TE is fed into a linear mapping layer to obtain the classification results. Experiments conducted on three popular public datasets demonstrate that the MSST method achieved higher classification accuracy compared to state-of-the-art (SOTA) methods.

Item Type: Article
Subjects: Lib Research Guardians > Multidisciplinary
Depositing User: Unnamed user with email support@lib.researchguardians.com
Date Deposited: 23 Jan 2024 06:26
Last Modified: 20 Jul 2024 05:34
URI: http://eprints.classicrepository.com/id/eprint/2579

Actions (login required)

View Item
View Item