Fine-To Self-Attention Graph Convolutional Network for Skeleton-Based Action Recognition
Loading...

Date
2026
Authors
Kilic, Ugur
Karadag, Ozge Oztimur
Ozyer, Gulsah Tumuklu
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Open Access Color
OpenAIRE Downloads
OpenAIRE Views
Abstract
Skeleton data has become an important modality in action recognition due to its robustness to environmental changes, computational efficiency, compact structure, and privacy-oriented nature. With the rise of deep learning, many methods for action recognition using skeleton data have been developed. Among these methods, spatial-temporal graph convolutional networks (ST-GCNs) have seen growing popularity due to the suitability of skeleton data for graph-based modeling. However, ST-GCN models use fixed graph topologies and fixed-size spatial-temporal convolution kernels. This limits their ability to model coordinated movements of joints in different body regions and long-term spatial-temporal dependencies. To address these limitations, we propose a fine-to-coarse self-attention graph convolutional network (FCSA-GCN). Our approach employs a fine-to-coarse scaling strategy for multi-scale feature extraction. This strategy effectively models both local and global spatial temporal relationships and better represents the interactions among joint groups in different body regions. By integrating a temporal self-attention mechanism (TSA) into the multi-scale feature extraction process, we enhance the model's ability to capture long-term temporal dependencies effectively. Additionally, during training, we employ the dynamic weight averaging (DWA) approach to ensure balanced optimization across the multi-scale feature extraction stages. Comprehensive experiments conducted on the NTU-60, NTU-120, and NW-UCLA datasets demonstrate that FCSA-GCN outperforms state-of-the-art methods. These results highlight that the proposed approach effectively addresses the current challenges in skeleton-based action recognition (SBAR).
Description
Kilic, Ugur/0000-0003-4092-3785;
ORCID
Keywords
Fine-to-Coarse Approach, Graph Convolutional Networks, Multi-Scale, Skeleton-Based Action Recognition, Skeletal Data, Temporal Self-Attention, Multi-Scale Feature Extraction Stages. Comprehensive Experiments Conducted on the NTU-60, NTU-120, multi-scale feature extraction stages. Comprehensive experiments conducted on the NTU-60, Skeleton-based action recognition, Fine-to-coarse approach, Skeletal data, Temporal self-attention, NTU-120, Graph convolutional networks, Multi-scale
Fields of Science
Citation
WoS Q
Q1
Scopus Q
Q1

OpenCitations Citation Count
N/A
Source
Applied Soft Computing
Volume
186
Issue
Start Page
114268
End Page
PlumX Metrics
Citations
Scopus : 0
Google Scholar™


