Fine-To Self-Attention Graph Convolutional Network for Skeleton-Based Action Recognition

Loading...
Publication Logo

Date

2026

Authors

Kilic, Ugur
Karadag, Ozge Oztimur
Ozyer, Gulsah Tumuklu

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Open Access Color

OpenAIRE Downloads

OpenAIRE Views

Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

Skeleton data has become an important modality in action recognition due to its robustness to environmental changes, computational efficiency, compact structure, and privacy-oriented nature. With the rise of deep learning, many methods for action recognition using skeleton data have been developed. Among these methods, spatial-temporal graph convolutional networks (ST-GCNs) have seen growing popularity due to the suitability of skeleton data for graph-based modeling. However, ST-GCN models use fixed graph topologies and fixed-size spatial-temporal convolution kernels. This limits their ability to model coordinated movements of joints in different body regions and long-term spatial-temporal dependencies. To address these limitations, we propose a fine-to-coarse self-attention graph convolutional network (FCSA-GCN). Our approach employs a fine-to-coarse scaling strategy for multi-scale feature extraction. This strategy effectively models both local and global spatial temporal relationships and better represents the interactions among joint groups in different body regions. By integrating a temporal self-attention mechanism (TSA) into the multi-scale feature extraction process, we enhance the model's ability to capture long-term temporal dependencies effectively. Additionally, during training, we employ the dynamic weight averaging (DWA) approach to ensure balanced optimization across the multi-scale feature extraction stages. Comprehensive experiments conducted on the NTU-60, NTU-120, and NW-UCLA datasets demonstrate that FCSA-GCN outperforms state-of-the-art methods. These results highlight that the proposed approach effectively addresses the current challenges in skeleton-based action recognition (SBAR).

Description

Kilic, Ugur/0000-0003-4092-3785;

Keywords

Fine-to-Coarse Approach, Graph Convolutional Networks, Multi-Scale, Skeleton-Based Action Recognition, Skeletal Data, Temporal Self-Attention, Multi-Scale Feature Extraction Stages. Comprehensive Experiments Conducted on the NTU-60, NTU-120, multi-scale feature extraction stages. Comprehensive experiments conducted on the NTU-60, Skeleton-based action recognition, Fine-to-coarse approach, Skeletal data, Temporal self-attention, NTU-120, Graph convolutional networks, Multi-scale

Fields of Science

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
N/A

Source

Applied Soft Computing

Volume

186

Issue

Start Page

114268

End Page

PlumX Metrics
Citations

Scopus : 0

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.0

Sustainable Development Goals