Multi-modal fusion enhances activity recognition by integrating RGBD and skeletal data
Abstract
This paper proposes a multi-modal fusion framework for human activity recognition (HAR). The approach combines three modalities, namely RGB frames, depth maps, and 3D skeleton joint positions, to build a robust HAR system. Two 3D-CNN models with different network parameters and an LSTM model extract features from these modalities. Next, per-activity scores are obtained from an SVM for each model, and the scores are fused and optimized using two evolutionary algorithms. Experiments on a public dataset validate the proposed approach: the results show that the framework improves on previous work and accurately recognizes human activities.
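The abstract does not specify how the evolutionary optimization interacts with the SVM scores; one plausible reading is score-level fusion with one learned weight per modality. The sketch below illustrates that reading only. It assumes each modality's SVM yields an (N samples x C classes) score matrix, and since the paper does not name its two evolutionary algorithms, SciPy's differential evolution stands in here; all data and names are hypothetical.

```python
# Minimal sketch: evolutionary search over per-modality fusion weights.
# This is an illustrative stand-in, not the paper's actual method.
import numpy as np
from scipy.optimize import differential_evolution

def fuse(weights, score_mats):
    """Weighted sum of per-modality class-score matrices."""
    return sum(w * s for w, s in zip(weights, score_mats))

def neg_accuracy(weights, score_mats, labels):
    """Objective to minimize: negative accuracy of the fused prediction."""
    preds = fuse(weights, score_mats).argmax(axis=1)
    return -np.mean(preds == labels)

# Hypothetical validation data: 3 modalities (RGB, depth, skeleton),
# 200 samples, 10 activity classes, with a mild bias toward the true class.
rng = np.random.default_rng(0)
labels = rng.integers(0, 10, size=200)
score_mats = [rng.random((200, 10)) + 0.5 * np.eye(10)[labels]
              for _ in range(3)]

# Search one fusion weight per modality in [0, 1].
result = differential_evolution(
    neg_accuracy, bounds=[(0.0, 1.0)] * 3,
    args=(score_mats, labels), seed=0)

print("fusion weights:", result.x, "fused accuracy:", -result.fun)
```

In practice the weights would be tuned on a held-out validation split and the fused scores evaluated on a separate test set, so the sketch's single-split objective is a simplification.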