Pose Estimation for Human Activity Recognition Using Deep Learning on Video Data

Rajdeep Singh Sohal, Assistant Professor, Department of Electronics Technology, Guru Nanak Dev University, Amritsar. Mohabat Pal Singh, Karunjot Singh, Student of B.Tech. (Electronics and Computer Engineering) 8th Semester, Department of Electronics Technology, Guru Nanak Dev University, Amritsar.
  DOI: 10.62796/pijst.2024v1i503   DOI URL: https://doi.org/10.62796/pijst.2024v1i503
Published Date: 17-05-2024 Issue: Vol. 1 No. 5 (2024): May 2024 Published Paper PDF: Download

Abstract- Pose estimation is a critical task in computer vision, aiming to determine the spatial positions and orientations of objects or individuals within an image or video. This paper introduces a novel approach to pose estimation that leverages deep learning techniques to achieve high accuracy and robustness in diverse environments. We propose a multi-stage convolutional neural network (CNN) that refines pose predictions through iterative processing, significantly enhancing the precision of keypoint localization. The network architecture is complemented by a loss function designed to handle occlusions and ambiguous poses, ensuring reliable performance even in complex scenes.