Vision Sensor for Automatic Recognition of Human Activities via Hybrid Features and Multi-Class Support Vector Machine

Saleha Kamal; Haifa F Alhasson; Mohammed Alnusayri; Mohammed Alatiyyah; Hanan Aljuaid; Ahmad Jalal; Hui Liu

doi:10.3390/s25010200

Vision Sensor for Automatic Recognition of Human Activities via Hybrid Features and Multi-Class Support Vector Machine

Sensors (Basel). 2025 Jan 1;25(1):200. doi: 10.3390/s25010200.

Authors

Saleha Kamal¹, Haifa F Alhasson², Mohammed Alnusayri³, Mohammed Alatiyyah⁴, Hanan Aljuaid⁵, Ahmad Jalal^{1

6}, Hui Liu⁷

Affiliations

¹ Department of Computer Science, Air University, Islamabad 44000, Pakistan.
² Department of Information Technology, College of Computer, Qassim University, Buraydah 52571, Saudi Arabia.
³ Department of Computer Science, College of Computer and Information Sciences, Jouf University, Sakaka 72388, Saudi Arabia.
⁴ Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Al-Kharj 16278, Saudi Arabia.
⁵ Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia.
⁶ Department of Computer Science and Engineering, College of Informatics, Korea University, Seoul 02841, Republic of Korea.
⁷ Cognitive Systems Lab, University of Bremen, 28359 Bremen, Germany.

Abstract

Over recent years, automated Human Activity Recognition (HAR) has been an area of concern for many researchers due to its widespread application in surveillance systems, healthcare environments, and many more. This has led researchers to develop coherent and robust systems that efficiently perform HAR. Although there have been many efficient systems developed to date, still, there are many issues to be addressed. There are several elements that contribute to the complexity of the task, making it more challenging to detect human activities, i.e., (i) poor lightning conditions; (ii) different viewing angles; (iii) intricate clothing styles; (iv) diverse activities with similar gestures; and (v) limited availability of large datasets. However, through effective feature extraction, we can develop resilient systems for higher accuracies. During feature extraction, we aim to extract unique key body points and full-body features that exhibit distinct attributes for each activity. Our proposed system introduces an innovative approach for the identification of human activity in outdoor and indoor settings by extracting effective spatio-temporal features, along with a Multi-Class Support Vector Machine, which enhances the model's performance to accurately identify the activity classes. The experimental findings show that our model outperforms others in terms of classification, accuracy, and generalization, indicating its efficient analysis on benchmark datasets. Various performance metrics, including mean recognition accuracy, precision, F1 score, and recall assess the effectiveness of our model. The assessment findings show a remarkable recognition rate of around 88.61%, 87.33, 86.5%, and 81.25% on the BIT-Interaction dataset, UT-Interaction dataset, NTU RGB + D 120 dataset, and PKUMMD dataset, respectively.

Keywords: body pose; human activity recognition; human motion analysis; key body points; machine learning; object detectors; spatio-temporal features.

MeSH terms

Algorithms
Human Activities*
Humans
Pattern Recognition, Automated* / methods
Support Vector Machine*

Grants and funding

The APC was funded by the Open Access Initiative of the University of Bremen and the DFG via SuUB Bremen. Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R54), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.