Human Action Recognition Using YOLOv11 Ultralytics: A Comprehensive Study for Real-Time Applications
DOI: https://doi.org/10.56147/aaiet.1.2.14

Keywords: Human action recognition, YOLOv11, Ultralytics, Deep learning, Computer vision, Real-time detection, Surveillance, Healthcare

Abstract
Human action recognition (HAR) is a pivotal task in computer vision, with applications in surveillance, healthcare, robotics, and human-computer interaction. This study presents a novel framework for HAR using the YOLOv11 model by Ultralytics, a state-of-the-art object detection architecture optimized for real-time performance. We trained and evaluated the model on a custom dataset comprising 18 distinct human actions, captured in indoor environments using fisheye cameras. The actions range from everyday activities (e.g., walking, sitting) to specialized tasks (e.g., patient on stretcher, patient on wheelchair). Our results show that YOLOv11 achieves an overall mean Average Precision (mAP@0.5) of 0.401, with particularly strong per-class results for actions such as "cleaning" (mAP@0.5: 0.760), "searching" (mAP@0.5: 0.695), and "patient on wheelchair" (mAP@0.5: 0.995). We provide an in-depth analysis of the model's training metrics, bounding box distributions, precision-recall curves, F1-confidence curves, recall-confidence curves, and confusion matrices. Additionally, we present extensive qualitative results to demonstrate the model's robustness in real-world scenarios. A comparison with existing methods, such as two-stream CNNs and Transformer-based models, highlights YOLOv11's superior balance of accuracy and speed, making it a promising solution for real-time HAR applications. This study also discusses the model's limitations and outlines directions for future research, paving the way for enhanced action recognition systems.
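For readers who wish to reproduce a comparable pipeline, the sketch below shows how a YOLOv11 detector can be fine-tuned and evaluated with the Ultralytics Python API. It is a minimal illustration under stated assumptions, not the authors' released code: the dataset configuration file ("har_fisheye.yaml"), the checkpoint choice ("yolo11n.pt"), the sample image name, and all hyperparameters are hypothetical placeholders.

from ultralytics import YOLO

# Load a pretrained YOLOv11 detection checkpoint as the starting point.
model = YOLO("yolo11n.pt")

# Fine-tune on the 18-class action dataset; "har_fisheye.yaml" is a
# hypothetical dataset config listing train/val image paths and the
# 18 action class names. Hyperparameters here are illustrative only.
model.train(data="har_fisheye.yaml", epochs=100, imgsz=640, batch=16)

# Evaluate on the validation split; metrics.box.map50 corresponds to
# the mAP@0.5 figure reported in the abstract.
metrics = model.val()
print(f"mAP@0.5: {metrics.box.map50:.3f}")

# Run inference on a new frame (hypothetical file name) and print the
# per-box class labels, which in this setup encode recognized actions.
results = model("hallway_frame.jpg")
for box in results[0].boxes:
    print(results[0].names[int(box.cls)], float(box.conf))

Because each action is annotated as a detection class, recognition reduces to standard object detection, which is what lets the model retain YOLO's real-time inference speed.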