As shown in this test video, the trained model struggles to accurately detect keypoints for gestures like pinching and swiping. However, it performs well in identifying keypoints on open hands, whether facing forward or backward.
The hand keypoint dataset is split into two subsets:
Train: This subset contains 18,776 images from the hand keypoints dataset, annotated for training pose estimation models.
Val: This subset contains 7992 images that can be used for validation purposes during model training.
Mosaiced Images
This image demonstrates a training batch composed of mosaiced dataset images. Mosaicing is a technique used during training that combines multiple images into a single image to increase the variety of objects and scenes within each training batch. This helps improve the model’s ability to generalize to different object sizes, aspect ratios, and contexts. - Ultralytics
Citations and Acknowledgments
Big thanks to @IsaacTheDev for letting me use his 4070 for training <3
If you use the hand-keypoints dataset in your research or development work, please acknowledge the following sources:
@article{afifi201911kHands,
title = {11K Hands: gender recognition and biometric identification using a large dataset of hand images},
author = {Afifi, Mahmoud},
journal = {Multimedia Tools and Applications},
doi = {10.1007/s11042-019-7424-8},
url = {https://doi.org/10.1007/s11042-019-7424-8},
year = {2019}
}
@misc{imsparsh2020gesture,
title = {Gesture Recognition Dataset},
author = {Sparsh, Imsparsh},
year = {2020},
url = {https://www.kaggle.com/datasets/imsparsh/gesture-recognition}
}
@misc{giridhar2020hand,
title = {2000 Hand Gestures},
author = {Giridhar, Ritika},
year = {2020},
url = {https://www.kaggle.com/datasets/ritikagiridhar/2000-hand-gestures}
}
YOLO11n-pose Hand Keypoint Detection
As shown in this test video, the trained model struggles to accurately detect keypoints for gestures like pinching and swiping. However, it performs well in identifying keypoints on open hands, whether facing forward or backward.
Run your own tests with this model
Clone the repository
Install dependencies (just need
Ultralyticsand its dependencies)Run the webcam test on your machine
Note: Test videos saved to
/runs/pose/output/test/<#>.aviModel Metrics
Download the trained
best.ptfile hereHow this model was trained
Dataset
This model was trained on the “Hand Keypoint Dataset 26K” made by Rion Dsilva
The hand keypoint dataset is split into two subsets:
Mosaiced Images
Citations and Acknowledgments
Big thanks to @IsaacTheDev for letting me use his 4070 for training <3
If you use the hand-keypoints dataset in your research or development work, please acknowledge the following sources: