Command Palette
Search for a command to run...
Hampali Shreyas ; Rad Mahdi ; Oberweger Markus ; Lepetit Vincent

Abstract
We propose a method for annotating images of a hand manipulating an objectwith the 3D poses of both the hand and the object, together with a datasetcreated using this method. Our motivation is the current lack of annotated realimages for this problem, as estimating the 3D poses is challenging, mostlybecause of the mutual occlusions between the hand and the object. To tacklethis challenge, we capture sequences with one or several RGB-D cameras andjointly optimize the 3D hand and object poses over all the framessimultaneously. This method allows us to automatically annotate each frame withaccurate estimates of the poses, despite large mutual occlusions. With thismethod, we created HO-3D, the first markerless dataset of color images with 3Dannotations for both the hand and object. This dataset is currently made of77,558 frames, 68 sequences, 10 persons, and 10 objects. Using our dataset, wedevelop a single RGB image-based method to predict the hand pose wheninteracting with objects under severe occlusions and show it generalizes toobjects not seen in the dataset.
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-hand-pose-estimation-on-ho-3d | Hampali et al. | AUC_J: 0.788 AUC_V: 0.790 F@15mm: 0.942 F@5mm: 0.506 PA-MPJPE (mm): 10.7 PA-MPVPE: 10.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.