Command Palette
Search for a command to run...
H2ONet: Hand-Occlusion-and-Orientation-Aware Network for Real-Time 3D Hand Mesh Reconstruction
{Chi-Wing Fu Xiao Tang Tianyu Wang Hao Xu}

Abstract
Real-time 3D hand mesh reconstruction is challenging, especially when the hand is holding some object. Beyond the previous methods, we design H2ONet to fully exploit non-occluded information from multiple frames to boost the reconstruction quality. First, we decouple hand mesh reconstruction into two branches, one to exploit finger-level non-occluded information and the other to exploit global hand orientation, with lightweight structures to promote real-time inference. Second, we propose finger-level occlusion-aware feature fusion, leveraging predicted finger-level occlusion information as guidance to fuse finger-level information across time frames. Further, we design hand-level occlusion-aware feature fusion to fetch non-occluded information from nearby time frames. We conduct experiments on the Dex-YCB and HO3D-v2 datasets with challenging hand-object occlusion cases, manifesting that H2ONet is able to run in real-time and achieves state-of-the-art performance on both the hand mesh and pose precision. The code will be released on GitHub.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| 3d-hand-pose-estimation-on-dexycb | H2ONet | Average MPJPE (mm): 14.0 MPVPE: 13.0 PA-MPVPE: 5.5 PA-VAUC: 89.1 Procrustes-Aligned MPJPE: 5.70 VAUC: 76.2 |
| 3d-hand-pose-estimation-on-ho-3d | H2ONet | AUC_J: 0.829 AUC_V: 0.828 F@15mm: 0.966 F@5mm: 0.570 PA-MPJPE (mm): 8.5 PA-MPVPE: 8.6 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.