8 months ago

3D Machine Vision

Computer Vision

Image Understanding

Computer Vision

Shreyas Hampali Mahdi Rad Markus Oberweger Vincent Lepetit

Abstract

We propose a method for annotating images of a hand manipulating an objectwith the 3D poses of both the hand and the object, together with a datasetcreated using this method. Our motivation is the current lack of annotated realimages for this problem, as estimating the 3D poses is challenging, mostlybecause of the mutual occlusions between the hand and the object. To tacklethis challenge, we capture sequences with one or several RGB-D cameras andjointly optimize the 3D hand and object poses over all the framessimultaneously. This method allows us to automatically annotate each frame withaccurate estimates of the poses, despite large mutual occlusions. With thismethod, we created HO-3D, the first markerless dataset of color images with 3Dannotations for both the hand and object. This dataset is currently made of77,558 frames, 68 sequences, 10 persons, and 10 objects. Using our dataset, wedevelop a single RGB image-based method to predict the hand pose wheninteracting with objects under severe occlusions and show it generalizes toobjects not seen in the dataset.

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp

8 months ago

3D Machine Vision

Computer Vision

Image Understanding

Computer Vision

Shreyas Hampali Mahdi Rad Markus Oberweger Vincent Lepetit

Abstract

We propose a method for annotating images of a hand manipulating an objectwith the 3D poses of both the hand and the object, together with a datasetcreated using this method. Our motivation is the current lack of annotated realimages for this problem, as estimating the 3D poses is challenging, mostlybecause of the mutual occlusions between the hand and the object. To tacklethis challenge, we capture sequences with one or several RGB-D cameras andjointly optimize the 3D hand and object poses over all the framessimultaneously. This method allows us to automatically annotate each frame withaccurate estimates of the poses, despite large mutual occlusions. With thismethod, we created HO-3D, the first markerless dataset of color images with 3Dannotations for both the hand and object. This dataset is currently made of77,558 frames, 68 sequences, 10 persons, and 10 objects. Using our dataset, wedevelop a single RGB image-based method to predict the hand pose wheninteracting with objects under severe occlusions and show it generalizes toobjects not seen in the dataset.

Source PDF View Code

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Powered by MailChimp