HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation

He Wang; Srinath Sridhar; Jingwei Huang; Julien Valentin; Shuran Song; Leonidas J. Guibas

Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation

Abstract

The goal of this paper is to estimate the 6D pose and dimensions of unseen object instances in an RGB-D image. Contrary to "instance-level" 6D pose estimation tasks, our problem assumes that no exact object CAD models are available during either training or testing time. To handle different and unseen object instances in a given category, we introduce a Normalized Object Coordinate Space (NOCS)---a shared canonical representation for all possible object instances within a category. Our region-based neural network is then trained to directly infer the correspondence from observed pixels to this shared object representation (NOCS) along with other object information such as class label and instance mask. These predictions can be combined with the depth map to jointly estimate the metric 6D pose and dimensions of multiple objects in a cluttered scene. To train our network, we present a new context-aware technique to generate large amounts of fully annotated mixed reality data. To further improve our model and evaluate its performance on real data, we also provide a fully annotated real-world dataset with large environment and instance variation. Extensive experiments demonstrate that the proposed method is able to robustly estimate the pose and size of unseen object instances in real environments while also achieving state-of-the-art performance on standard 6D pose estimation benchmarks.

Code Repositories

edavalosanaya/FastPoseCNN
pytorch
Mentioned in GitHub
ykzzyk/vision6d
Mentioned in GitHub
sahithchada/NOCS_PyTorch
pytorch
Mentioned in GitHub
interactivegl/vision6d
pytorch
Mentioned in GitHub
lh641446825/NOCS_2019CVPR
tf
Mentioned in GitHub
hughw19/NOCS_CVPR2019
Official
tf
Mentioned in GitHub
pairlab/6pack
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
6d-pose-estimation-using-rgbd-on-camera25NOCS (128 bins)
mAP 10, 10cm: 62.2
mAP 10, 5cm: 61.7
mAP 3DIou@25: 91.4
mAP 3DIou@50: 85.3
mAP 5, 5cm: 38.8
6d-pose-estimation-using-rgbd-on-real275NOCS (128 bins)
mAP 10, 10cm: 26.7
mAP 10, 5cm: 26.7
mAP 3DIou@25: 84.9
mAP 3DIou@50: 80.5
mAP 5, 5cm: 9.5

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation | Papers | HyperAI