Command Palette
Search for a command to run...
Domen Tabernik Jon Muhovič Matej Urbas Danijel Skočaj

Abstract
Object grasping is a fundamental challenge in robotics and computer vision, critical for advancing robotic manipulation capabilities. Deformable objects, like fabrics and cloths, pose additional challenges due to their non-rigid nature. In this work, we introduce CeDiRNet-3DoF, a deep-learning model for grasp point detection, with a particular focus on cloth objects. CeDiRNet-3DoF employs center direction regression alongside a localization network, attaining first place in the perception task of ICRA 2023's Cloth Manipulation Challenge. Recognizing the lack of standardized benchmarks in the literature that hinder effective method comparison, we present the ViCoS Towel Dataset. This extensive benchmark dataset comprises 8,000 real and 12,000 synthetic images, serving as a robust resource for training and evaluating contemporary data-driven deep-learning approaches. Extensive evaluation revealed CeDiRNet-3DoF's robustness in real-world performance, outperforming state-of-the-art methods, including the latest transformer-based models. Our work bridges a crucial gap, offering a robust solution and benchmark for cloth grasping in computer vision and robotics. Code and dataset are available at: https://github.com/vicoslab/CeDiRNet-3DoF
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| keypoint-detection-on-vicos-towel-dataset | Lisp et al. - RGB (ConvNetx-B) | Best F1: 65.7 |
| keypoint-detection-on-vicos-towel-dataset | YOLOv7 - RGB | Best F1: 48.3 |
| keypoint-detection-on-vicos-towel-dataset | CeDiRNet-3DoF - RGB (ConvNext-B) | Best F1: 78 |
| keypoint-detection-on-vicos-towel-dataset | MaskRCNN - RGB (ResNext101) | Best F1: 68.3 |
| keypoint-detection-on-vicos-towel-dataset | DINO - RGB (ConvNetx-B) | Best F1: 72.7 |
| keypoint-detection-on-vicos-towel-dataset | CeDiRNet-3DoF - RGB (ConvNext-L) | Best F1: 78.4 |
| keypoint-detection-on-vicos-towel-dataset | DeformDETR - RGB (ConvNetx-B) | Best F1: 61.2 |
| keypoint-detection-on-vicos-towel-dataset | CeDiRNet-3DoF - RGB-D (ConvNext-B) | Best F1: 81.4 |
| keypoint-detection-on-vicos-towel-dataset | CeDiRNet-3DoF - RGB-D (ConvNext-L) | Best F1: 80.8 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.