Command Palette
Search for a command to run...
Hyeonwoo Noh; Andre Araujo; Jack Sim; Tobias Weyand; Bohyung Han

Abstract
We propose an attentive local feature descriptor suitable for large-scale image retrieval, referred to as DELF (DEep Local Feature). The new feature is based on convolutional neural networks, which are trained only with image-level annotations on a landmark image dataset. To identify semantically useful local features for image retrieval, we also propose an attention mechanism for keypoint selection, which shares most network layers with the descriptor. This framework can be used for image retrieval as a drop-in replacement for other keypoint detectors and descriptors, enabling more accurate feature matching and geometric verification. Our system produces reliable confidence scores to reject false positives---in particular, it is robust against queries that have no correct match in the database. To evaluate the proposed descriptor, we introduce a new large-scale dataset, referred to as Google-Landmarks dataset, which involves challenges in both database and query such as background clutter, partial occlusion, multiple landmarks, objects in variable scales, etc. We show that DELF outperforms the state-of-the-art global and local descriptors in the large-scale setting by significant margins. Code and dataset can be found at the project webpage: https://github.com/tensorflow/models/tree/master/research/delf .
Code Repositories
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| image-retrieval-on-oxf105k | DELF+FT+ATT+DIR+QE | MAP: 88.5% |
| image-retrieval-on-oxf105k | DELF+FT+ATT | MAP: 82.6% |
| image-retrieval-on-oxf5k | DELF+FT+ATT | MAP: 83.8% |
| image-retrieval-on-oxf5k | DELF+FT+ATT+DIR+QE | MAP: 90.0% |
| image-retrieval-on-par106k | DELF+FT+ATT+DIR+QE | mAP: 92.8% |
| image-retrieval-on-par106k | DELF+FT+ATT | mAP: 81.7% |
| image-retrieval-on-par6k | DELF+FT+ATT+DIR+QE | mAP: 95.7% |
| image-retrieval-on-par6k | DELF+FT+ATT | mAP: 85.0% |
| image-retrieval-on-roxford-hard | DELF–ASMK*+SP | mAP: 43.1 |
| image-retrieval-on-roxford-hard | DELF–HQE+SP | mAP: 50.3 |
| image-retrieval-on-roxford-medium | DELF–ASMK*+SP | mAP: 67.8 |
| image-retrieval-on-roxford-medium | DELF–HQE+SP | mAP: 73.4 |
| image-retrieval-on-rparis-hard | DELF–ASMK*+SP | mAP: 55.4 |
| image-retrieval-on-rparis-hard | DELF–HQE+SP | mAP: 69.3 |
| image-retrieval-on-rparis-medium | DELF–ASMK*+SP | mAP: 76.9 |
| image-retrieval-on-rparis-medium | DELF–HQE+SP | mAP: 84.0 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.