The 2021 Image Similarity Dataset and Challenge

Abstract

This paper introduces a new benchmark for large-scale image similarity detection. This benchmark is used for the Image Similarity Challenge at NeurIPS'21 (ISC2021). The goal is to determine whether a query image is a modified copy of any image in a reference corpus of size 1 million. The benchmark features a variety of image transformations such as automated transformations, hand-crafted image edits and machine-learning-based manipulations. This mimics real-life cases appearing in social media, for example for integrity-related problems dealing with misinformation and objectionable content. The strength of the image manipulations, and therefore the difficulty of the benchmark, is calibrated according to the performance of a set of baseline approaches. Both the query and reference set contain a majority of "distractor" images that do not match, which corresponds to a real-life needle-in-haystack setting, and the evaluation metric reflects that. We expect the DISC21 benchmark to promote image copy detection as an important and challenging computer vision task and refresh the state of the art. Code and data are available at https://github.com/facebookresearch/isc2021
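
To make the task in the abstract concrete, the following is a minimal sketch of descriptor-based copy detection. It is not the benchmark's official code or any of its baselines. It assumes each image has already been reduced to a fixed-length descriptor vector, compares descriptors by cosine similarity with a brute-force scan, and uses an illustrative threshold of 0.9 to reject distractor queries. The names `l2_normalize` and `find_copies`, the toy vectors, and the threshold are all hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def find_copies(queries, references, threshold=0.9):
    """Return one (reference_index_or_None, best_score) pair per query.

    The index is None when the best cosine similarity falls below the
    threshold, i.e. the query is treated as a distractor with no match.
    The threshold value is an assumption chosen for illustration only.
    """
    refs = [l2_normalize(r) for r in references]
    results = []
    for q in queries:
        qn = l2_normalize(q)
        best_idx, best_score = None, -1.0
        # Brute-force scan; a corpus of 1 million images would need an index.
        for i, r in enumerate(refs):
            score = sum(a * b for a, b in zip(qn, r))
            if score > best_score:
                best_idx, best_score = i, score
        matched = best_idx if best_score >= threshold else None
        results.append((matched, best_score))
    return results


# Toy descriptors (hypothetical): the first query is a slightly perturbed
# copy of reference 0, the second resembles no reference in particular.
references = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
queries = [[0.9, 0.1, 0.0], [1.0, 1.0, 1.0]]
matches = find_copies(queries, references)
```

Because most queries are distractors, the decision depends on the similarity score clearing a threshold rather than on the nearest neighbor alone, which is why the sketch returns the score alongside the index.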

Code Repositories

facebookresearch/isc2021 (official; PyTorch; mentioned in GitHub)

Benchmarks

Benchmark | Methodology | Metrics
image-similarity-detection-on-disc21-dev | GIST PCA 256 | dimension: 256; hardware: CPU, 2.2 GHz, 40 threads; w/o normalization: 15.56
image-similarity-detection-on-disc21-dev | GIST 960 dim | Time (ms): 0.55; dimension: 960; hardware: CPU, 2.2 GHz, 40 threads; w/o normalization: 14.42
image-similarity-detection-on-disc21-dev | HOW+ASMK | Time (ms): 150; hardware: Tesla P-100; w/o normalization: 17.32; with normalization: 37.15
image-similarity-detection-on-disc21-dev | Multigrain 1500 dim | Time (ms): 23; dimension: 1500; hardware: Tesla V100; w/o normalization: 16.47; with normalization: 36.42
