HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Deep Learning for Entity Matching: A Design Space Exploration

{Vijay Raghavendra Esteban Arcaute Rohit Deep Ganesh Krishnan Youngchoon Park AnHai Doan Theodoros Rekatsinas Han Li Sidharth Mudgal}

Deep Learning for Entity Matching: A Design Space Exploration

Abstract

Entity matching (EM) finds data instances that refer to the same real-world entity. In this paper we examine applying deep learning (DL) to EM, to understand DL's benefits and limitations. We review many DL solutions that have been developed for related matching tasks in text processing (e.g., entity linking, textual entailment, etc.). We categorize these solutions and define a space of DL solutions for EM, as embodied by four solutions with varying representational power: SIF, RNN, Attention, and Hybrid. Next, we investigate the types of EM problems for which DL can be helpful. We consider three such problem types, which match structured data instances, textual instances, and dirty instances, respectively. We empirically compare the above four DL solutions with Magellan, a state-of-the-art learning-based EM solution. The results show that DL does not outperform current solutions on structured EM, but it can significantly outperform them on textual and dirty EM. For practitioners, this suggests that they should seriously consider using DL for textual and dirty EM problems. Finally, we analyze DL's performance and discuss future research directions.

Benchmarks

BenchmarkMethodologyMetrics
entity-resolution-on-abt-buyDeepMatcher - Hybrid
F1 (%): 62.80
entity-resolution-on-amazon-googleDeepMatcher - Hybrid
F1 (%): 69.3

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Deep Learning for Entity Matching: A Design Space Exploration | Papers | HyperAI