HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc

Guzman-Olivares Daniel Quijano-Sanchez Lara Liberatore Federico

SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in
  Context ad Hoc

Abstract

The rise of generative chat-based Large Language Models (LLMs) over the pasttwo years has spurred a race to develop systems that promise near-humanconversational and reasoning experiences. However, recent studies indicate thatthe language understanding offered by these models remains limited and far fromhuman-like performance, particularly in grasping the contextual meanings ofwords, an essential aspect of reasoning. In this paper, we present a simple yetcomputationally efficient framework for multilingual Word Sense Disambiguation(WSD). Our approach reframes the WSD task as a cluster discrimination analysisover a semantic network refined from BabelNet using group algebra. We validateour methodology across multiple WSD benchmarks, achieving a new state of theart for all languages and tasks, as well as in individual assessments by partof speech. Notably, our model significantly surpasses the performance ofcurrent alternatives, even in low-resource languages, while reducing theparameter count by 72%.

Code Repositories

danielguzmanolivares/sandwich
Official
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
word-sense-disambiguation-on-supervisedSANDWiCH
SemEval 2007: 80.9
SemEval 2013: 92.6
SemEval 2015: 91.5
Senseval 2: 87.8
Senseval 3: 85.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc | Papers | HyperAI