HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Ilias Chalkidis Abhik Jana Dirk Hartung Michael Bommarito Ion Androutsopoulos Daniel Martin Katz Nikolaos Aletras

LexGLUE: A Benchmark Dataset for Legal Language Understanding in English

Abstract

Laws and their interpretations, legal arguments and agreements\ are typically expressed in writing, leading to the production of vast corpora of legal text. Their analysis, which is at the center of legal practice, becomes increasingly elaborate as these collections grow in size. Natural language understanding (NLU) technologies can be a valuable tool to support legal practitioners in these endeavors. Their usefulness, however, largely depends on whether current state-of-the-art models can generalize across various tasks in the legal domain. To answer this currently open question, we introduce the Legal General Language Understanding Evaluation (LexGLUE) benchmark, a collection of datasets for evaluating model performance across a diverse set of legal NLU tasks in a standardized way. We also provide an evaluation and analysis of several generic and legal-oriented models demonstrating that the latter consistently offer performance improvements across multiple tasks.

Code Repositories

coastalcph/lex-glue
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
natural-language-understanding-on-lexglueCaseLaw-BERT
CaseHOLD: 75.6
ECtHR Task A: 71.2 / 64.2
ECtHR Task B: 88.0 / 77.5
EUR-LEX: 71.0 / 55.9
LEDGAR: 88.0 / 82.3
SCOTUS: 76.4 / 66.2
UNFAIR-ToS: 88.3 / 81.0
natural-language-understanding-on-lexglueRoBERTa
CaseHOLD: 71.7
ECtHR Task A: 69.5 / 60.7
ECtHR Task B: 87.2 / 77.3
EUR-LEX: 71.8 / 57.5
LEDGAR: 87.9 / 82.1
SCOTUS: 70.8 / 61.2
UNFAIR-ToS: 87.7 / 81.5
natural-language-understanding-on-lexglueBERT
CaseHOLD: 70.7
ECtHR Task A: 71.4 / 64.0
ECtHR Task B: 87.6 / 77.8
EUR-LEX: 71.6 / 55.6
LEDGAR: 87.7 / 82.2
SCOTUS: 70.5 / 60.9
UNFAIR-ToS: 87.5 / 81.0
natural-language-understanding-on-lexglueDeBERTa
CaseHOLD: 72.1
ECtHR Task A: 69.1 / 61.2
ECtHR Task B: 87.4 / 77.3
EUR-LEX: 72.3 / 57.2
LEDGAR: 87.9 / 82.0
SCOTUS: 70.0 / 60.0
UNFAIR-ToS: 87.2 / 78.8
natural-language-understanding-on-lexglueLongformer
CaseHOLD: 72.0
ECtHR Task A: 69.6 / 62.4
ECtHR Task B: 88.0 / 77.8
EUR-LEX: 71.9 / 56.7
LEDGAR: 87.7 / 82.3
SCOTUS: 72.2 / 62.5
UNFAIR-ToS: 87.7 / 80.1
natural-language-understanding-on-lexglueLegal-BERT
CaseHOLD: 75.1
ECtHR Task A: 71.2 / 64.6
ECtHR Task B: 88.0 / 77.2
EUR-LEX: 72.2 / 56.2
LEDGAR: 88.1 / 82.7
SCOTUS: 76.2 / 65.8
UNFAIR-ToS: 88.6 / 82.3
natural-language-understanding-on-lexglueBigBird
CaseHOLD: 70.4
ECtHR Task A: 70.5 / 63.8
ECtHR Task B: 88.1 / 76.6
EUR-LEX: 71.8 / 56.6
LEDGAR: 87.7 / 82.1
SCOTUS: 71.7 / 61.4
UNFAIR-ToS: 87.7 / 80.2

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp