5 months ago

AGB-DE: A Corpus for the Automated Legal Assessment of Clauses in German Consumer Contracts

Daniel Braun; Florian Matthes

Abstract

Legal tasks and datasets are often used as benchmarks for the capabilities of language models. However, openly available annotated datasets are rare. In this paper, we introduce AGB-DE, a corpus of 3,764 clauses from German consumer contracts that have been annotated and legally assessed by legal experts. Together with the data, we present a first baseline for the task of detecting potentially void clauses, comparing the performance of an SVM baseline with three fine-tuned open language models and the performance of GPT-3.5. Our results show the challenging nature of the task, with no approach exceeding an F1-score of 0.54. While the fine-tuned models often performed better with regard to precision, GPT-3.5 outperformed the other approaches with regard to recall. An analysis of the errors indicates that one of the main challenges could be the correct interpretation of complex clauses, rather than the decision boundaries of what is permissible and what is not.
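The abstract contrasts precision and recall and reports F1 as the headline metric. As a minimal sketch of how these three scores relate, the following computes them for a binary "potentially void" label. The toy labels and the two classifiers are hypothetical, chosen only to show the trade-off; this is not the authors' evaluation code and does not reproduce any number from the paper.

```python
def precision_recall_f1(gold, predicted, positive=1):
    """Return (precision, recall, F1) for the given positive class."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == positive and p == positive)
    fp = sum(1 for g, p in pairs if g != positive and p == positive)
    fn = sum(1 for g, p in pairs if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical toy data: 1 = potentially void clause, 0 = valid clause.
gold = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
# A cautious classifier flags few clauses: high precision, low recall.
cautious = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
# An eager classifier flags many clauses: low precision, high recall.
eager = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]

cautious_scores = precision_recall_f1(gold, cautious)
eager_scores = precision_recall_f1(gold, eager)
print("cautious:", cautious_scores)
print("eager:   ", eager_scores)
```

In this toy setup both classifiers end up with the same F1 even though one favors precision and the other recall, which is why it helps to look at all three scores rather than F1 alone.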

Code Repositories

DaBr01/AGB-DE (official, PyTorch, mentioned in GitHub)

Benchmarks

Benchmark: detection-of-potentially-void-clauses-on-agb
Methodology: AGBert
Metrics: F1: 0.54
