Overview of the 2022 Validity and Novelty Prediction Shared Task

Philipp Cimiano, Moritz Plenz, Juri Opitz, Anette Frank, Philipp Heinisch

Abstract

This paper provides an overview of the Argument Validity and Novelty Prediction Shared Task that was organized as part of the 9th Workshop on Argument Mining (ArgMining 2022). The task focused on predicting the validity and novelty of a conclusion given a textual premise. Validity is defined as the degree to which the conclusion is justified with respect to the given premise. Novelty is defined as the degree to which the conclusion contains content that is new in relation to the premise. Six groups participated in the task, submitting a total of 13 system runs for the subtask of binary classification and 2 system runs for the subtask of relative classification. The results reveal that the task is challenging: the best systems reach an F1 score of roughly 75% for Validity prediction, 70% for Novelty prediction, and 45% for correctly predicting both Validity and Novelty. In this paper we summarize the task definition and dataset. We give an overview of the results obtained by the participating systems, as well as the insights offered by the diverse contributions.
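To make the three reported scores concrete, here is a minimal sketch of how Validity, Novelty, and joint F1 could be computed for the binary subtask. The abstract does not say how F1 is averaged, so macro-averaging over classes is an assumption here, as is treating the joint score as a four-class problem over (validity, novelty) pairs. The helper names and the toy data are hypothetical.

```python
def f1_for_label(gold, pred, label):
    """F1 of a single class, treating `label` as the positive class."""
    tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
    fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
    fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
    if tp == 0:
        return 0.0  # no true positives: precision or recall is zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 (assumed averaging scheme)."""
    labels = sorted(set(gold) | set(pred))
    return sum(f1_for_label(gold, pred, lab) for lab in labels) / len(labels)


# Hypothetical toy data: each item is (validity, novelty), 1 = yes, 0 = no.
gold = [(1, 1), (1, 0), (0, 1), (0, 0)]
pred = [(1, 1), (1, 1), (0, 0), (0, 0)]

val_f1 = macro_f1([g[0] for g in gold], [p[0] for p in pred])
nov_f1 = macro_f1([g[1] for g in gold], [p[1] for p in pred])
joint_f1 = macro_f1(gold, pred)  # four joint classes: both labels must match

print(f"VAL-F1={val_f1:.2f} NOV-F1={nov_f1:.2f} JOINT-F1={joint_f1:.2f}")
```

In this toy example every validity label is predicted correctly, but only half of the novelty labels are, so the joint score drops well below either individual score. That is the same pattern the abstract reports for the participating systems.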

Benchmarks

Benchmark                   Methodology     JOINT-F1  NOV-F1  VAL-F1
valnov-on-valnov-subtask-a  ACCEPT-1        43.13     70.00   59.20
valnov-on-valnov-subtask-a  Baseline        23.90     36.12   59.96
valnov-on-valnov-subtask-a  CSS             42.40     59.86   70.76
valnov-on-valnov-subtask-a  System Average  35.94     52.97   62.74
valnov-on-valnov-subtask-a  NLP@UIT         25.89     43.36   61.72
valnov-on-valnov-subtask-a  CLTeamL-3       45.16     61.75   74.64
valnov-on-valnov-subtask-a  Harshad         17.35     39.00   56.31
valnov-on-valnov-subtask-b  AXiS@EdUni      29.16     25.86   32.47
valnov-on-valnov-subtask-b  NLP@UIT         41.50     38.39   44.60
valnov-on-valnov-subtask-b  Baseline        21.46     23.09   19.82
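
As a quick way to read the subtask A numbers, the sketch below copies the rows listed above into a Python list and picks the top entry per metric. The tuple layout and variable names are illustrative choices, not part of the benchmark.

```python
# Subtask A rows copied from the benchmark listing above:
# (methodology, JOINT-F1, NOV-F1, VAL-F1)
subtask_a = [
    ("ACCEPT-1", 43.13, 70.00, 59.20),
    ("Baseline", 23.90, 36.12, 59.96),
    ("CSS", 42.40, 59.86, 70.76),
    ("System Average", 35.94, 52.97, 62.74),
    ("NLP@UIT", 25.89, 43.36, 61.72),
    ("CLTeamL-3", 45.16, 61.75, 74.64),
    ("Harshad", 17.35, 39.00, 56.31),
]

# Rank all entries by joint F1, highest first.
ranked_by_joint = sorted(subtask_a, key=lambda row: row[1], reverse=True)

# Best entry per metric.
best_joint = ranked_by_joint[0]
best_novelty = max(subtask_a, key=lambda row: row[2])
best_validity = max(subtask_a, key=lambda row: row[3])

for name, joint, nov, val in ranked_by_joint:
    print(f"{name:<15} JOINT={joint:5.2f} NOV={nov:5.2f} VAL={val:5.2f}")
```

The per-metric maxima (74.64 for Validity, 70.00 for Novelty, 45.16 for joint) appear to correspond to the rounded figures of 75%, 70% and 45% quoted in the abstract.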
