HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Universal Evasion Attacks on Summarization Scoring

Wenchuan Mu Kwan Hui Lim

Universal Evasion Attacks on Summarization Scoring

Abstract

The automatic scoring of summaries is important as it guides the development of summarizers. Scoring is also complex, as it involves multiple aspects such as fluency, grammar, and even textual entailment with the source text. However, summary scoring has not been considered a machine learning task to study its accuracy and robustness. In this study, we place automatic scoring in the context of regression machine learning tasks and perform evasion attacks to explore its robustness. Attack systems predict a non-summary string from each input, and these non-summary strings achieve competitive scores with good summarizers on the most popular metrics: ROUGE, METEOR, and BERTScore. Attack systems also "outperform" state-of-the-art summarization methods on ROUGE-1 and ROUGE-L, and score the second-highest on METEOR. Furthermore, a BERTScore backdoor is observed: a simple trigger can score higher than any automatic summarization method. The evasion attacks in this work indicate the low robustness of current scoring systems at the system level. We hope that our highlighting of these proposed attacks will facilitate the development of summary scores.

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
abstractive-text-summarization-on-cnn-dailyScrambled code + broken
ROUGE-1: 46.71
ROUGE-2: 20.39
ROUGE-L: 43.56
abstractive-text-summarization-on-cnn-dailyScrambled code + broken (alter)
ROUGE-1: 48.18
ROUGE-2: 19.84
ROUGE-L: 45.35
document-summarization-on-cnn-daily-mailScrambled code + broken (alter)
ROUGE-1: 48.18
ROUGE-2: 19.84
ROUGE-L: 45.35

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Universal Evasion Attacks on Summarization Scoring | Papers | HyperAI