HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Attention Is (not) All You Need for Commonsense Reasoning

Tassilo Klein; Moin Nabi

Attention Is (not) All You Need for Commonsense Reasoning

Abstract

The recently introduced BERT model exhibits strong performance on several language understanding benchmarks. In this paper, we describe a simple re-implementation of BERT for commonsense reasoning. We show that the attentions produced by BERT can be directly utilized for tasks such as the Pronoun Disambiguation Problem and Winograd Schema Challenge. Our proposed attention-guided commonsense reasoning method is conceptually simple yet empirically powerful. Experimental analysis on multiple datasets demonstrates that our proposed system performs remarkably well on all cases while outperforming the previously reported state of the art by a margin. While results suggest that BERT seems to implicitly learn to establish complex relationships between entities, solving commonsense reasoning tasks might require more than unsupervised models learned from huge text corpora.

Code Repositories

SAP-samples/acl2020-commonsense
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
coreference-resolution-on-winograd-schemaUSSM + Supervised DeepNet + KB
Accuracy: 52.8
coreference-resolution-on-winograd-schemaUSSM + KB
Accuracy: 52
coreference-resolution-on-winograd-schemaBERT-base 110M + MAS
Accuracy: 60.3
natural-language-understanding-on-pdp60BERT-base 110M + MAS
Accuracy: 68.3
natural-language-understanding-on-pdp60USSM + Supervised Deepnet
Accuracy: 53.3
natural-language-understanding-on-pdp60USSM + Supervised Deepnet + 3 Knowledge Bases
Accuracy: 66.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Attention Is (not) All You Need for Commonsense Reasoning | Papers | HyperAI