A Simple Method for Commonsense Reasoning

Trieu H. Trinh; Quoc V. Le

Abstract

Commonsense reasoning is a long-standing challenge for deep learning. For example, it is difficult to use neural networks to tackle the Winograd Schema dataset (Levesque et al., 2011). In this paper, we present a simple method for commonsense reasoning with neural networks, using unsupervised learning. Key to our method is the use of language models, trained on a massive amount of unlabeled data, to score multiple-choice questions posed by commonsense reasoning tests. On both the Pronoun Disambiguation and Winograd Schema challenges, our models outperform previous state-of-the-art methods by a large margin, without using expensive annotated knowledge bases or hand-engineered features. We train an array of large RNN language models that operate at the word or character level on LM-1-Billion, CommonCrawl, SQuAD, Gutenberg Books, and a customized corpus for this task, and show that diversity of training data plays an important role in test performance. Further analysis also shows that our system successfully discovers important features of the context that decide the correct answer, indicating a good grasp of commonsense knowledge.
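The scoring idea in the abstract can be illustrated with a minimal sketch: substitute each candidate for the ambiguous pronoun, score the resulting sentence with a language model, and pick the highest-scoring candidate. Everything below is an assumption made for illustration, not the authors' implementation: the `BigramLM` class is a toy add-one-smoothed bigram model standing in for the large RNN language models, the names `score` and `resolve` are hypothetical, and the corpus is invented. The sketch also assumes that "full scoring" in the benchmark entries means the log-probability of the whole substituted sentence, and that "partial scoring" means the log-probability of only the words after the substituted candidate, conditioned on what precedes them.

```python
import math
from collections import Counter


class BigramLM:
    """Toy add-one-smoothed bigram model (a stand-in for a large RNN LM)."""

    def __init__(self, corpus):
        self.unigrams = Counter()  # counts of a token appearing as a left context
        self.bigrams = Counter()   # counts of (previous token, current token)
        self.vocab = set()
        for sentence in corpus:
            tokens = ["<s>"] + sentence.split()
            self.vocab.update(tokens)
            for prev, cur in zip(tokens, tokens[1:]):
                self.unigrams[prev] += 1
                self.bigrams[(prev, cur)] += 1

    def log_prob(self, prev, cur):
        # Add-one smoothing; the extra +1 reserves mass for unseen tokens.
        v = len(self.vocab) + 1
        return math.log((self.bigrams[(prev, cur)] + 1) / (self.unigrams[prev] + v))


def score(lm, tokens, pronoun_idx, candidate, partial=False):
    """Log-probability of the sentence with the pronoun replaced by `candidate`.

    partial=False: sum over every token (assumed meaning of "full scoring").
    partial=True:  sum over tokens after the candidate only
                   (assumed meaning of "partial scoring").
    """
    cand = candidate.split()
    substituted = tokens[:pronoun_idx] + cand + tokens[pronoun_idx + 1:]
    padded = ["<s>"] + substituted
    log_probs = [lm.log_prob(p, c) for p, c in zip(padded, padded[1:])]
    start = pronoun_idx + len(cand) if partial else 0
    return sum(log_probs[start:])


def resolve(lm, sentence, pronoun, candidates, partial=False):
    """Return the candidate whose substitution the language model scores highest."""
    tokens = sentence.split()
    idx = tokens.index(pronoun)
    return max(candidates, key=lambda c: score(lm, tokens, idx, c, partial))


# Invented toy data, for illustration only.
corpus = ["the cat was hungry", "the cat was fast", "the mouse ate cheese"]
lm = BigramLM(corpus)
sentence = "the cat chased the mouse because it was hungry"
candidates = ["the cat", "the mouse"]

full_answer = resolve(lm, sentence, "it", candidates)
partial_answer = resolve(lm, sentence, "it", candidates, partial=True)
```

On this toy corpus both scoring modes select "the cat", because the bigram counts make "was" more likely after "cat" than after "mouse". A bigram model only sees one word of context, so it cannot capture the longer-range cues that Winograd schemas depend on; the sketch shows the substitute-and-score procedure, not the reported accuracy.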

Benchmarks

Benchmark | Methodology | Metrics
coreference-resolution-on-winograd-schema | Char-level CNN+LSTM (partial scoring) | Accuracy: 57.9
coreference-resolution-on-winograd-schema | Ensemble of 14 LMs | Accuracy: 63.7
coreference-resolution-on-winograd-schema | Word-level CNN+LSTM (partial scoring) | Accuracy: 62.6
natural-language-understanding-on-pdp60 | Word-level CNN+LSTM (partial scoring) | Accuracy: 53.3
natural-language-understanding-on-pdp60 | Word-level CNN+LSTM (full scoring) | Accuracy: 60.0
