HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning

Shuohang Wang; Sheng Zhang; Yelong Shen; Xiaodong Liu; Jingjing Liu; Jianfeng Gao; Jing Jiang

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning

Abstract

Commonsense reasoning is fundamental to natural language understanding. While traditional methods rely heavily on human-crafted features and knowledge bases, we explore learning commonsense knowledge from a large amount of raw text via unsupervised learning. We propose two neural network models based on the Deep Structured Semantic Models (DSSM) framework to tackle two classic commonsense reasoning tasks, Winograd Schema challenges (WSC) and Pronoun Disambiguation (PDP). Evaluation shows that the proposed models effectively capture contextual information in the sentence and co-reference information between pronouns and nouns, and achieve significant improvement over previous state-of-the-art approaches.

Benchmarks

BenchmarkMethodologyMetrics
coreference-resolution-on-winograd-schemaUDSSM-I (ensemble)
Accuracy: 57.1
coreference-resolution-on-winograd-schemaUDSSM-II (ensemble)
Accuracy: 62.4
coreference-resolution-on-winograd-schemaUDSSM-I
Accuracy: 54.5
coreference-resolution-on-winograd-schemaDSSM
Accuracy: 63.0
coreference-resolution-on-winograd-schemaUDSSM-II
Accuracy: 59.2
natural-language-understanding-on-pdp60UDSSM-II (ensemble)
Accuracy: 78.3
natural-language-understanding-on-pdp60UDSSM-II
Accuracy: 75
natural-language-understanding-on-pdp60DSSM
Accuracy: 75.0
natural-language-understanding-on-pdp60UDSSM-I (ensemble)
Accuracy: 76.7

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Unsupervised Deep Structured Semantic Models for Commonsense Reasoning | Papers | HyperAI