Semantics-aware BERT for Language Understanding

Zhuosheng Zhang; Yuwei Wu; Hai Zhao; Zuchao Li; Shuailiang Zhang; Xi Zhou; Xiang Zhou

Abstract

The latest work on language representations carefully integrates contextualized features into language model training, which has enabled a series of successes, especially in machine reading comprehension and natural language inference tasks. However, existing language representation models, including ELMo, GPT, and BERT, exploit only plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information, which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT retains the convenient usability of its BERT precursor and requires only light fine-tuning, without substantial task-specific modifications. Compared with BERT, SemBERT is just as simple in concept but more powerful. It obtains new state-of-the-art results or substantially improves on previous results across ten reading comprehension and language inference tasks.
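
The abstract describes fusing explicit semantics from a semantic role labeler with BERT's contextual token representations. The sketch below is only a minimal illustration of that fusion idea, not the authors' implementation (see cooelf/SemBERT for the official PyTorch code). Names such as SemBertSketch, num_srl_labels, and srl_dim are hypothetical, and the published model additionally aggregates subwords to word level and encodes one tag sequence per predicate before fusing; both steps are omitted here for brevity.

```python
# Minimal, simplified sketch of BERT + SRL-label fusion (not the official SemBERT code).
# Assumes SRL tags have already been aligned to BERT's subword tokens.
import torch
import torch.nn as nn
from transformers import BertModel


class SemBertSketch(nn.Module):
    def __init__(self, num_srl_labels, srl_dim=10, num_classes=3,
                 bert_name="bert-base-uncased"):
        super().__init__()
        self.bert = BertModel.from_pretrained(bert_name)
        hidden = self.bert.config.hidden_size
        # Embed each SRL tag (e.g. ARG0, ARG1, V, O) into a small vector.
        self.srl_embed = nn.Embedding(num_srl_labels, srl_dim)
        # Fuse contextual (BERT) and explicit semantic (SRL) features.
        self.fuse = nn.Linear(hidden + srl_dim, hidden)
        self.classifier = nn.Linear(hidden, num_classes)

    def forward(self, input_ids, attention_mask, srl_label_ids):
        # srl_label_ids: (batch, seq_len) - one SRL tag id per subword token,
        # merged across predicates for this simplified sketch.
        token_repr = self.bert(input_ids, attention_mask=attention_mask).last_hidden_state
        sem_repr = self.srl_embed(srl_label_ids)
        joint = torch.cat([token_repr, sem_repr], dim=-1)
        fused = torch.tanh(self.fuse(joint))
        # Use the fused [CLS] position for sequence-level tasks such as NLI.
        return self.classifier(fused[:, 0])
```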

Code Repositories

cooelf/SemBERT (official, PyTorch)

Benchmarks

Benchmark | Methodology | Metrics
natural-language-inference-on-snli | SemBERT | Test Accuracy (%): 91.9; Train Accuracy (%): 94.4; Parameters: 339M
question-answering-on-squad20 | SemBERT (single model) | EM: 84.800; F1: 87.864
question-answering-on-squad20 | SemBERT (ensemble) | EM: 86.166; F1: 88.886
question-answering-on-squad20-dev | SemBERT large | EM: 80.9; F1: 83.6
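
The SQuAD 2.0 rows report Exact Match (EM) and token-level F1. As a rough illustration of how such metrics are typically computed per example (the official SQuAD evaluation script additionally normalizes answers by lowercasing and stripping articles and punctuation, and handles unanswerable questions), the helper functions below use hypothetical names:

```python
# Simplified per-example EM and token-level F1, roughly in the style of SQuAD scoring.
from collections import Counter


def exact_match(prediction: str, gold: str) -> float:
    # 1.0 if the (lowercased, stripped) strings match exactly, else 0.0.
    return float(prediction.strip().lower() == gold.strip().lower())


def token_f1(prediction: str, gold: str) -> float:
    # Harmonic mean of token-overlap precision and recall.
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


print(exact_match("Denver Broncos", "Denver Broncos"))          # 1.0
print(round(token_f1("the Denver Broncos", "Denver Broncos"), 3))  # 0.8
```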
