HyperAIHyperAI

Command Palette

Search for a command to run...

4 months ago

Multi-Task Deep Neural Networks for Natural Language Understanding

Xiaodong Liu; Pengcheng He; Weizhu Chen; Jianfeng Gao

Multi-Task Deep Neural Networks for Natural Language Understanding

Abstract

In this paper, we present a Multi-Task Deep Neural Network (MT-DNN) for learning representations across multiple natural language understanding (NLU) tasks. MT-DNN not only leverages large amounts of cross-task data, but also benefits from a regularization effect that leads to more general representations in order to adapt to new tasks and domains. MT-DNN extends the model proposed in Liu et al. (2015) by incorporating a pre-trained bidirectional transformer language model, known as BERT (Devlin et al., 2018). MT-DNN obtains new state-of-the-art results on ten NLU tasks, including SNLI, SciTail, and eight out of nine GLUE tasks, pushing the GLUE benchmark to 82.7% (2.2% absolute improvement). We also demonstrate using the SNLI and SciTail datasets that the representations learned by MT-DNN allow domain adaptation with substantially fewer in-domain labels than the pre-trained BERT representations. The code and pre-trained models are publicly available at https://github.com/namisan/mt-dnn.

Code Repositories

phueb/BabyBertSRL
pytorch
Mentioned in GitHub
gaohuan2015/NLPTool
pytorch
Mentioned in GitHub
namisan/mt-dnn
Official
pytorch
Mentioned in GitHub
om00839/machine-suneung
pytorch
Mentioned in GitHub
ABaldrati/MT-BERT
pytorch
Mentioned in GitHub
xycforgithub/MultiTask-MRC
pytorch
Mentioned in GitHub
phueb/CHILDES-SRL
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
linguistic-acceptability-on-colaMT-DNN
Accuracy: 68.4%
natural-language-inference-on-multinliMT-DNN
Matched: 86.7
Mismatched: 86.0
natural-language-inference-on-scitailMT-DNN
Accuracy: 94.1
natural-language-inference-on-snliMT-DNN
% Test Accuracy: 91.6
% Train Accuracy: 97.2
Parameters: 330m
natural-language-inference-on-snliNtumpha
% Test Accuracy: 90.5
% Train Accuracy: 99.1
Parameters: 220
paraphrase-identification-on-quora-questionMT-DNN
Accuracy: 89.6
F1: 72.4
sentiment-analysis-on-sst-2-binaryMT-DNN
Accuracy: 95.6

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Multi-Task Deep Neural Networks for Natural Language Understanding | Papers | HyperAI