HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

Semi-Supervised Sequence Modeling with Cross-View Training

Kevin Clark; Minh-Thang Luong; Christopher D. Manning; Quoc V. Le

Semi-Supervised Sequence Modeling with Cross-View Training

Abstract

Unsupervised representation learning algorithms such as word2vec and ELMo improve the accuracy of many supervised NLP models, mainly because they can take advantage of large amounts of unlabeled text. However, the supervised models only learn from task-specific labeled data during the main training phase. We therefore propose Cross-View Training (CVT), a semi-supervised learning algorithm that improves the representations of a Bi-LSTM sentence encoder using a mix of labeled and unlabeled data. On labeled examples, standard supervised learning is used. On unlabeled examples, CVT teaches auxiliary prediction modules that see restricted views of the input (e.g., only part of a sentence) to match the predictions of the full model seeing the whole input. Since the auxiliary modules and the full model share intermediate representations, this in turn improves the full model. Moreover, we show that CVT is particularly effective when combined with multi-task learning. We evaluate CVT on five sequence tagging tasks, machine translation, and dependency parsing, achieving state-of-the-art results.

Code Repositories

rezkaaufar/pytorch-cvt
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
ccg-supertagging-on-ccgbankCVT + Multi-task + Large
Accuracy: 96.1
dependency-parsing-on-penn-treebankCVT + Multi-Task
LAS: 95.02
UAS: 96.61
machine-translation-on-iwslt2015-english-1CVT
BLEU: 29.6
named-entity-recognition-ner-on-conll-2003CVT + Multi-Task + Large
F1: 92.61
named-entity-recognition-ner-on-conll-2003CVT + Multi-Task
F1: 92.61
named-entity-recognition-ner-on-ontonotes-v5CVT + Multi-Task + Large
F1: 88.81
part-of-speech-tagging-on-penn-treebankCVT + Multi-task
Accuracy: 97.76

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Semi-Supervised Sequence Modeling with Cross-View Training | Papers | HyperAI