BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov, Luke Zettlemoyer

Abstract

We present BART, a denoising autoencoder for pretraining sequence-to-sequence models. BART is trained by (1) corrupting text with an arbitrary noising function, and (2) learning a model to reconstruct the original text. It uses a standard Transformer-based neural machine translation architecture which, despite its simplicity, can be seen as generalizing BERT (due to the bidirectional encoder), GPT (with the left-to-right decoder), and many other more recent pretraining schemes. We evaluate a number of noising approaches, finding the best performance by both randomly shuffling the order of the original sentences and using a novel in-filling scheme, where spans of text are replaced with a single mask token. BART is particularly effective when fine-tuned for text generation but also works well for comprehension tasks. It matches the performance of RoBERTa with comparable training resources on GLUE and SQuAD, and achieves new state-of-the-art results on a range of abstractive dialogue, question answering, and summarization tasks, with gains of up to 6 ROUGE. BART also provides a 1.1 BLEU increase over a back-translation system for machine translation, with only target language pretraining. We also report ablation experiments that replicate other pretraining schemes within the BART framework, to better measure which factors most influence end-task performance.
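
The two noising operations the abstract highlights, sentence shuffling and text infilling, can be sketched as follows. This is an illustrative approximation, not the authors' implementation: "<mask>" stands in for the model's mask token, the helper names are ours, and span lengths follow the Poisson(lambda = 3) distribution described in the paper.

```python
# Illustrative sketch of BART's two most effective noising operations:
# sentence permutation and text infilling (not the authors' code).
import random

import numpy as np

MASK = "<mask>"


def permute_sentences(sentences):
    """Randomly shuffle the order of the original sentences."""
    shuffled = list(sentences)
    random.shuffle(shuffled)
    return shuffled


def text_infilling(tokens, mask_prob=0.3, poisson_lambda=3.0):
    """Replace sampled spans of tokens with a single <mask> token each.

    A sampled span of length 0 inserts a mask without removing any token,
    so the model must also learn how many tokens a span is hiding.
    """
    out, i, n = [], 0, len(tokens)
    while i < n:
        if random.random() < mask_prob:
            span = int(np.random.poisson(poisson_lambda))
            out.append(MASK)   # the whole span collapses to one mask token
            i += span          # skip the tokens covered by the span
        else:
            out.append(tokens[i])
            i += 1
    return out


# Toy example: corrupt a two-sentence document. During pretraining the
# seq2seq model is trained to reconstruct the original, uncorrupted text.
sentences = ["the cat sat on the mat .".split(), "it purred loudly .".split()]
noised = text_infilling([tok for s in permute_sentences(sentences) for tok in s])
print(" ".join(noised))
```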

Code Repositories

The following GitHub repositories mention this paper (framework in parentheses):

shijx12/kqapro_baselines (PyTorch)
W4ngatang/qags (PyTorch)
tangg555/sabart (PyTorch)
awalther/scibart (PyTorch)
jiacheng-xu/text-sum-uncertainty (PyTorch)
chakravarthi-v/Polaroid-1 (PyTorch)
facebookresearch/GENRE (PyTorch)
mcao610/Factual-Error-Correction (PyTorch)
microsoft/fastseq (PyTorch)
jongwooko/nash-pruning-official (PyTorch)
vgaraujov/seq2seq-spanish-plms (PyTorch)
xieyxclack/factual_coco (PyTorch)
nlmatics/llmsherpa
tanyuqian/aspect-based-summarization (PyTorch)
shmsw25/bart-closed-book-qa (PyTorch)
thefonseca/factorsum (PyTorch)
zhdbwe/Paper-DailyReading (TensorFlow)
KushGrandhi/Polaroid (PyTorch)
john-bradshaw/rxn-lm (PyTorch)
allenai/scientific-claim-generation (PyTorch)
vinayak19th/Brevis-2.0
udnet96/BART-various-finetune (PyTorch)
huggingface/transformers (PyTorch; see the usage sketch after this list)
dawn0815/UniSA (PyTorch)
facebookresearch/bart_ls (PyTorch)
skt-ai/kobart
qywu/memformers (PyTorch)
i2r-simmc/i2r-simmc-2020 (PyTorch)
huangxt39/BART_on_COVID_dialogue (PyTorch)
microsoft/Table-Pretraining (PyTorch)
maanvithag/thinkai
timrozday/spl-indications-bart (PyTorch)
HHousen/TransformerSum (PyTorch)
wyu97/Easy-use-BART (PyTorch)

Benchmarks

Benchmark                                      Methodology                        Metrics
abstractive-text-summarization-on-cnn-daily    BART                               ROUGE-1: 44.16 | ROUGE-2: 21.28 | ROUGE-L: 40.90
open-domain-question-answering-on-eli5         BART                               ROUGE-1: 30.6 | ROUGE-2: 6.2 | ROUGE-L: 24.3
question-answering-on-squad11-dev              BART Base (with text infilling)    F1: 90.8
text-summarization-on-x-sum                    BART                               ROUGE-1: 45.14 | ROUGE-2: 22.27 | ROUGE-L: 37.25
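
The summarization and ELI5 rows above are scored with ROUGE. For reference, the snippet below shows how ROUGE-1/2/L F-scores of this kind are typically computed with the rouge_score package; the reference and candidate strings are toy examples, not data from these benchmarks.

```python
# Minimal sketch of computing ROUGE-1/2/L with the rouge_score package.
# The reference/candidate strings are hypothetical toy examples.
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
reference = "the cat sat on the mat"
candidate = "a cat was sitting on the mat"
scores = scorer.score(reference, candidate)

for name, result in scores.items():
    # each result carries precision, recall, and F-measure; leaderboards report F1
    print(f"{name}: {result.fmeasure:.4f}")
```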
