Universal Language Model Fine-tuning for Text Classification
Jeremy Howard; Sebastian Ruder

Abstract
Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We open-source our pretrained models and code.
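The "techniques that are key for fine-tuning a language model" are, in the paper, discriminative fine-tuning, slanted triangular learning rates (STLR), and gradual unfreezing. Below is a minimal PyTorch sketch of all three. The `Classifier` module, its layer sizes, the update count `T`, and the placeholder training loop are illustrative assumptions; the 2.6 learning-rate divisor and the STLR formula are taken from the paper.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the paper's pretrained AWD-LSTM encoder plus a
# classifier head (the real model and sizes live in the released fastai code).
class Classifier(nn.Module):
    def __init__(self, vocab_size=10000, emb=400, hid=1150, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb)
        self.rnn1 = nn.LSTM(emb, hid, batch_first=True)
        self.rnn2 = nn.LSTM(hid, hid, batch_first=True)
        self.rnn3 = nn.LSTM(hid, emb, batch_first=True)
        self.head = nn.Linear(emb, n_classes)

    def forward(self, x):
        h, _ = self.rnn1(self.embed(x))
        h, _ = self.rnn2(h)
        h, _ = self.rnn3(h)
        return self.head(h[:, -1])  # classify from the final time step

def stlr(t, T, eta_max=0.01, cut_frac=0.1, ratio=32):
    """Slanted triangular learning rate (paper's formula): a short linear
    increase over the first cut_frac of T updates, then a long linear decay."""
    cut = int(T * cut_frac)
    p = t / cut if t < cut else 1 - (t - cut) / (cut * (1 / cut_frac - 1))
    return eta_max * (1 + p * (ratio - 1)) / ratio

model = Classifier()
groups = [model.embed, model.rnn1, model.rnn2, model.rnn3, model.head]

# Discriminative fine-tuning: each lower layer group trains with the learning
# rate of the group above it divided by 2.6 (the factor used in the paper).
divisors = [2.6 ** (len(groups) - 1 - i) for i in range(len(groups))]
optimizer = torch.optim.SGD(
    [{"params": g.parameters(), "lr": 0.01 / d} for g, d in zip(groups, divisors)]
)

# Gradual unfreezing: everything except the head starts frozen; one more
# group is unfrozen per epoch, from the last layer backwards.
for g in groups[:-1]:
    for p in g.parameters():
        p.requires_grad = False

T = 1000  # total number of updates; illustrative value
t = 0
for epoch in range(len(groups)):
    if epoch > 0:
        for p in groups[-1 - epoch].parameters():
            p.requires_grad = True
    # One epoch over the target-task batches would run here. At each update,
    # every group follows the STLR schedule, scaled by its divisor:
    for pg, d in zip(optimizer.param_groups, divisors):
        pg["lr"] = stlr(t, T) / d
    t += 1
```

In the released code the encoder is fastai's AWD-LSTM with a concat-pooled linear head; the sketch above fixes only the scheduling ideas, not the architecture.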
Benchmarks
| Benchmark | Method | Metric (%) |
|---|---|---|
| sentiment-analysis-on-imdb | ULMFiT | Accuracy: 95.4 |
| sentiment-analysis-on-yelp-binary | ULMFiT | Error: 2.16 |
| sentiment-analysis-on-yelp-fine-grained | ULMFiT | Error: 29.98 |
| text-classification-on-ag-news | ULMFiT | Error: 5.01 |
| text-classification-on-dbpedia | ULMFiT | Error: 0.80 |
| text-classification-on-trec-6 | ULMFiT | Error: 3.6 |