4 months ago

Learning to Generate Reviews and Discovering Sentiment

Alec Radford; Rafal Jozefowicz; Ilya Sutskever

Abstract

We explore the properties of byte-level recurrent language models. When given sufficient amounts of capacity, training data, and compute time, the representations learned by these models include disentangled features corresponding to high-level concepts. Specifically, we find a single unit which performs sentiment analysis. These representations, learned in an unsupervised manner, achieve state of the art on the binary subset of the Stanford Sentiment Treebank. They are also very data efficient. When using only a handful of labeled examples, our approach matches the performance of strong baselines trained on full datasets. We also demonstrate the sentiment unit has a direct influence on the generative process of the model. Simply fixing its value to be positive or negative generates samples with the corresponding positive or negative sentiment.

Code Repositories

openai/generating-reviews-discovering-sentiment

Mentioned in GitHub

faramarzmunshi/d2l-nlp

Mentioned in GitHub

Benchmarks

Benchmark	Methodology	Metrics
sentiment-analysis-on-sst-2-binary	bmLSTM	Accuracy: 91.8
subjectivity-analysis-on-subj	Byte mLSTM	Accuracy: 94.60

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started

Hyper Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette