HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Data Extrapolation for Text-to-image Generation on Small Datasets

Senmao Ye Fei Liu

Data Extrapolation for Text-to-image Generation on Small Datasets

Abstract

Text-to-image generation requires large amount of training data to synthesizing high-quality images. For augmenting training data, previous methods rely on data interpolations like cropping, flipping, and mixing up, which fail to introduce new information and yield only marginal improvements. In this paper, we propose a new data augmentation method for text-to-image generation using linear extrapolation. Specifically, we apply linear extrapolation only on text feature, and new image data are retrieved from the internet by search engines. For the reliability of new text-image pairs, we design two outlier detectors to purify retrieved images. Based on extrapolation, we construct training samples dozens of times larger than the original dataset, resulting in a significant improvement in text-to-image performance. Moreover, we propose a NULL-guidance to refine score estimation, and apply recurrent affine transformation to fuse text information. Our model achieves FID scores of 7.91, 9.52 and 5.00 on the CUB, Oxford and COCO datasets. The code and data will be available on GitHub (https://github.com/senmaoy/RAT-Diffusion).

Code Repositories

senmaoy/RAT-Diffusion
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
text-to-image-generation-on-cocoRAT-Diffusion
FID: 5.00
text-to-image-generation-on-cubRAT-Diffusion
FID: 6.36
Inception score: 6.56
text-to-image-generation-on-oxford-102RAT-Diffusion
FID: 9.52
Inception score: 4.35

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Data Extrapolation for Text-to-image Generation on Small Datasets | Papers | HyperAI