HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation

{Zheng Zhang Xipeng Qiu Qipeng Guo Zhijing Jin}

GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation

Abstract

Data collection for the knowledge graph-to-text generation is expensive. As a result, research on unsupervised models has emerged as an active field recently. However, most unsupervised models have to use non-parallel versions of existing small supervised datasets, which largely constrain their potential. In this paper, we propose a large-scale, general-domain dataset, GenWiki. Our unsupervised dataset has 1.3M text and graph examples, respectively. With a human-annotated test set, we provide this new benchmark dataset for future research on unsupervised text generation from knowledge graphs.

Benchmarks

BenchmarkMethodologyMetrics
unsupervised-kg-to-text-generation-on-genwikiCycleGT_Warm
BLEU: 41.35
CIDEr: 3.45
METEOR: 35.20
ROUGE-L: 63.01
unsupervised-kg-to-text-generation-on-genwikiRule-Based
BLEU: 13.45
CIDEr: 1.26
METEOR: 30.72
ROUGE-L: 40.93
unsupervised-kg-to-text-generation-on-genwikiNoisySupervised
BLEU: 30.12
CIDEr: 2.52
METEOR: 28.12
ROUGE-L: 56.96
unsupervised-kg-to-text-generation-on-genwikiCycleGT_Base
BLEU: 41.59
CIDEr: 3.57
METEOR: 35.72
ROUGE-L: 63.31
unsupervised-kg-to-text-generation-on-genwikiDirectTransfer
BLEU: 13.89
CIDEr: 1.26
METEOR: 25.76
ROUGE-L: 39.75
unsupervised-kg-to-text-generation-on-genwiki-1CycleGT_Warm
BLEU: 40.47
CIDEr: 3.48
METEOR: 34.84
ROUGE-L: 63.40
unsupervised-kg-to-text-generation-on-genwiki-1CycleGT_Base
BLEU: 41.29
CIDEr: 3.53
METEOR: 35.39
ROUGE-L: 63.73
unsupervised-kg-to-text-generation-on-genwiki-1DirectTransfer
BLEU: 13.89
CIDEr: 1.26
METEOR: 25.76
ROUGE-L: 39.75
unsupervised-kg-to-text-generation-on-genwiki-1Rule-Based
BLEU: 13.45
CIDEr: 1.26
METEOR: 30.72
ROUGE-L: 40.93
unsupervised-kg-to-text-generation-on-genwiki-1NoisySupervised
BLEU: 35.03
CIDEr: 2.63
METEOR: 33.45
ROUGE-L: 58.14

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation | Papers | HyperAI