HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity

Hamza Harkous Isabel Groves Amir Saffari

Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity

Abstract

End-to-end neural data-to-text (D2T) generation has recently emerged as an alternative to pipeline-based architectures. However, it has faced challenges in generalizing to new domains and generating semantically consistent text. In this work, we present DataTuner, a neural, end-to-end data-to-text generation system that makes minimal assumptions about the data representation and the target domain. We take a two-stage generation-reranking approach, combining a fine-tuned language model with a semantic fidelity classifier. Each of our components is learnt end-to-end without the need for dataset-specific heuristics, entity delexicalization, or post-processing. We show that DataTuner achieves state of the art results on the automated metrics across four major D2T datasets (LDC2017T10, WebNLG, ViGGO, and Cleaned E2E), with a fluency assessed by human annotators nearing or exceeding the human-written reference texts. We further demonstrate that the model-based semantic fidelity scorer in DataTuner is a better assessment tool compared to traditional, heuristic-based measures. Our generated text has a significantly better semantic fidelity than the state of the art across all four datasets

Code Repositories

Benchmarks

BenchmarkMethodologyMetrics
data-to-text-generation-on-cleaned-e2e-nlg-1DataTuner_FC
BLEU (Test set): 43.6
data-to-text-generation-on-viggo-1DataTuner_FC
BLEU: 53.6
data-to-text-generation-on-webnlg-full-1DATATUNER_NO_FC
BLEU: 52.9

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity | Papers | HyperAI