Command Palette
Search for a command to run...
FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text Generation
{Ioana Manolescu Oana Balalau Kun Zhang}

Abstract
Graph-to-text (G2T) generation takes a graph as input and aims to generate a fluent and faith- ful textual representation of the information in the graph. The task has many applications, such as dialogue generation and question an- swering. In this work, we investigate to what extent the G2T generation problem is solved for previously studied datasets, and how pro- posed metrics perform when comparing generated texts. To help address their limitations, we propose a new metric that correctly identifies factual faithfulness, i.e., given a triple (subject, predicate, object), it decides if the triple is present in a generated text. We show that our metric FactSpotter achieves the highest correlation with human annotations on data correct- ness, data coverage, and relevance. In addition, FactSpotter can be used as a plug-in feature to improve the factual faithfulness of existing models. Finally, we investigate if existing G2T datasets are still challenging for state-of-the-art models. Our code is available online: https://github.com/guihuzhang/FactSpotter.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| data-to-text-generation-on-dart | T5-B Baseline | BLEU: 48.47 BLEURT: 67.49 METEOR: 40.74 |
| data-to-text-generation-on-webnlg | T5-B Baseline | BLEU: 67.04 BLEURT: 73.22 METEOR: 48.35 |
| data-to-text-generation-on-webnlg | JointGT Baseline | BLEU: 67.08 BLEURT: 73.44 METEOR: 48.34 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.