HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts

Shuhe Wang; Yuxian Meng; Xiaoya Li; Xiaofei Sun; Rongbin Ouyang; Jiwei Li

OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts

Abstract

In order to better simulate the real human conversation process, models need to generate dialogue utterances based on not only preceding textual contexts but also visual contexts. However, with the development of multi-modal dialogue learning, the dataset scale gradually becomes a bottleneck. In this report, we release OpenViDial 2.0, a larger-scale open-domain multi-modal dialogue dataset compared to the previous version OpenViDial 1.0. OpenViDial 2.0 contains a total number of 5.6 million dialogue turns extracted from either movies or TV series from different resources, and each dialogue turn is paired with its corresponding visual context. We hope this large-scale dataset can help facilitate future researches on open-domain multi-modal dialog generation, e.g., multi-modal pretraining for dialogue generation.

Code Repositories

ShannonAI/OpenViDial
Official
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
multi-modal-dialogue-generation-on-openvidialCV (w/o MI)
BLEU: 1.97
Dis-1: 0.0041
Dis-2: 0.0353
Dis-3: 0.0999
Dis-4: 0.1726
multi-modal-dialogue-generation-on-openvidialNV (w/o MI)
BLEU: 1.95
Dis-1: 0.0037
Dis-2: 0.0302
Dis-3: 0.0929
Dis-4: 0.1711
multi-modal-dialogue-generation-on-openvidialNV (w/ MI)
BLEU: 1.96
Dis-1: 0.0039
Dis-2: 0.0311
Dis-3: 0.0953
Dis-4: 0.163
multi-modal-dialogue-generation-on-openvidialFV (w/o MI)
BLEU: 1.99
Dis-1: 0.0056
Dis-2: 0.0431
Dis-3: 0.125
Dis-4: 0.2215

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts | Papers | HyperAI