HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

A Unified Pre-training Framework for Conversational AI

Siqi Bao; Bingjin Chen; Huang He; Xin Tian; Han Zhou; Fan Wang; Hua Wu; Haifeng Wang; Wenquan Wu; Yingzhan Lin

A Unified Pre-training Framework for Conversational AI

Abstract

In this work, we explore the application of PLATO-2 on various dialogue systems, including open-domain conversation, knowledge grounded dialogue, and task-oriented conversation. PLATO-2 is initially designed as an open-domain chatbot, trained via two-stage curriculum learning. In the first stage, a coarse-grained response generation model is learned to fit the simplified one-to-one mapping relationship. This model is applied to the task-oriented conversation, given that the semantic mappings tend to be deterministic in task completion. In the second stage, another fine-grained generation model and an evaluation model are further learned for diverse response generation and coherence estimation, respectively. With superior capability on capturing one-to-many mapping, such models are suitable for the open-domain conversation and knowledge grounded dialogue. For the comprehensive evaluation of PLATO-2, we have participated in multiple tasks of DSTC9, including interactive evaluation of open-domain conversation (Track3-task2), static evaluation of knowledge grounded dialogue (Track3-task1), and end-to-end task-oriented conversation (Track2-task1). PLATO-2 has obtained the 1st place in all three tasks, verifying its effectiveness as a unified framework for various dialogue systems.

Code Repositories

PaddlePaddle/Knover
Official
paddle

Benchmarks

BenchmarkMethodologyMetrics
interactive-evaluation-of-dialog-on-dstc9PLATO-2
Coherent: 2.8017
Consistent: 0.9390
Diversity: 2.7441
Error Recovery: 2.7518
Flexible: 2.8000
Informative: 2.7881
Inquisitive: 2.7949
Likeable: 2.7878
Overall Human Rating: 4.15
Topic Depth: 2.7678
Understanding: 2.8285

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
A Unified Pre-training Framework for Conversational AI | Papers | HyperAI