HyperAIHyperAI

Command Palette

Search for a command to run...

MOSS: Text-to-Spoken Dialogue Generation

1. Tutorial Introduction

Build

This tutorial uses a single RTX 5090 card as the resource.

2. Project Examples

3. Operation steps

1. After starting the container, click the API address to enter the Web interface

2. Usage steps

If "Bad Gateway" is displayed, it means that the model is initializing. Since the model is large, please wait for about 2-3 minutes and refresh the page. When using the Safari browser, the audio may not be played directly and needs to be downloaded before playing.

*This tutorial allows you to choose between single-player audio generation (Single) and two-player dialogue audio generation (Role) in the "Audio Input Mode".

Citation Information

The citation information for this project is as follows:

@article{moss2025ttsd,
  title={Text to Spoken Dialogue Generation}, 
  author={OpenMOSS Team},
  year={2025}
}

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
MOSS: Text-to-Spoken Dialogue Generation | Tutorials | HyperAI