Command Palette
Search for a command to run...
MOSS: Text-to-Spoken Dialogue Generation
1. Tutorial Introduction

This tutorial uses a single RTX 5090 card as the resource.
2. Project Examples

3. Operation steps
1. After starting the container, click the API address to enter the Web interface

2. Usage steps
If "Bad Gateway" is displayed, it means that the model is initializing. Since the model is large, please wait for about 2-3 minutes and refresh the page. When using the Safari browser, the audio may not be played directly and needs to be downloaded before playing.
*This tutorial allows you to choose between single-player audio generation (Single) and two-player dialogue audio generation (Role) in the "Audio Input Mode".


Citation Information
The citation information for this project is as follows:
@article{moss2025ttsd,
title={Text to Spoken Dialogue Generation},
author={OpenMOSS Team},
year={2025}
}Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.