Speech Synthesis On North American English
评估指标
Mean Opinion Score
评测结果
各个模型在此基准测试上的表现结果
| Paper Title | Repository | ||
|---|---|---|---|
| Tacotron 2 | 4.526 | Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions | |
| WaveNet (Linguistic) | 4.341 | Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions | |
| WaveNet (L+F) | 4.21 | WaveNet: A Generative Model for Raw Audio | |
| Tacotron | 4.001 | Tacotron: Towards End-to-End Speech Synthesis | |
| HMM-driven concatenative | 3.86 | WaveNet: A Generative Model for Raw Audio | |
| LSTM-RNN parametric | 3.67 | WaveNet: A Generative Model for Raw Audio | |
| means | 0 | Merging $K$-means with hierarchical clustering for identifying general-shaped groups | - |
0 of 7 row(s) selected.