HyperAI
HyperAI超神经
Speech Recognition on LRS3-TED
Evaluation metric
Word Error Rate (WER)
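
For readers unfamiliar with the metric, WER is conventionally defined as the number of word-level substitutions, deletions, and insertions needed to turn the hypothesis into the reference, divided by the number of reference words. The sketch below illustrates that definition only. The function name `word_error_rate` is hypothetical, and splitting on whitespace is an assumption; the papers listed on this page may apply their own text normalization before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative sketch: tokenization is plain whitespace splitting, which is
    an assumption and not necessarily what any listed paper uses.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

The function returns a fraction; leaderboards commonly report the same quantity multiplied by 100. Because insertions are counted, WER can exceed 1.0 when the hypothesis is much longer than the reference.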
Evaluation results
Performance of each model on this benchmark
| Model | Word Error Rate (WER, %) | Paper Title |
| --- | --- | --- |
| RAVEn Large | 1.4 | Jointly Learning Visual and Auditory Speech Representations from Raw Data |
| AV-HuBERT Large | 1.3 | Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction |
| Llama-AVSR | 0.81 | Large Language Models are Strong Audio-Visual Speech Recognition Learners |
| Whisper | 0.68 | Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation |