Command Palette
Search for a command to run...
{Milan Rusko Jozef Juhár Matúš Pleva Martin Lojka Peter Viszlay Jozef Vavrek}

Abstract
In this paper, we present our retrieving system for QUery by Example Search on Speech Task (QUESST), comprising the posteriorgram-based modeling approach along with the weighted fast sequential dynamic time warping algorithm (WFS-DTW). For this year, our main effort was directed toward developing language-dependent keyword matching system, utilizing all available information about spoken languages, considering all queries and utterance files. Despite the fact that the retrieving algorithm is the same as we used in previous year, a big novelty resides in the way of utilizing the information about all languages spoken in the retrieving database. Two low-resource systems using language dependent acoustic unit modeling (AUM) approaches have been submitted. The first one, called supervised, employs four well-trained phonetic decoders using acoustic models trained on time-aligned and annotated speech. The second one, defined as unsupervised, uses blind phonetic segmentation for the specific language where the information about spoken language is extracted from Mediaeval 2013 and Mediaeval 2014 databases. Considering the influence on the overall retrieving performance, the acoustic model adaptation to the specific language through retraining procedure was investigated for both approaches as well.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| keyword-spotting-on-quesst | TUKE g-U late submission (eval) | ATWV: 0.028 Cnxe: 0.974 MTWV: 0.032 MinCnxe: 0.954 |
| keyword-spotting-on-quesst | TUKE p-S (eval) | ATWV: 0.002 Cnxe: 0.971 MTWV: 0.022 MinCnxe: 0.953 |
| keyword-spotting-on-quesst | TUKE p-S late submission (eval) | ATWV: 0.046 Cnxe: 0.963 MTWV: 0.049 MinCnxe: 0.940 |
| keyword-spotting-on-quesst | TUKE p-S (dev) | ATWV: 0.022 Cnxe: 0.970 ISF: 2.312 MTWV: 0.036 MinCnxe: 0.947 PL: 0.068 PMUi: 0.250 PMUs: 1.874 SSF: 0.0061 |
| keyword-spotting-on-quesst | TUKE g-U (dev) | ATWV: 0.0001 Cnxe: 0.974 ISF: 0.383 MTWV: 0.031 MinCnxe: 0.953 PL: 0.033 PMUi: 0.515 PMUs: 2.292 SSF: 0.0066 |
| keyword-spotting-on-quesst | TUKE p-S late submission (dev) | ATWV: 0.055 Cnxe: 0.962 MTWV: 0.059 MinCnxe: 0.940 |
| keyword-spotting-on-quesst | TUKE g-U late submission (dev) | ATWV: 0.032 Cnxe: 0.970 MTWV: 0.035 MinCnxe: 0.951 |
| keyword-spotting-on-quesst | TUKE g-U (eval) | ATWV: -0.01 Cnxe: 0.973 MTWV: 0.023 MinCnxe: 0.953 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.