HyperAIHyperAI

Command Palette

Search for a command to run...

3 months ago

TUKE at MediaEval 2015 QUESST

{Milan Rusko Jozef Juhár Matúš Pleva Martin Lojka Peter Viszlay Jozef Vavrek}

TUKE at MediaEval 2015 QUESST

Abstract

In this paper, we present our retrieving system for QUery by Example Search on Speech Task (QUESST), comprising the posteriorgram-based modeling approach along with the weighted fast sequential dynamic time warping algorithm (WFS-DTW). For this year, our main effort was directed toward developing language-dependent keyword matching system, utilizing all available information about spoken languages, considering all queries and utterance files. Despite the fact that the retrieving algorithm is the same as we used in previous year, a big novelty resides in the way of utilizing the information about all languages spoken in the retrieving database. Two low-resource systems using language dependent acoustic unit modeling (AUM) approaches have been submitted. The first one, called supervised, employs four well-trained phonetic decoders using acoustic models trained on time-aligned and annotated speech. The second one, defined as unsupervised, uses blind phonetic segmentation for the specific language where the information about spoken language is extracted from Mediaeval 2013 and Mediaeval 2014 databases. Considering the influence on the overall retrieving performance, the acoustic model adaptation to the specific language through retraining procedure was investigated for both approaches as well.

Benchmarks

BenchmarkMethodologyMetrics
keyword-spotting-on-quesstTUKE g-U late submission (eval)
ATWV: 0.028
Cnxe: 0.974
MTWV: 0.032
MinCnxe: 0.954
keyword-spotting-on-quesstTUKE p-S (eval)
ATWV: 0.002
Cnxe: 0.971
MTWV: 0.022
MinCnxe: 0.953
keyword-spotting-on-quesstTUKE p-S late submission (eval)
ATWV: 0.046
Cnxe: 0.963
MTWV: 0.049
MinCnxe: 0.940
keyword-spotting-on-quesstTUKE p-S (dev)
ATWV: 0.022
Cnxe: 0.970
ISF: 2.312
MTWV: 0.036
MinCnxe: 0.947
PL: 0.068
PMUi: 0.250
PMUs: 1.874
SSF: 0.0061
keyword-spotting-on-quesstTUKE g-U (dev)
ATWV: 0.0001
Cnxe: 0.974
ISF: 0.383
MTWV: 0.031
MinCnxe: 0.953
PL: 0.033
PMUi: 0.515
PMUs: 2.292
SSF: 0.0066
keyword-spotting-on-quesstTUKE p-S late submission (dev)
ATWV: 0.055
Cnxe: 0.962
MTWV: 0.059
MinCnxe: 0.940
keyword-spotting-on-quesstTUKE g-U late submission (dev)
ATWV: 0.032
Cnxe: 0.970
MTWV: 0.035
MinCnxe: 0.951
keyword-spotting-on-quesstTUKE g-U (eval)
ATWV: -0.01
Cnxe: 0.973
MTWV: 0.023
MinCnxe: 0.953

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
TUKE at MediaEval 2015 QUESST | Papers | HyperAI