مرکز منطقه ای اطلاع رساني علوم و فناوري - Noise-robust speech recognition with exemplar-based sparse representations using Alpha-Beta divergence

DocumentCode :

179567

Title :

Noise-robust speech recognition with exemplar-based sparse representations using Alpha-Beta divergence

Author :

Yilmaz, Emre ; Gemmeke, Jori F. ; Van hamme, Hugo

Author_Institution :

Dept. ESAT, KU Leuven, Leuven, Belgium

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

5502

Lastpage :

5506

Abstract :

In this paper, we investigate the performance of a noise-robust sparse representations (SR)-based recognizer using the Alpha-Beta (AB)-divergence to compare the noisy speech segments and exemplars. The baseline recognizer, which approximates noisy speech segments as a linear combination of speech and noise exemplars of variable length, uses the generalized Kullback-Leibler divergence to quantify the approximation quality. Incorporating a reconstruction error-based back-end, the recognition performance highly depends on the congruence of the divergence measure and used speech features. Having two tuning parameters, namely α and β, the AB-divergence provides improved robustness against background noise and outliers. These parameters can be adjusted for better performance depending on the distribution of speech and noise exemplars in the high-dimensional feature space. Moreover, various well-known distance/divergence measures such as the Euclidean distance, generalized Kullback-Leibler divergence, Itakura-Saito divergence and Hellinger distance are special cases of the AB-divergence for different (α, β) values. The goal of this work is to investigate the optimal divergence for mel-scaled magnitude spectral features by performing recognition experiments at several SNR levels using different (α, β) pairs. The results demonstrate the effectiveness of the AB-divergence compared to the generalized Kullback-Leibler divergence especially at the lower SNR levels.

Keywords :

noise; speech processing; speech recognition; AB-divergence; Euclidean distance; Hellinger distance; Itakura-Saito divergence; Kullback-Leibler divergence; SNR level; SR-based recognizer; alpha-beta divergence; approximation quality; baseline recognizer; divergence measure; exemplar-based sparse representation; high-dimensional feature space; mel-scaled magnitude spectral features; noise exemplar; noise-robust speech recognition; noisy speech segment; optimal divergence; recognition performance; reconstruction error; speech distribution; speech feature; Accuracy; Dictionaries; Noise; Noise measurement; Speech; Speech recognition; Vectors; alpha-beta divergence; exemplar-based speech recognition; noise-robustness; sparse representations;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854655

Filename :

6854655

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=179567