DocumentCode :
1761008
Title :
Optimisation of multiple feature stream weights for distributed speech processing in mobile environments
Author :
Djamel, Addou ; Sid-Ahmed, Selouani
Author_Institution :
Fac. of Electron. & Comput. Sci., USTHB, Algiers, Algeria
Volume :
9
Issue :
4
fYear :
2015
fDate :
6 2015
Firstpage :
387
Lastpage :
394
Abstract :
Mobile environments are highly influenced by ambient noise that can cause a significant deterioration in speech recognition performance. In this study, a new framework integrating a noise-robust frontend (FE) in distributed speech recognition (DSR) is presented. Using the Aurora-2 speech database, the authors evaluate the impact of the proposed multidimensional acoustical analysis on the performance of the Mel-frequency-based European Telecommunications Standards Institute-advanced FE (AFE) combined with the Mel-line spectral frequencies (MLSFs) robust features for highly noisy speech. The stream weights of the resulting multi-stream hidden Markov models are optimised automatically by deploying a novel approach based on a discriminative model combination. Finally, these features are effectively transformed and reduced using the Karhunen-Loève transform. The proposed MLSF-based FE (MLSF-FE) is shown to exhibit a reduction in the relative error rate. Moreover, the proposed FE provides comparable recognition performance to the current DSR-AFE available in global system of mobile communications.
Keywords :
Karhunen-Loeve transforms; cellular radio; hidden Markov models; speech recognition; Aurora-2 speech database; DSR; Karhunen-Loeve transform; MLSF robust features; MLSF-FE; MLSF-based FE; Mel-frequency-based European Telecommunications Standards Institute-advanced FE; Mel-line spectral frequencies; ambient noise; discriminative model combination; distributed speech processing; distributed speech recognition; global system of mobile communications; highly-noisy speech; mobile environments; multidimensional acoustical analysis; multiple-feature stream weight optimisation; multistream hidden Markov model; noise-robust FE; noise-robust frontend; relative error rate; speech recognition performance;
fLanguage :
English
Journal_Title :
Signal Processing, IET
Publisher :
iet
ISSN :
1751-9675
Type :
jour
DOI :
10.1049/iet-spr.2013.0392
Filename :
7122408
Link To Document :
بازگشت