مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimisation of multiple feature stream weights for distributed speech processing in mobile environments

DocumentCode :

1761008

Title :

Optimisation of multiple feature stream weights for distributed speech processing in mobile environments

Author :

Djamel, Addou ; Sid-Ahmed, Selouani

Author_Institution :

Fac. of Electron. & Comput. Sci., USTHB, Algiers, Algeria

Volume :

Issue :

fYear :

2015

fDate :

6 2015

Firstpage :

387

Lastpage :

394

Abstract :

Mobile environments are highly influenced by ambient noise that can cause a significant deterioration in speech recognition performance. In this study, a new framework integrating a noise-robust frontend (FE) in distributed speech recognition (DSR) is presented. Using the Aurora-2 speech database, the authors evaluate the impact of the proposed multidimensional acoustical analysis on the performance of the Mel-frequency-based European Telecommunications Standards Institute-advanced FE (AFE) combined with the Mel-line spectral frequencies (MLSFs) robust features for highly noisy speech. The stream weights of the resulting multi-stream hidden Markov models are optimised automatically by deploying a novel approach based on a discriminative model combination. Finally, these features are effectively transformed and reduced using the Karhunen-Loève transform. The proposed MLSF-based FE (MLSF-FE) is shown to exhibit a reduction in the relative error rate. Moreover, the proposed FE provides comparable recognition performance to the current DSR-AFE available in global system of mobile communications.

Keywords :

Karhunen-Loeve transforms; cellular radio; hidden Markov models; speech recognition; Aurora-2 speech database; DSR; Karhunen-Loeve transform; MLSF robust features; MLSF-FE; MLSF-based FE; Mel-frequency-based European Telecommunications Standards Institute-advanced FE; Mel-line spectral frequencies; ambient noise; discriminative model combination; distributed speech processing; distributed speech recognition; global system of mobile communications; highly-noisy speech; mobile environments; multidimensional acoustical analysis; multiple-feature stream weight optimisation; multistream hidden Markov model; noise-robust FE; noise-robust frontend; relative error rate; speech recognition performance;

fLanguage :

English

Journal_Title :

Signal Processing, IET

Publisher :

iet

ISSN :

1751-9675

Type :

jour

DOI :

10.1049/iet-spr.2013.0392

Filename :

7122408

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1761008