مرکز منطقه ای اطلاع رساني علوم و فناوري - Region dependent linear transforms in multilingual speech recognition

DocumentCode :

3648288

Title :

Region dependent linear transforms in multilingual speech recognition

Author :

Martin Karafiát;Miloš Janda;Jan Černocký;Lukáš Burget

Author_Institution :

Brno University of Technology, Speech@FIT, Czech Republic

fYear :

2012

fDate :

3/1/2012 12:00:00 AM

Firstpage :

4885

Lastpage :

4888

Abstract :

In today´s speech recognition systems, linear or nonlinear transformations are usually applied to post-process speech features forming input to HMM based acoustic models. In this work, we experiment with three popular transforms: HLDA, MPE-HLDA and Region Dependent Linear Transforms (RDLT), which are trained jointly with the acoustic model to extract maximum of the discriminative information from the raw features and to represent it in a form suitable for the following GMM-HMM based acoustic model. We focus on multi-lingual environments, where limited resources are available for training recognizers of many languages. Using data from GlobalPhone database, we show that, under such restrictive conditions, the feature transformations can be advantageously shared across languages and robustly trained using data from several languages.

Keywords :

"Training","Speech recognition","Acoustics","Hidden Markov models","Speech","Transforms","Feature extraction"

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Type :

conf

DOI :

10.1109/ICASSP.2012.6289014

Filename :

6289014

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3648288