مرکز منطقه ای اطلاع رساني علوم و فناوري - A dynamic in-search discriminative training approach for large vocabulary speech recognition

DocumentCode :

542181

Title :

A dynamic in-search discriminative training approach for large vocabulary speech recognition

Author :

Jiang, Hui ; Siohan, Olivier ; Soong, Frank K. ; Lee, Chin-Hui

Author_Institution :

Dialogue Systems Research, Multimedia Communication Research Lab, Bell Labs, Lucent Technologies, Murray Hill, NJ 07974, USA

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

In this paper, we propose a dynamic in-search discriminative training approach of a large-scale HMM model for large vocabulary speech recognition. A previously proposed data selection method is used to choose competing hypotheses dynamically during Viterbi beam search procedure. Particularly, all active word-ending paths are examined during search with reference transcription to identify competing tokens for different HMM´s. Then HMMs are re-estimated based on an GPD-based discriminative training to minimize total number of possible error tokens among all collected competing tokens. In this way, recognition errors, e.g., word error rate, in training data can be reduced indirectly. The proposed approach is flexible enough to run in a batch or incremental mode. Also, the method can efficiently be implemented to process large amount of training data and update a large-scale state-tied HMM: set for large vocabulary recognition tasks. Some preliminary results on DARPA communicator task show the new discriminative training method can improve recognition performance over our best ML-trained system.

Keywords :

Decoding; Hidden Markov models; Markov processes; Speech; Training; Viterbi algorithm; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743667

Filename :

5743667

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542181