مرکز منطقه ای اطلاع رساني علوم و فناوري - A direct-concatenation approach to train hidden Markov models to recognize the highly confusing Mandarin syllables with very limited training data

DocumentCode :

918751

Title :

A direct-concatenation approach to train hidden Markov models to recognize the highly confusing Mandarin syllables with very limited training data

Author :

Liu, Fu-hua ; Lee, Yumin ; Lee, Lin-shan

Author_Institution :

Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan

Volume :

Issue :

fYear :

1993

fDate :

1/1/1993 12:00:00 AM

Firstpage :

113

Lastpage :

119

Abstract :

The recognition of a total of 408 very confusing Mandarin syllables is very difficult because this vocabulary consists of 38 confusing sets, each of which can have as many as 19 syllables. The recognition of these 408 syllables becomes even more difficult when only very limited training data are available. A special direct-concatenation approach for training hidden Markov models (HMMs) to recognize these syllables with very limited training data is developed in which each syllable is divided into INITIAL and FINAL parts and 408 right-context-dependent INITIAL HMMs and 38 left-context-independent FINAL HMMs are separately trained and the transition region carefully taken account of, and then these INITIAL and FINAL HMMs are directly concatenated to form syllable recognition. Experimental results show that this approach can utilize the very limited training data most efficiently and provide significant improvements in recognition performance. Although the results are obtained for Mandarin syllables, the approach is believed to be equally helpful for the recognition of other confusing vocabularies

Keywords :

hidden Markov models; speech recognition; Mandarin syllables recognition, HMM; confusing vocabularies; direct-concatenation approach; hidden Markov models; training data; Acoustic distortion; Hidden Markov models; Interpolation; Signal synthesis; Signal to noise ratio; Speech recognition; Speech synthesis; Testing; Training data; Vocabulary;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.221375

Filename :

221375

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=918751