DocumentCode :
3244440
Title :
Low memory decision tree method for text-to-phoneme mapping
Author :
Suontausta, Janne ; Tian, Jilei
Author_Institution :
Audio-Visual Syst. Lab., Nokia Res. Center, Tampere, Finland
fYear :
2003
fDate :
30 Nov.-3 Dec. 2003
Firstpage :
135
Lastpage :
140
Abstract :
Pronunciation models are commonly used in automatic speech recognition (ASR) as well as in text-to-speech (TTS) applications. Decision tree (DT) and neural network (NN) methods have been used for modeling the languages with irregular pronunciation. The DT based methods are usually more accurate and therefore they provide better recognition accuracy than the NN based methods. The main drawback of the DT based methods is their large memory footprint. In the paper, we propose three methods, i.e.: (1) a clipping approach to reduce the pronunciation variability in the aligned dictionary; (2) a revised DT structure; and (3) a Huffman coding scheme for efficient DT parameter storing that can jointly be applied to minimize the memory footprint of the DT models. The results obtained in the simulation experiments indicate that the memory requirements of the DT models can significantly be reduced without degrading the mapping accuracy. The applicability of the approach is also verified in the speech recognition experiments.
Keywords :
Huffman codes; decision trees; speech processing; speech recognition; speech synthesis; ASR; DT parameter storing; Huffman coding; TTS applications; aligned dictionary; automatic speech recognition; clipping approach; irregular pronunciation; low memory decision tree; memory footprint minimization; pronunciation models; pronunciation variability; revised DT structure; speech recognition; text-to-phoneme mapping; text-to-speech applications; Audio-visual systems; Automatic speech recognition; Decision trees; Engines; Laboratories; Natural languages; Neural networks; Speech recognition; Speech synthesis; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
Type :
conf
DOI :
10.1109/ASRU.2003.1318417
Filename :
1318417
Link To Document :
بازگشت