DocumentCode :
2703113
Title :
Word-Level Tone Modeling for Mandarin Speech Recognition
Author :
Xin Lei ; Ostendorf, Mari
Author_Institution :
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Standard HMM-based Mandarin speech recognition systems do not exploit the suprasegmental nature of tones, but explicit tone models can be incorporated with lattice rescoring. This work extends previous approaches to explicit tone modeling from the syllable level to the word level, incorporating a hierarchical backoff. Word-dependent tone models are trained to explicitly model the tone coarticulation within the word. For less frequent words, tonal-syllable-dependent tone models or plain tone models are used as backoff. More generally, context-dependent tone models can be used as backoff. The word-dependent tone modeling framework can be viewed as a generalization of the traditional context-independent and context-dependent tone modeling. Under this framework, different types of tone modeling strategies are compared experimentally on a Mandarin broadcast news speech recognition task, showing significant gains from the word-level tone modeling approach.
Keywords :
hidden Markov models; speech recognition; HMM; Mandarin broadcast news speech recognition; Mandarin speech recognition; context-independent tone modeling; lattice rescoring; tonal-syllable-dependent tone models; tone coarticulation; word-level tone modeling; Broadcasting; Context modeling; Dictionaries; Hidden Markov models; Labeling; Lattices; Maximum likelihood decoding; Neural networks; Speech recognition; Mandarin speech recognition; tone modeling; word prosody;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367000
Filename :
4218188
Link To Document :
بازگشت