Title :
Word-Level Tone Modeling for Mandarin Speech Recognition
Author :
Xin Lei ; Ostendorf, Mari
Author_Institution :
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
Abstract :
Standard HMM-based Mandarin speech recognition systems do not exploit the suprasegmental nature of tones, but explicit tone models can be incorporated with lattice rescoring. This work extends previous approaches to explicit tone modeling from the syllable level to the word level, incorporating a hierarchical backoff. Word-dependent tone models are trained to explicitly model the tone coarticulation within the word. For less frequent words, tonal-syllable-dependent tone models or plain tone models are used as backoff. More generally, context-dependent tone models can be used as backoff. The word-dependent tone modeling framework can be viewed as a generalization of the traditional context-independent and context-dependent tone modeling. Under this framework, different types of tone modeling strategies are compared experimentally on a Mandarin broadcast news speech recognition task, showing significant gains from the word-level tone modeling approach.
Keywords :
hidden Markov models; speech recognition; HMM; Mandarin broadcast news speech recognition; Mandarin speech recognition; context-independent tone modeling; lattice rescoring; tonal-syllable-dependent tone models; tone coarticulation; word-level tone modeling; Broadcasting; Context modeling; Dictionaries; Hidden Markov models; Labeling; Lattices; Maximum likelihood decoding; Neural networks; Speech recognition; Mandarin speech recognition; tone modeling; word prosody;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367000