Automatic Derivation of a Phoneme Set with Tone Information for Chinese Speech Recognition Based on Mutual Information Criterion

Author

Zhang, Jin-Song ; Hu, Xin-Hui ; Nakamura, Satoshi

Author_Institution

ATR Spoken Language Commun. Res. Lab., Kyoto

Volume

1

fYear

2006

fDate

14-19 May 2006

Abstract

An appropriate approach to model tone information is helpful for building Chinese large vocabulary continuous speech recognition system. We propose to derive an efficient phoneme set of tone-dependent sub-word units to build a recognition system, by iteratively merging a pair of tone-dependent units according to the principle of minimal loss of the mutual information. The mutual information is measured between the word tokens and their phoneme transcriptions in a training text corpus, based on the system lexical and language model. The approach has the capability to keep discriminative tonal (and phoneme) contrasts that are most helpful for disambiguating homophone words due to lack of tones, and merge those tonal (and phoneme) contrasts that are not important for word disambiguation for the recognition task. This enable a flexible selection of phoneme set according to a balance between the MI information amount and the number of phonemes. We applied the method to traditional phoneme set of initial/finals, and derived several phoneme sets with different number of units. Speech recognition experiments using the derived sets showed their effectiveness

Keywords

natural languages; speech recognition; Chinese speech recognition; large vocabulary continuous speech recognition; mutual information criterion; phoneme set; tone information; tone-dependent sub-word units; Cities and towns; Decoding; Hidden Markov models; Merging; Mutual information; Natural languages; Power system modeling; Robustness; Speech recognition; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location

Toulouse

ISSN

1520-6149

Print_ISBN

1-4244-0469-X

Type

conf

DOI

10.1109/ICASSP.2006.1660026

Filename

1660026