• DocumentCode
    454574
  • Title

    Automatic Derivation of a Phoneme Set with Tone Information for Chinese Speech Recognition Based on Mutual Information Criterion

  • Author

    Zhang, Jin-Song ; Hu, Xin-Hui ; Nakamura, Satoshi

  • Author_Institution
    ATR Spoken Language Commun. Res. Lab., Kyoto
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    An appropriate approach to model tone information is helpful for building Chinese large vocabulary continuous speech recognition system. We propose to derive an efficient phoneme set of tone-dependent sub-word units to build a recognition system, by iteratively merging a pair of tone-dependent units according to the principle of minimal loss of the mutual information. The mutual information is measured between the word tokens and their phoneme transcriptions in a training text corpus, based on the system lexical and language model. The approach has the capability to keep discriminative tonal (and phoneme) contrasts that are most helpful for disambiguating homophone words due to lack of tones, and merge those tonal (and phoneme) contrasts that are not important for word disambiguation for the recognition task. This enable a flexible selection of phoneme set according to a balance between the MI information amount and the number of phonemes. We applied the method to traditional phoneme set of initial/finals, and derived several phoneme sets with different number of units. Speech recognition experiments using the derived sets showed their effectiveness
  • Keywords
    natural languages; speech recognition; Chinese speech recognition; large vocabulary continuous speech recognition; mutual information criterion; phoneme set; tone information; tone-dependent sub-word units; Cities and towns; Decoding; Hidden Markov models; Merging; Mutual information; Natural languages; Power system modeling; Robustness; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660026
  • Filename
    1660026