• DocumentCode
    1818239
  • Title
    A continuous speaker-independent putonghua dictation system
  • Author
    Chen, C.J. ; Gopinath, R.A. ; Monkowski, M.D. ; Picheny, M.A.
  • Author_Institution
    IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    14-18 Oct 1996
  • Firstpage
    821
  • Abstract
    We describe new methods for continuous putonghua speech recognition. We have augmented the IBM HMM-based continuous speech recognition system with the following features. First, we treat tones in putonghua as attributes of certain phonemes rather than of syllables; we call these tone-bearing phonemes tonemes. Second, instantaneous pitch is treated as a variable in the acoustic feature vector, in the same way as cepstra or energy. Third, a set of word-segmentation rules converts continuous Chinese text into segmented text, so that the trigram language model works effectively. With these methods, a speaker-independent, very-large-vocabulary continuous putonghua dictation system can be constructed.
  • Keywords
    acoustic signal processing; decoding; dictation; feature extraction; grammars; hidden Markov models; natural languages; speech coding; speech recognition; speech synthesis; vector quantisation; HMM based continuous speech recognition; IBM; VQ; acoustic feature vector; cepstra; continuous Chinese text; continuous putonghua speech recognition; continuous speaker-independent putonghua dictation system; energy; instantaneous pitch; phonemes; segmented text; tonemes; tones; trigram language model; word segmentation rules; Hoses; Modems; Natural languages; Quantization; Speech recognition; Terminology
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing, 1996., 3rd International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-2912-0
  • Type
    conf
  • DOI
    10.1109/ICSIGP.1996.567389
  • Filename
    567389
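
The third point of the abstract, segmenting continuous Chinese text so that a trigram language model can be built over words, can be illustrated with a rough sketch. The abstract does not give the actual segmentation rules, so the code below substitutes greedy forward maximum matching against a small lexicon as a stand-in. The function names `segment` and `trigram_counts`, the `max_len` parameter, the boundary markers, and the toy lexicon are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter


def segment(text, lexicon, max_len=4):
    """Split unsegmented text into words by greedy forward maximum matching.

    This is a stand-in for the paper's rule set, which the abstract does not
    describe. Characters not covered by the lexicon become one-character words.
    """
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a lexicon hit;
        # a single character is always accepted as a fallback.
        for n in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + n]
            if n == 1 or candidate in lexicon:
                words.append(candidate)
                i += n
                break
    return words


def trigram_counts(words):
    """Count word trigrams, padding with sentence-boundary markers."""
    padded = ["<s>", "<s>"] + words + ["</s>"]
    return Counter(zip(padded, padded[1:], padded[2:]))


# Hypothetical toy lexicon; a real system would use a very large vocabulary.
lexicon = {"我们", "描述", "语音", "识别", "语音识别", "系统"}
words = segment("我们描述语音识别系统", lexicon)
counts = trigram_counts(words)
print(words)
print(counts.most_common(3))
```

Once the text is split into words, trigram statistics are collected over word tokens rather than over individual characters, which is the reason segmentation has to come before language-model training in this kind of pipeline.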