• DocumentCode
    336798
  • Title

    A segment-based C0 adaptation scheme for PMC-based noisy Mandarin speech recognition

  • Author

    Hong, Wei-Tyng ; Chen, Sin-Horng

  • Author_Institution
    Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
  • Volume
    1
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    433
  • Abstract
    A segment-based C0 (the zero-th order of cepstral coefficient) adaptation scheme for PMC-based Mandarin speech recognition is proposed in this paper. It incorporates a new C0 model of speech signal into the PMC method to improve the gain matching between the clean-speech HMM models and the current noise model. The C0 model is constructed in the training phase by jointly modeling the normalized C0 with other MFCC recognition features to form C0-normalized HMM models. In the testing phase, it pre-segments the input utterance into syllable-like segments, performs C 0-denormalization operations to expand the C0-normalized HMM models, and uses them in the PMC method. Compared with the conventional PMC method, the proposed method can achieve a much better noise compensation effect due to the use of more precise gain matching in the PMC model combination. Experimental results showed that the base-syllable accuracy rate was significantly upgraded for continuous noisy Mandarin speech recognition
  • Keywords
    cepstral analysis; natural languages; noise; speech recognition; C0-denormalization; C0-normalized HMM models; MFCC recognition features; PMC-based noisy Mandarin speech recognition; base-syllable accuracy rate; cepstral coefficient; clean-speech HMM models; continuous noisy Mandarin speech recognition; experimental results; gain matching; input utterance; noise compensation; noise model; normalized C0; parallel model combination; segment-based C0 adaptation; speech signal; syllable-like segments; testing phase; training phase; zero-th order; Acoustic noise; Acoustic testing; Energy states; Hidden Markov models; Mel frequency cepstral coefficient; Noise generators; Performance evaluation; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.758155
  • Filename
    758155