• DocumentCode
    2839089
  • Title

    Feature masking in an embedded Mandarin speech recognition system

  • Author

    Tang, Yuezhong ; Wang, Xia ; Cao, Yang ; Ding, Feng

  • Author_Institution
    Nokia Res. Center, Beijing, China
  • fYear
    2004
  • fDate
    15-18 Dec. 2004
  • Firstpage
    245
  • Lastpage
    248
  • Abstract
    In this paper, we explored a feature component masking scheme for embedded tonal language recognition systems, in order to reduce the computational complexity with least degradation of recognition accuracy. We carried out a lot of experiments on a Mandarin isolated word recognition task with a tone-confusable vocabulary. With consideration of both clean and noisy conditions, we were able to find a masking scheme that filtered out 31 of 54 components and still outperformed the baseline with 54 components in the feature set, with dramatically less computational and memory complexity. The results showed that feature masking was a promising approach for complexity reduction in embedded tonal language recognition systems. The results also verified the effectiveness of higher order cepstral coefficients for tonal language recognition because most of them were preserved during the feature masking experiments.
  • Keywords
    cepstral analysis; natural languages; speech recognition; ASR; Mandarin isolated word recognition; automatic speech recognition; computational complexity reduction; embedded Mandarin speech recognition system; embedded tonal language recognition systems; feature component masking; feature set size; higher order cepstral coefficients; recognition accuracy; tone-confusable vocabulary; Automatic speech recognition; Cepstral analysis; Computational complexity; Databases; Degradation; Hardware; Natural languages; Speech recognition; Vocabulary; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2004 International Symposium on
  • Print_ISBN
    0-7803-8678-7
  • Type

    conf

  • DOI
    10.1109/CHINSL.2004.1409632
  • Filename
    1409632