DocumentCode
2839089
Title
Feature masking in an embedded Mandarin speech recognition system
Author
Tang, Yuezhong ; Wang, Xia ; Cao, Yang ; Ding, Feng
Author_Institution
Nokia Res. Center, Beijing, China
fYear
2004
fDate
15-18 Dec. 2004
Firstpage
245
Lastpage
248
Abstract
In this paper, we explored a feature component masking scheme for embedded tonal language recognition systems, in order to reduce the computational complexity with least degradation of recognition accuracy. We carried out a lot of experiments on a Mandarin isolated word recognition task with a tone-confusable vocabulary. With consideration of both clean and noisy conditions, we were able to find a masking scheme that filtered out 31 of 54 components and still outperformed the baseline with 54 components in the feature set, with dramatically less computational and memory complexity. The results showed that feature masking was a promising approach for complexity reduction in embedded tonal language recognition systems. The results also verified the effectiveness of higher order cepstral coefficients for tonal language recognition because most of them were preserved during the feature masking experiments.
Keywords
cepstral analysis; natural languages; speech recognition; ASR; Mandarin isolated word recognition; automatic speech recognition; computational complexity reduction; embedded Mandarin speech recognition system; embedded tonal language recognition systems; feature component masking; feature set size; higher order cepstral coefficients; recognition accuracy; tone-confusable vocabulary; Automatic speech recognition; Cepstral analysis; Computational complexity; Databases; Degradation; Hardware; Natural languages; Speech recognition; Vocabulary; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Chinese Spoken Language Processing, 2004 International Symposium on
Print_ISBN
0-7803-8678-7
Type
conf
DOI
10.1109/CHINSL.2004.1409632
Filename
1409632
Link To Document