DocumentCode :
3396940
Title :
Speech recognition using three channel redundant wavelet filterbank
Author :
Tohidypour, Hamid Reza ; Seyyedsalehi, Seyyed Ali ; Roshandel, Hossein ; Behbood, Hossein
Author_Institution :
Dept. Biomed. Eng., Amirkabir Univ., Tehran, Iran
Volume :
2
fYear :
2010
fDate :
30-31 May 2010
Firstpage :
325
Lastpage :
328
Abstract :
Although Wavelet Transform has multi resolution properties, it is not optimized for speech recognition. This paper presents redundant discrete wavelet-based speech representations, which owing to much less shift-sensitivity are better for speech recognition tasks, in contrast with two-channel Discrete wavelet transform. However, this improvement is at the expense of higher redundancy. In this paper, three types of wavelet features are presented, including a combination of critically sampled Discrete Wavelet and 3-channel redundant filter banks with down-sampling by 2. For comparing different methods, time-delay neural networks are implemented. Using different mother wavelets for feature extraction improve speech recognition rates. It is shown that redundant three-channel wavelet filter banks work better in speech recognition.
Keywords :
discrete wavelet transforms; neural nets; speech recognition; 3-channel redundant filter banks; feature extraction; multi resolution properties; redundant discrete wavelet; redundant three-channel wavelet filter banks; speech recognition; speech representation; three channel redundant wavelet; time-delay neural networks; two-channel discrete wavelet transform; Biomedical engineering; Discrete wavelet transforms; Feature extraction; Filter bank; Humans; Mel frequency cepstral coefficient; Neural networks; Speech recognition; Wavelet packets; Wavelet transforms; FARSDAT Database; Frame Wavelet; Redundant Filter bank; Speech Recognition; Time Delay Neural Network (TDNN); Wavelet Transform (WT);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Industrial Mechatronics and Automation (ICIMA), 2010 2nd International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-7653-4
Type :
conf
DOI :
10.1109/ICINDMA.2010.5538303
Filename :
5538303
Link To Document :
بازگشت