DocumentCode :
2262927
Title :
Noisy word recognition using a feature based on ternarized spectral slope
Author :
Umeno, Masayoshi ; Funada, T. ; Nomura, Hideyuki
Author_Institution :
Kanazawa Univ., Kanazawa
fYear :
2007
fDate :
17-19 Oct. 2007
Firstpage :
1575
Lastpage :
1579
Abstract :
In previous paper, we proposed a feature FTTSS (Fourier transform of ternarized spectral slope) based on power spectrum derivatives with regard to frequency to develop a robust word recognition system under noisy environments, and we confirmed noise robustness of the feature compared with MFCC by applying it to word recognition with HMM. Generally, word recognition with HMM is improved by adding features that may express temporal variations, such as DeltaMFCC or DeltaFTTSS, because HMM can deal with only piecewise stationary signals. Actually, we have examined effectiveness of using DeltaFTTSS in word recognition. It is supposed that features showing raw temporal variations of spectral power are effective in speech recognition and ternary conversion of features may decrease deteriorations of recognition performance by noise corruption. Therefore in this research, we propose a new feature FTTTS (Fourier transform of ternarized temporal slope) instead of DeltaFTTSS. The FTTTS is defined by Fourier transform along frequency of smoothed ternarized temporal variations of spectral power at specific frequency. As a result, we have confirmed experimentally that the proposed feature FTTTS have noise robustness for SNR 0-20 dB compared with FTTSS+DeltaFTTSS or the conventional feature MFCC+DeltaMFCC by applying them to word recognition with HMM.
Keywords :
Fourier transforms; speech recognition; Fourier transform; noisy word recognition; smoothed ternarized temporal variation; ternarized spectral slope; Information technology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Information Technologies, 2007. ISCIT '07. International Symposium on
Conference_Location :
Sydney,. NSW
Print_ISBN :
978-1-4244-0976-1
Electronic_ISBN :
978-1-4244-0977-8
Type :
conf
DOI :
10.1109/ISCIT.2007.4392268
Filename :
4392268
Link To Document :
بازگشت