DocumentCode
1659357
Title
A study of multilingual speech features: perceptive scalogram based on wavelet analysis
Author
Hossain, Md Iqbal ; Liu, James ; Lee, Raymond
Author_Institution
Dept. of Comput., Hong Kong Polytech. Univ., Kowloon, Hong Kong
Volume
2
fYear
1999
fDate
1999
Firstpage
178
Abstract
In recent years, the speech features adopted by automatic speech recognition systems have been found to degrade under additive noise when the systems are used in real environments, and considerable effort has been devoted to finding robust features. In this paper, we propose the use of the perceptive scalogram, a time-scale distribution of the signal that approximates the perception of speech. It provides a three-dimensional representation of speech signal spectra, with time on the x-axis and frequency on the y-axis. Gray scales represent the third dimension of the wavelet transform data: either the amplitude (for scalograms) or the phase (for phase-shift diagrams). The approach takes into account that humans perceive speech and background noise differently and that speech signals are non-stationary. It uses wavelets as its analysis tool, which approximate some properties of hearing well. Computer simulations show that the proposed perceptive scalogram is very robust to additive white Gaussian noise for different languages (e.g. English, Putonghua, Cantonese).
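As an illustration of the amplitude and phase representations described in the abstract, the following is a minimal sketch of a plain (not perceptually weighted) scalogram and phase-shift diagram. The abstract does not name the wavelet or the perceptual model, so the Morlet wavelet, the parameter `w0`, and the helper names `morlet`, `cwt`, `scalogram`, `phase_diagram` and `to_gray` are all assumptions made for this example and should not be read as the authors' method.

```python
import cmath
import math


def morlet(t, w0=6.0):
    """Complex Morlet mother wavelet (assumed choice; small correction term omitted)."""
    return math.pi ** -0.25 * cmath.exp(1j * w0 * t) * math.exp(-0.5 * t * t)


def cwt(signal, scales, dt=1.0, w0=6.0):
    """Continuous wavelet transform by direct summation.

    Returns rows[scale_index][time_index] of complex coefficients.
    """
    n = len(signal)
    rows = []
    for s in scales:
        # The Gaussian envelope is negligible beyond about 4 scale widths.
        half = int(math.ceil(4.0 * s / dt))
        # Conjugated, dilated wavelet with 1/sqrt(s) normalisation.
        kernel = [morlet(k * dt / s, w0).conjugate() * dt / math.sqrt(s)
                  for k in range(-half, half + 1)]
        row = []
        for b in range(n):
            acc = 0j
            for k in range(-half, half + 1):
                i = b + k
                if 0 <= i < n:  # samples outside the signal count as zero
                    acc += signal[i] * kernel[k + half]
            row.append(acc)
        rows.append(row)
    return rows


def scalogram(rows):
    """Amplitude of the wavelet coefficients."""
    return [[abs(c) for c in r] for r in rows]


def phase_diagram(rows):
    """Phase of the wavelet coefficients, in radians within [-pi, pi]."""
    return [[cmath.phase(c) for c in r] for r in rows]


def to_gray(values):
    """Linearly map a 2-D list of reals onto gray levels 0..255."""
    lo = min(min(r) for r in values)
    hi = max(max(r) for r in values)
    span = (hi - lo) or 1.0
    return [[int(round(255 * (v - lo) / span)) for v in r] for r in values]
```

Calling `to_gray(scalogram(cwt(samples, scales)))` gives one row of gray levels per scale and one column per time sample; a perceptive variant would presumably choose the scales and weighting from an auditory model, which this sketch does not attempt.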
Keywords
AWGN; speech recognition; wavelet transforms; Gray scales; additive noise; automatic speech recognition system; computer simulations; multilingual speech features; perceptive scalogram; time-scale distribution; wavelet analysis; wavelet transform data; wavelets; white Gaussian noise; Additive noise; Automatic speech recognition; Background noise; Degradation; Frequency; Humans; Noise robustness; Speech enhancement; Wavelet analysis; Wavelet transforms
fLanguage
English
Publisher
IEEE
Conference_Titel
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location
Tokyo
ISSN
1062-922X
Print_ISBN
0-7803-5731-0
Type
conf
DOI
10.1109/ICSMC.1999.825229
Filename
825229