Title :
The usage of wavelet packet transformation in automatic noisy speech recognition systems
Author :
Kotnik, Bojan ; Kacic, Zdravko ; Horvat, Bogomir
Author_Institution :
Electr. Eng. & Comput. Sci. Fac., Maribor Univ., Slovenia
Abstract :
This paper presents a noise-robust speech feature extraction algorithm based on wavelet packet decomposition (WPD) of the speech signal. In contrast to the time-frequency representation based on the short-time Fourier transform (STFT), a computationally efficient WPD can represent both stationary segments (vowel phonemes) and non-stationary segments (consonants) of the speech signal well. A novel wavelet function is developed and presented for the proposed WPD scheme. Noise robustness is improved by applying the proposed wavelet-based denoising algorithm with a modified soft thresholding procedure. Principal component analysis (PCA) is used to decorrelate the feature vector elements and to reduce the dimensionality of the final feature vector. Automatic speech recognition results on the Aurora 3 database show a performance improvement over the standardized mel-frequency cepstral coefficients (MFCC) feature extraction algorithm.
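The processing chain named in the abstract (WPD, soft thresholding, PCA) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not specify the novel wavelet function or the modified thresholding rule, so the sketch assumes a Haar filter pair and the standard soft thresholding rule as stand-ins. The function names, the log subband energy features, and the parameters `levels`, `t`, and `k` are all hypothetical choices made for illustration.

```python
import numpy as np

def haar_wpd(x, levels):
    """Full wavelet packet tree using the Haar filter pair (an assumed
    stand-in for the paper's wavelet). Returns the leaf subbands in
    natural (not frequency) order. len(x) must be divisible by 2**levels."""
    nodes = [np.asarray(x, dtype=float)]
    for _ in range(levels):
        nxt = []
        for n in nodes:
            even, odd = n[0::2], n[1::2]
            nxt.append((even + odd) / np.sqrt(2.0))  # low-pass branch
            nxt.append((even - odd) / np.sqrt(2.0))  # high-pass branch
        nodes = nxt
    return nodes

def soft_threshold(c, t):
    """Standard soft thresholding: shrink magnitudes by t, zero the rest.
    The paper's modified rule is not given in this record."""
    return np.sign(c) * np.maximum(np.abs(c) - t, 0.0)

def frame_features(frame, levels=3, t=0.1):
    """Log energy of each denoised subband (a hypothetical feature choice)."""
    bands = [soft_threshold(b, t) for b in haar_wpd(frame, levels)]
    return np.array([np.log(np.sum(b ** 2) + 1e-10) for b in bands])

def pca_reduce(features, k):
    """Project the rows of `features` onto the top-k principal components."""
    centered = features - features.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:k].T

# Usage on synthetic frames (random data, not speech)
rng = np.random.default_rng(0)
frames = rng.standard_normal((20, 64))
feats = np.array([frame_features(f) for f in frames])
reduced = pca_reduce(feats, k=3)
```

Because the Haar pair is orthonormal, the subbands preserve the energy of the input frame, and the PCA projection yields mutually uncorrelated feature dimensions, which is the decorrelation role the abstract assigns to PCA.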
Keywords :
feature extraction; principal component analysis; signal denoising; speech recognition; wavelet transforms; Aurora 3 database; automatic noisy speech recognition systems; automatic speech recognition; consonants; decorrelation; dimensionality reduction; feature vector elements; mel-frequency cepstral coefficients; noise robust speech feature extraction algorithm; noise robustness; nonstationary segments; performance improvement; principal component analysis; short-time Fourier transform; soft thresholding procedure; speech signal; stationary segments; time-frequency signal; vowel phonemes; wavelet based denoising algorithm; wavelet function; wavelet packet decomposition; wavelet packet transformation; Computational efficiency; Feature extraction; Fourier transforms; Noise robustness; Principal component analysis; Signal representations; Speech enhancement; Speech recognition; Time frequency analysis; Wavelet packets;
Conference_Title :
EUROCON 2003. Computer as a Tool. The IEEE Region 8
Print_ISBN :
0-7803-7763-X
DOI :
10.1109/EURCON.2003.1248166