DocumentCode :
3488743
Title :
Phase autocorrelation (PAC) derived robust speech features
Author :
Ikbal, Shajith ; Misra, Hemant ; Bourlard, Hervé
Author_Institution :
IDIAP, Martigny, Switzerland
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
We introduce a new class of noise robust acoustic features derived from a new measure of autocorrelation, and explicitly exploiting the phase variation of the speech signal frame over time. This family of features, referred to as "phase autocorrelation" (PAC) features, include PAC spectrum and PAC MFCC (Mel-frequency cepstral coefficient), among others. In regular autocorrelation based features, the correlation between two signal segments (signal vectors), separated by a particular time interval k, is calculated as a dot product of these two vectors. In our proposed PAC approach, the angle between the two vectors is used as a measure of correlation. Since dot product is usually more affected by noise than the angle, PAC-features are expected to be more robust to noise. This is indeed significantly confirmed by the presented experimental results. The experiments were conducted on the Numbers 95 database, on which "stationary" (car) and "non -stationary" (factory) Noisex 92 noises were added with varying SNR. In most of the cases, without any specific tuning, PAC-MFCC features perform better.
Keywords :
acoustic noise; correlation methods; random noise; speech recognition; vectors; MFCC; Mel-frequency cepstral coefficient; Noisex 92 noise; Numbers 95 database; SNR; autocorrelation; feature extraction; noise robust acoustic features; phase autocorrelation features; robust speech features; speech recognition; vector dot product; Acoustic measurements; Acoustic noise; Autocorrelation; Mel frequency cepstral coefficient; Noise measurement; Noise robustness; Phase measurement; Phase noise; Speech enhancement; Time measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202312
Filename :
1202312
Link To Document :
بازگشت