Title :
Phone recognition experiments with 2D-DCT spectro-temporal features
Author :
Kovács, Gy ; Tóth, L.
Author_Institution :
Res. Group on Artificial Intell., Univ. of Szeged & Hungarian Acad. of Sci., Szeged, Hungary
Abstract :
Localized spectro-temporal analysis is a novel feature extraction strategy in speech recognition, which was inspired by neurophysiological findings. Here we perform phone recognition experiments on features that are extracted from the patches of the critical-band log-energy spectrum by applying the two-dimensional cosine trans-form. We find that in phone recognition experiments the proposed feature set yields results similar to the standard MFCC features under clean conditions, while it provides a significantly smaller performance degradation in noisy conditions. Moreover, we show that the new and the standard features can be readily combined to improve the recognition accuracy still further.
Keywords :
cepstral analysis; discrete cosine transforms; feature extraction; neurophysiology; speech recognition; 2D-DCT spectro-temporal features; MFCC features; critical-band log-energy spectrum; discrete cosine transform; feature extraction strategy; localized spectro-temporal analysis; mel-frequency cepstral coefficients; neurophysiological findings; recognition experiments; speech recognition; Error analysis; Feature extraction; Mel frequency cepstral coefficient; Noise; Speech; Speech recognition; Time frequency analysis;
Conference_Titel :
Applied Computational Intelligence and Informatics (SACI), 2011 6th IEEE International Symposium on
Conference_Location :
Timisoara
Print_ISBN :
978-1-4244-9108-7
DOI :
10.1109/SACI.2011.5872988