Title :
Entropy-based variable frame rate analysis of speech signals and its application to ASR
Author :
You, H. ; Zhu, Q. ; Alwan, A.
Author_Institution :
Electr. Eng. Dept., UCLA, Los Angeles, CA, USA
Abstract :
Most speech processing algorithms analyze speech signals frame by frame at a fixed frame rate. Fixed-rate analysis is inconsistent with human speech perception and effectively assigns the same importance, or 'weight', to all frames of equal duration. In Zhu et al. (2000), we proposed a variable frame rate (VFR) analysis technique based on a Euclidean distance measure. In this paper, we propose another approach to VFR, based on the entropy of the signal. We compare the entropy and Euclidean distance measures for VFR in ASR experiments using the Aurora2 and TI46 databases. The entropy-based VFR performs better than both our earlier VFR approach and the fixed-rate system.
Keywords :
entropy; speech processing; speech recognition; ASR; Aurora2; Euclidean distance measures; TI46 database; VFR; automatic speech recognition; fixed-rate analysis; performance; speech processing algorithms; speech signals; variable frame rate analysis; Acoustic noise; Covariance matrix; Distributed computing; Random variables; Signal analysis; Signal processing; Speech analysis
Conference_Title :
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), 2004. Proceedings.
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326044