DocumentCode :
447207
Title :
A two-dimensional robust algorithm based on Mel-bank log-spectrum
Author :
Wang, Yizhou ; Wu, Ji ; Wang, Zuoying
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume :
1
fYear :
2005
fDate :
12-14 Oct. 2005
Firstpage :
759
Lastpage :
763
Abstract :
In this paper, a novel technique named two-dimensional robust algorithm based on Mel-bank log-spectrum is presented which combines sub-band average log-spectrum enhancement algorithm and peak isolation algorithm in log-spectrum domain. Information on the long envelopes of time trajectories of log-spectral energies and robustness of the log-spectral peaks in frequency bands are used together to achieve more excellent performance in this algorithm. Comparative experiments on the Aurora 2 databases are performed between these techniques and the proposed two-dimensional algorithm. It is shown that this combined two-dimensional algorithm results in better recognition performance than each technique. Compared to the Aurora 2.0 baseline system, the word error rate reduction was 21.23% in the clean training case for this combined algorithm.
Keywords :
speech enhancement; speech recognition; Mel-bank log-spectrum; peak isolation algorithm; subband average log-spectrum enhancement algorithm; two-dimensional robust algorithm; Acoustic noise; Automatic speech recognition; Filtering; Frequency domain analysis; Robustness; Signal processing algorithms; Speech analysis; Speech enhancement; Speech processing; Wiener filter;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications and Information Technology, 2005. ISCIT 2005. IEEE International Symposium on
Print_ISBN :
0-7803-9538-7
Type :
conf
DOI :
10.1109/ISCIT.2005.1566964
Filename :
1566964
Link To Document :
بازگشت