DocumentCode :
2279149
Title :
Histogram based normalization in the acoustic feature space
Author :
Molau, Sirko ; Pitz, Michael ; Ney, Hermann
Author_Institution :
Lehrstuhl fdr Informatik VI, Rheinisch-Westfalische Tech. Hochschule, Aachen, Germany
fYear :
2001
fDate :
2001
Firstpage :
21
Lastpage :
24
Abstract :
We describe a technique called histogram normalization that aims at normalizing feature space distributions at different stages in the signal analysis front-end, namely the log-compressed filterbank vectors, cepstrum coefficients, and LDA (local density approximation) transformed acoustic vectors. Best results are obtained at the filterbank, and in most cases there is a minor additional gain when normalization is applied sequentially at different stages. We show that histogram normalization performs best if applied both in training and recognition, and that smoothing the target histogram obtained on the training data is also helpful. On the VerbMobil II corpus, a German large-vocabulary conversational speech recognition task, we achieve an overall reduction in word error rate of about 10% relative.
Keywords :
acoustic signal processing; cepstral analysis; channel bank filters; density functional theory; learning (artificial intelligence); speech recognition; statistical analysis; German large-vocabulary; LDA; VerbMobil II corpus; acoustic feature space; cepstrum coefficients; conversational speech recognition; histogram normalization; local density approximation; log-compressed filterbank vectors; recognition; signal analysis front-end; training; Cepstrum; Error analysis; Filter bank; Histograms; Linear discriminant analysis; Signal analysis; Smoothing methods; Speech recognition; Target recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
Type :
conf
DOI :
10.1109/ASRU.2001.1034579
Filename :
1034579
Link To Document :
بازگشت