DocumentCode
2279149
Title
Histogram based normalization in the acoustic feature space
Author
Molau, Sirko ; Pitz, Michael ; Ney, Hermann
Author_Institution
Lehrstuhl fdr Informatik VI, Rheinisch-Westfalische Tech. Hochschule, Aachen, Germany
fYear
2001
fDate
2001
Firstpage
21
Lastpage
24
Abstract
We describe a technique called histogram normalization that aims at normalizing feature space distributions at different stages in the signal analysis front-end, namely the log-compressed filterbank vectors, cepstrum coefficients, and LDA (local density approximation) transformed acoustic vectors. Best results are obtained at the filterbank, and in most cases there is a minor additional gain when normalization is applied sequentially at different stages. We show that histogram normalization performs best if applied both in training and recognition, and that smoothing the target histogram obtained on the training data is also helpful. On the VerbMobil II corpus, a German large-vocabulary conversational speech recognition task, we achieve an overall reduction in word error rate of about 10% relative.
Keywords
acoustic signal processing; cepstral analysis; channel bank filters; density functional theory; learning (artificial intelligence); speech recognition; statistical analysis; German large-vocabulary; LDA; VerbMobil II corpus; acoustic feature space; cepstrum coefficients; conversational speech recognition; histogram normalization; local density approximation; log-compressed filterbank vectors; recognition; signal analysis front-end; training; Cepstrum; Error analysis; Filter bank; Histograms; Linear discriminant analysis; Signal analysis; Smoothing methods; Speech recognition; Target recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN
0-7803-7343-X
Type
conf
DOI
10.1109/ASRU.2001.1034579
Filename
1034579
Link To Document