• DocumentCode
    2279149
  • Title

    Histogram based normalization in the acoustic feature space

  • Author

    Molau, Sirko ; Pitz, Michael ; Ney, Hermann

  • Author_Institution
    Lehrstuhl fdr Informatik VI, Rheinisch-Westfalische Tech. Hochschule, Aachen, Germany
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    21
  • Lastpage
    24
  • Abstract
    We describe a technique called histogram normalization that aims at normalizing feature space distributions at different stages in the signal analysis front-end, namely the log-compressed filterbank vectors, cepstrum coefficients, and LDA (local density approximation) transformed acoustic vectors. Best results are obtained at the filterbank, and in most cases there is a minor additional gain when normalization is applied sequentially at different stages. We show that histogram normalization performs best if applied both in training and recognition, and that smoothing the target histogram obtained on the training data is also helpful. On the VerbMobil II corpus, a German large-vocabulary conversational speech recognition task, we achieve an overall reduction in word error rate of about 10% relative.
  • Keywords
    acoustic signal processing; cepstral analysis; channel bank filters; density functional theory; learning (artificial intelligence); speech recognition; statistical analysis; German large-vocabulary; LDA; VerbMobil II corpus; acoustic feature space; cepstrum coefficients; conversational speech recognition; histogram normalization; local density approximation; log-compressed filterbank vectors; recognition; signal analysis front-end; training; Cepstrum; Error analysis; Filter bank; Histograms; Linear discriminant analysis; Signal analysis; Smoothing methods; Speech recognition; Target recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
  • Print_ISBN
    0-7803-7343-X
  • Type

    conf

  • DOI
    10.1109/ASRU.2001.1034579
  • Filename
    1034579