• DocumentCode
    323478
  • Title

    Exploiting acoustic feature correlations by joint neural vector quantizer design in a discrete HMM system

  • Author

    Neukirchen, Christoph ; Willett, Daniel ; Eickeler, Stefan ; Müller, Stefan

  • Author_Institution
    Dept. of Comput. Sci, Gerhard-Mercator Univ., Duisburg, Germany
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    5
  • Abstract
    In previous work about hybrid speech recognizers with discrete HMMs we have shown that VQs, that are trained according to a maximum mutual information (MMI) criterion, are well suited for ML estimated Bayes classifiers. This is only valid for single VQ systems. In this paper we extend the theory to speech recognizers with multiple VQs. This leads to a joint training criterion for arbitrary multiple neural VQs that considers the inter VQ correlation during parameter estimation. The idea of a gradient based joint training method is derived. Experimental results indicate that inter VQ correlations can cause some degradation of recognition performance. The joint multiple VQ training decorrelates the quantizer labels and improves system performance. In addition the new training criterion allows for a less careful way of splitting up the feature vector into multiple streams that do not have to be statistically independent. In particular the use of highly correlated features in conjunction with the novel training criterion in the experiments leads to important gains in recognition performance for the speaker independent resource management database and gives the lowest error rate of 5.0% we ever obtained in this framework
  • Keywords
    correlation methods; hidden Markov models; learning (artificial intelligence); neural nets; parameter estimation; speech recognition; vector quantisation; ML estimated Bayes classifiers; VQ; acoustic feature correlations; discrete HMM system; feature vector; gradient based joint training method; highly correlated features; hybrid speech recognizers; inter VQ correlation; joint neural vector quantizer design; joint training criterion; maximum mutual information criterion; multiple VQ training; parameter estimation; speaker independent resource management database; speech recognition; Decorrelation; Degradation; Management training; Maximum likelihood estimation; Mutual information; Parameter estimation; Performance gain; Resource management; Speech recognition; System performance;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674353
  • Filename
    674353