• DocumentCode
    3429807
  • Title

    Free energy for speech recognition

  • Author

    Singh, Rita ; Kumatani, Kenichi

  • Author_Institution
    Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4515
  • Lastpage
    4519
  • Abstract
    Traditionally, speech recognizers have used a strictly Bayesian paradigm for finding the best hypothesis from amongst all possible hypotheses for the data to be recognized. The Bayes classification rule has been shown to be optimal when the class distributions represent the true distributions of the data to be classified. In reality, however, this condition is often not satisfied - the classifier itself is trained on some training data and may be deployed to classify data whose statistical characteristics are different from the training data. The Bayes classification rule may result in suboptimal performance under these conditions of mismatch. Classification may benefit from the use of modified classification rules in this case. The use of entropy as an optimization criterion for various classification tasks has been well established in the literature. In this paper we show that free energy, a thermodynamic concept directly related to entropy, can also be used as an objective criterion in classification. Furthermore, we show how this novel classification scheme can be used in the framework of existing Bayesian classification schemes implemented in current speech recognizers by simply modifying the class distributions a priori. Pilot experiments show that minimization of free energy results in more accurate recognition under conditions of mismatch.
  • Keywords
    Bayes methods; speech recognition; Bayes classification rule; Bayesian classification schemes; Bayesian paradigm; data classification; free energy; objective criterion; speech recognition; statistical characteristics; suboptimal performance; thermodynamic concept; training data; Bayes methods; Entropy; Hidden Markov models; Mathematical model; Speech; Speech recognition; Thermodynamics; Bayesian classification; Free energy; Speech recognition; Temperature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178825
  • Filename
    7178825