Free energy for speech recognition

Author

Singh, Rita ; Kumatani, Kenichi

Author_Institution

Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear

2015

fDate

19-24 April 2015

Firstpage

4515

Lastpage

4519

Abstract

Traditionally, speech recognizers have used a strictly Bayesian paradigm for finding the best hypothesis from amongst all possible hypotheses for the data to be recognized. The Bayes classification rule has been shown to be optimal when the class distributions represent the true distributions of the data to be classified. In reality, however, this condition is often not satisfied - the classifier itself is trained on some training data and may be deployed to classify data whose statistical characteristics are different from the training data. The Bayes classification rule may result in suboptimal performance under these conditions of mismatch. Classification may benefit from the use of modified classification rules in this case. The use of entropy as an optimization criterion for various classification tasks has been well established in the literature. In this paper we show that free energy, a thermodynamic concept directly related to entropy, can also be used as an objective criterion in classification. Furthermore, we show how this novel classification scheme can be used in the framework of existing Bayesian classification schemes implemented in current speech recognizers by simply modifying the class distributions a priori. Pilot experiments show that minimization of free energy results in more accurate recognition under conditions of mismatch.

Keywords

Bayes methods; speech recognition; Bayes classification rule; Bayesian classification schemes; Bayesian paradigm; data classification; free energy; objective criterion; speech recognition; statistical characteristics; suboptimal performance; thermodynamic concept; training data; Bayes methods; Entropy; Hidden Markov models; Mathematical model; Speech; Speech recognition; Thermodynamics; Bayesian classification; Free energy; Speech recognition; Temperature;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7178825

Filename

7178825