DocumentCode :
1303568
Title :
Model complexity adaptation using a discriminant measure
Author :
Padmanabhan, M. ; Ban, L.R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
8
Issue :
2
fYear :
2000
fDate :
3/1/2000 12:00:00 AM
Firstpage :
205
Lastpage :
208
Abstract :
We present a discriminant measure that can be used to determine the model complexity in a speech recognition system. In the speech recognition process, sub-phonetic classes are modelled as mixtures of Gaussians, and we present a new discriminant measure that uses the classification accuracy to determine in an objective fashion, the number of Gaussians required to best model the PDF of an allophone class. We compare the performance of this criterion with other criteria such as the Bayesian information criterion (BIC), and show that the BIC and the discriminative criterion lead to parsimonious models that provide the same word error rate performance as much larger baseline systems. However, this performance improvement depends on the size of the system, and there appears to be a crossover point beyond which both the BIC and the discriminative criterion are worse than a much simpler criterion. The discriminative criterion also enables this crossover point to be controlled by means of a threshold that is used in the criterion, and can lead to a better tradeoff of complexity versus word error rate
Keywords :
Bayes methods; Gaussian processes; computational complexity; error statistics; probability; speech recognition; Bayesian information criterion; Gaussian mixtures; PDF; allophone class; classification accuracy; crossover point; discriminant measure; model complexity adaptation; parsimonious models; performance; speech recognition system; sub-phonetic classes; word error rate performance; Adaptation model; Error analysis; Feature extraction; Gaussian processes; Parametric statistics; Probability; Speech recognition; Testing; Training data; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.824707
Filename :
824707
Link To Document :
بازگشت