DocumentCode
2177253
Title
Dirichlet Mixture Models of neural net posteriors for HMM-based speech recognition
Author
Balakrishnan, Venkataramanan ; Sivaram, G.S.V.S. ; Khudanpur, Sanjeev
Author_Institution
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
fYear
2011
fDate
22-27 May 2011
Firstpage
5028
Lastpage
5031
Abstract
In this paper, we present a novel technique for modeling the posterior probability estimates obtained from a neural net work directly in the HMM framework using the Dirichlet Mixture Models (DMMs). Since posterior probability vectors lie on a probability simplex their distribution can be modeled using DMMs. Being in an exponential family, the parameters of DMMs can be estimated in an efficient manner. Conventional approaches like TANDEM attempt to gaussianize the posteriors by suitable transforms and model them using Gaussian Mixture Models (GMMs). This requires more number of parameters as it does not exploit the fact that the probability vectors lie on a simplex. We demonstrate through TIMIT phoneme recognition experiments that the proposed technique outperforms the conventional TANDEM approach.
Keywords
Gaussian processes; hidden Markov models; speech recognition; statistical distributions; DMM; Dirichlet mixture models; GMM; Gaussian Mixture Models; HMM-based speech recognition; TANDEM approach; TIMIT phoneme recognition; neural net posteriors; posterior probability estimates; Computational modeling; Data models; Feature extraction; Hidden Markov models; Probability; Speech; Training; Dirichlet distribution; HMMs; neural network posteriors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5947486
Filename
5947486
Link To Document