DocumentCode :
1761644
Title :
Laplace Group Sensing for Acoustic Models
Author :
Jen-Tzung Chien
Author_Institution :
Dept. of Electr. & Comput. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume :
23
Issue :
5
fYear :
2015
fDate :
42125
Firstpage :
909
Lastpage :
922
Abstract :
This paper presents the group sparse learning for acoustic models where a sequence of acoustic features is driven by Markov chain and each feature vector is represented by groups of basis vectors. The group of common bases represents the features across Markov states within a regression class. The group of individual basis compensates the intra-state residual information. Laplace distribution is used as the sparse prior of sensing weights for group basis representation. Laplace parameter serves as regularization parameter or automatic relevance determination which controls the selection of relevant bases for acoustic modeling. The groups of regularization parameters and basis vectors are estimated from training data by maximizing the marginal likelihood over sensing weights which is implemented by Laplace approximation using the Hessian matrix and the maximum a posteriori parameters. Model uncertainty is compensated through full Bayesian treatment. The connection of Laplace group sensing to lasso regularization is illustrated. Experiments on noisy speech recognition show the robustness of group sparse acoustic models in presence of different noise types and SNRs.
Keywords :
Bayes methods; Hessian matrices; Markov processes; acoustic signal processing; compressed sensing; feature extraction; maximum likelihood estimation; signal representation; speech recognition; Bayesian treatment; Hessian matrix; Laplace distribution; Laplace group sensing; Markov chain; acoustic feature vector sequence; acoustic model; automatic relevance determination; group basis representation; group sparse learning; lasso regularization; marginal likelihood maximization; maximum a posteriori parameter; regression class; regularization parameter; speech recognition; Acoustics; Bayes methods; Encoding; Hidden Markov models; Sensors; Speech recognition; Vectors; Acoustic model; Laplace distribution; basis representation; group sparsity; speech recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2015.2412466
Filename :
7058393
Link To Document :
بازگشت