DocumentCode :
1740822
Title :
Fusing audio and visual features of speech
Author :
Pan, Hao ; Liang, Zhi-Pei ; Huang, Thomas S.
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana, IL, USA
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
214
Abstract :
In this paper, the audio and visual features of speech are integrated using a novel fused-HMM. We assume that the two sets of features may have different data rates and duration. Hidden Markov models (HMMs) are first used to model them separately, and then a general Bayesian fusion method, which is optimal in the maximum entropy sense, is employed to fuse them together. Particularly, an efficient learning algorithm is introduced. Instead of maximizing the joint likelihood of the fuse-HMM, the learning algorithm maximizes the two HMMs separately, and then fuses the HMMs together. In addition, an inference algorithm is proposed. We have tested the proposed method by person verification experiments. Results show that the proposed method significantly reduces the recognition error rates as compared to the unimodal HMMs and the loosely-coupled fusion model
Keywords :
Bayes methods; feature extraction; hidden Markov models; inference mechanisms; learning (artificial intelligence); maximum entropy methods; speaker recognition; audio features; data duration; data rates; efficient learning algorithm; fuse-HMM; fused-HMM; general Bayesian fusion method; hidden Markov models; inference algorithm; learning algorithm; loosely-coupled fusion model; maximum entropy; person verification; recognition error rate reduction; speaker verification; speech features fusion; unimodal HMM; visual features; Bayesian methods; Entropy; Error analysis; Fuses; Hidden Markov models; Inference algorithms; Probability; Speech; Surges; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2000. Proceedings. 2000 International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1522-4880
Print_ISBN :
0-7803-6297-7
Type :
conf
DOI :
10.1109/ICIP.2000.899333
Filename :
899333
Link To Document :
بازگشت