DocumentCode :
3641630
Title :
Improving speech recognition with audio-visual tandem classifiers and their fusions
Author :
İbrahim Saygın Topkaya;Mehmet Umut Şen;Mustafa Berkay Yılmaz;Hakan Erdoğan
Author_Institution :
Sabancı
fYear :
2011
fDate :
4/1/2011 12:00:00 AM
Firstpage :
407
Lastpage :
410
Abstract :
“Tandem approach” is a method used in speech recognition to increase performance by using classifier posterior probabilities as observations in a hidden Markov model. In this work we study the effect of using multiple visual tandem features to improve audio-visual recognition accuracy. In addition, we investigate methods to combine outputs of several audio and visual tandem classifiers with a classifier fusion system to generate outputs using learned weights. Experiments show that both approaches help to improve audio-visual speech recognition with respect to regular audio-visual speech recognition especially in noisy environments.
Keywords :
"Markov processes","Hidden Markov models","Speech recognition","Mel frequency cepstral coefficient","Signal processing","Conferences","Speech"
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications Applications (SIU), 2011 IEEE 19th Conference on
ISSN :
2165-0608
Print_ISBN :
978-1-4577-0462-8
Type :
conf
DOI :
10.1109/SIU.2011.5929673
Filename :
5929673
Link To Document :
بازگشت