DocumentCode :
454827
Title :
A Combined LSTM-RNN - HMM - Approach for Meeting Event Segmentation and Recognition
Author :
Reiter, Stephan ; Schuller, Bjorn ; Rigoll, Gerhard
Author_Institution :
Inst. for Human-Machine-Commun., Technische Univ. Munchen, Garching
Volume :
2
fYear :
2006
fDate :
14-19 May 2006
Abstract :
Automatic segmentation and classification of recorded meetings provides a basis that enables effective browsing and querying in a meeting archive. Yet, robustness of today´s approaches is often not reliable enough. We therefore strive to improve on this task by introduction of a tandem approach combining the discriminative abilities of recurrent neural nets and warping capabilities of hidden Markov models. Thereby long short-term memory cells are used for audio-visual frame analysis within the neural net. These help to overcome typical long time lags. Extensive test runs on the public M4 Scripted Meeting Corpus show great performance applying our suggested novel approach
Keywords :
hidden Markov models; image classification; image segmentation; recurrent neural nets; HMM; audio-visual frame analysis; hidden Markov models; long short-term memory recurrent neural nets; meeting event recognition; meeting event segmentation; recorded meetings classification; Feature extraction; Hidden Markov models; Microphone arrays; Neural networks; Pattern recognition; Recurrent neural networks; Robustness; Speech; Streaming media; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660362
Filename :
1660362
Link To Document :
بازگشت