DocumentCode :
1728455
Title :
Dominant speaker detection using discrete Markov chain for multi-user video conferencing
Author :
Baskaran, Vishnu Monn ; Yoong Choon Chang ; Loo, Jonathan ; KokSheik Wong ; Ming-Tao Gan
Author_Institution :
Fac. of Eng., Multimedia Univ., Cyberjaya, Malaysia
fYear :
2015
Firstpage :
492
Lastpage :
493
Abstract :
This paper puts forward a discrete-time Markov chain algorithm in predicting a pair of active or dominant speakers in an ultra-high definition multi-user video conferencing system. The applied Markov chain minimizes false dominant speaker classification due to transient noise during a video conferencing session. This algorithm also includes a set of linear weights-based assignment for both the initial state vector and transition probability matrix, which improves the response of the algorithm towards changing dominant speakers. Experimental results suggests that this algorithm accurately predicts the most dominant speaker at a rate of 83% for 11 clients in a combined video with 86% reduction in false dominant speaker classification, based on given a set of artificial speaker data.
Keywords :
Markov processes; matrix algebra; probability; speaker recognition; teleconferencing; video communication; artificial speaker data; discrete time Markov chain algorithm; dominant speaker detection; initial state vector; linear weights; multiuser video conferencing system; speaker classification; transient noise; transition probability matrix; video conferencing session; Bandwidth; Classification algorithms; Containers; Markov processes; Noise; Prediction algorithms; Transient analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
Conference_Location :
Taipei
Type :
conf
DOI :
10.1109/ICCE-TW.2015.7217016
Filename :
7217016
Link To Document :
بازگشت