DocumentCode
1728455
Title
Dominant speaker detection using discrete Markov chain for multi-user video conferencing
Author
Baskaran, Vishnu Monn ; Yoong Choon Chang ; Loo, Jonathan ; KokSheik Wong ; Ming-Tao Gan
Author_Institution
Fac. of Eng., Multimedia Univ., Cyberjaya, Malaysia
fYear
2015
Firstpage
492
Lastpage
493
Abstract
This paper puts forward a discrete-time Markov chain algorithm in predicting a pair of active or dominant speakers in an ultra-high definition multi-user video conferencing system. The applied Markov chain minimizes false dominant speaker classification due to transient noise during a video conferencing session. This algorithm also includes a set of linear weights-based assignment for both the initial state vector and transition probability matrix, which improves the response of the algorithm towards changing dominant speakers. Experimental results suggests that this algorithm accurately predicts the most dominant speaker at a rate of 83% for 11 clients in a combined video with 86% reduction in false dominant speaker classification, based on given a set of artificial speaker data.
Keywords
Markov processes; matrix algebra; probability; speaker recognition; teleconferencing; video communication; artificial speaker data; discrete time Markov chain algorithm; dominant speaker detection; initial state vector; linear weights; multiuser video conferencing system; speaker classification; transient noise; transition probability matrix; video conferencing session; Bandwidth; Classification algorithms; Containers; Markov processes; Noise; Prediction algorithms; Transient analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
Conference_Location
Taipei
Type
conf
DOI
10.1109/ICCE-TW.2015.7217016
Filename
7217016
Link To Document