• DocumentCode
    1728455
  • Title

    Dominant speaker detection using discrete Markov chain for multi-user video conferencing

  • Author

    Baskaran, Vishnu Monn ; Yoong Choon Chang ; Loo, Jonathan ; KokSheik Wong ; Ming-Tao Gan

  • Author_Institution
    Fac. of Eng., Multimedia Univ., Cyberjaya, Malaysia
  • fYear
    2015
  • Firstpage
    492
  • Lastpage
    493
  • Abstract
    This paper puts forward a discrete-time Markov chain algorithm in predicting a pair of active or dominant speakers in an ultra-high definition multi-user video conferencing system. The applied Markov chain minimizes false dominant speaker classification due to transient noise during a video conferencing session. This algorithm also includes a set of linear weights-based assignment for both the initial state vector and transition probability matrix, which improves the response of the algorithm towards changing dominant speakers. Experimental results suggests that this algorithm accurately predicts the most dominant speaker at a rate of 83% for 11 clients in a combined video with 86% reduction in false dominant speaker classification, based on given a set of artificial speaker data.
  • Keywords
    Markov processes; matrix algebra; probability; speaker recognition; teleconferencing; video communication; artificial speaker data; discrete time Markov chain algorithm; dominant speaker detection; initial state vector; linear weights; multiuser video conferencing system; speaker classification; transient noise; transition probability matrix; video conferencing session; Bandwidth; Classification algorithms; Containers; Markov processes; Noise; Prediction algorithms; Transient analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Consumer Electronics - Taiwan (ICCE-TW), 2015 IEEE International Conference on
  • Conference_Location
    Taipei
  • Type

    conf

  • DOI
    10.1109/ICCE-TW.2015.7217016
  • Filename
    7217016