DocumentCode :
257795
Title :
Improving overlapping speaker detection using multiple speaker tracking information
Author :
Oualil, Youssef ; Toroghi, Rahil Mahdian ; Klakow, Dietrich
Author_Institution :
Spoken Language Syst., Saarland Univ., Saarbrucken, Germany
fYear :
2014
fDate :
3-5 Dec. 2014
Firstpage :
552
Lastpage :
556
Abstract :
Traditionally, multiple speaker tracking consists of two stages, namely, 1) detection of location measurements, followed by 2) a multiple object tracking approach. In general, these two steps are performed separately, and the tracking performance is highly dependent on the measurement detection rate. The performance of the widely used Steered Response Power (SRP)-based measurement detectors, however, drastically decreases in the overlapping speech scenario, where the dominant speaker frequently masks the low-energy speakers. To overcome this problem, we propose an approach that enhances the probabilistic SRP-based measurement detector, using the multiple speaker information obtained in the tracking step. In doing so, this approach tightly couples the two stages, and increases the detection rate of low-energy speakers during overlapping speech segments. Experiments conducted on the AV16.3 corpus showed a significant improvement of the detection and tracking performance, when the proposed approach is integrated into a Kalman-based multiple speaker tracking framework.
Keywords :
object tracking; speaker recognition; AV16.3 corpus; Kalman-based multiple speaker tracking framework; SRP-based measurement detector; location measurement detection; low-energy speakers; multiple object tracking approach; multiple speaker tracking information; overlapping speaker detection; overlapping speech segment; probabilistic SRP-based measurement detector; steered response power; Bayes methods; Detectors; Microphones; Noise; Speech; Speech processing; Target tracking; Kaiman filter; Speaker overlap; conversational speech; multiple speaker tracking; steered response power;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal and Information Processing (GlobalSIP), 2014 IEEE Global Conference on
Conference_Location :
Atlanta, GA
Type :
conf
DOI :
10.1109/GlobalSIP.2014.7032178
Filename :
7032178
Link To Document :
بازگشت