Title :
Online unsupervised overlapping speaker detection using enhanced classification history-based features
Author :
Oualil, Youssef ; Toroghi, Rahil Mahdian ; Klakow, Dietrich
Author_Institution :
Spoken Language Syst., Saarland Univ., Saarbrücken, Germany
Abstract :
Overlapping speaker localization approaches generally require a binary detector which performs the source/noise classification of the location estimates. This is mainly due to the unknown time-varying number of sources, and to the presence of noise and reverberation. In this paper, we firstly introduce an online implementation of a previously developed offline multiple speaker detector. This classifier is then extended to include new detection features. More precisely, the proposed approach uses the classified location estimates as labelled data to train new classification models for different potential features. The resulting models are then integrated into the online classifier to improve the classification performance. In particular, this paper investigates three different classification history-based models, namely, the location, the kurtosis and the probabilistic steered response power features. Experiments conducted on the AV16.3 corpus show the effectiveness of the proposed approach.
Keywords :
feature extraction; signal classification; speaker recognition; unsupervised learning; AV16.3 corpus; binary detector; classified location estimation; enhanced classification history based features; noise classification; online unsupervised overlapping speaker detection; source classification; speaker localization; Acoustics; Conferences; Feature extraction; Maximum likelihood estimation; Microphones; Noise; Speech; Multiple speaker detection; steered response power; unsupervised Bayesian classifier;
Conference_Titel :
Acoustic Signal Enhancement (IWAENC), 2014 14th International Workshop on
Conference_Location :
Juan-les-Pins
DOI :
10.1109/IWAENC.2014.6954012