DocumentCode
699283
Title
Voice separation of overlapping speech using tracking techniques and the gating process
Author
Potamitis, Ilyas ; Zervas, Panos ; Fakotakis, Nikos
Author_Institution
Electr. & Comput. Eng. Dept., Univ. of Patras, Patras, Greece
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
1119
Lastpage
1122
Abstract
This paper investigates the use of tracking techniques, successfully applied to aircraft tracking and navigation, to segment possibly overlapping speech of multiple static speakers in an enclosure. The tracking technique applied, namely probabilistic data association (PDA) in conjunction with the interacting multiple model (IMM) estimator, directly accounts for measurement origin uncertainty, i.e., which direction of arrival (DOA) measurement comes from which speaker, and rejects spurious DOAs. The estimated DOAs are utilized by a single microphone array to provide separation through its directional receptive field. Based on the prediction of the IMM filter, which constructs permissible DOA regions (gates) for each speaker, we elaborate on the concept and application of the so-called 'gating process', which can be utilized in the initialization and termination of speech tracks, thus serving as a voice activity detector (VAD). The effectiveness of the approach is illustrated by an extensive simulation study on tracking and separating three static speakers having a conversation with partially overlapping speech and long pauses.
Keywords
aircraft navigation; direction-of-arrival estimation; speech processing; tracking; DOA measurement; IMM estimator; IMM filter prediction; PDA; VAD; aircraft tracking; direction-of-arrival measurement; directional receptive field; gating process; interacting multiple model; multiple-static speakers; overlapping speech segmentation; partially-overlapping speech; probabilistic data association; single-microphone array; speech track initialization; speech track termination; static speaker separation; static speaker tracking; tracking technique; voice activity detector; voice separation; Abstracts; Robustness; Wideband
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7079813