DocumentCode
164840
Title
The Athena-RC system for speech activity detection and speaker localization in the DIRHA smart home
Author
Giannoulis, Panagiotis ; Tsiami, Antigoni ; Rodomagoulakis, I. ; Katsamanis, Athanasios ; Potamianos, Gerasimos ; Maragos, Petros
Author_Institution
Sch. of Electr. & Comput. Eng., Nat. Tech. Univ. of Athens, Athens, Greece
fYear
2014
fDate
12-14 May 2014
Firstpage
167
Lastpage
171
Abstract
We present our system for speech activity detection and speaker localization inside a smart home with multiple rooms equipped with microphone arrays of known geometry and placement. The smart home is developed as part of the European-funded DIRHA project, which provides both simulated and real data for system development and evaluation under extremely challenging conditions of noise, reverberation, and speech overlap. Our proposed approach first performs speech activity detection by employing multi-microphone decision fusion on traditional statistical models and acoustic features, within a Viterbi decoding framework, further assisted by signal energy- and model log-likelihood threshold-based heuristics. It then performs speaker localization using traditional time-difference-of-arrival estimation between properly selected microphone pairs, further assisted by a dereverberation component. The system achieves very low detection errors, namely less than 4% (5%) for speech activity detection in the simulated (real) DIRHA corpus, and less than 10% (12%) for joint speech detection and speaker localization.
Keywords
Viterbi decoding; acoustic signal processing; home computing; microphones; speaker recognition; speech coding; Athena-RC system; DIRHA European funded project; DIRHA corpus; DIRHA smart home; Viterbi decoding framework; acoustic features; dereverberation component; joint speech detection; microphone arrays; microphone pairs; model log-likelihood threshold-based heuristics; multimicrophone decision fusion; noise; signal energy-based heuristics; speaker localization; speech activity detection; speech overlap; statistical models; system development; system evaluation; time-difference of arrival estimation; Acoustics; Direction-of-arrival estimation; Estimation; Microphone arrays; Smart homes; Speech; microphone arrays; smart homes; speaker localization; speech detection;
fLanguage
English
Publisher
IEEE
Conference_Titel
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
Conference_Location
Villers-lès-Nancy
Type
conf
DOI
10.1109/HSCMA.2014.6843273
Filename
6843273