Title :
Group delay based methods for speech source localization over circular arrays
Author :
Tripathy, Ardhendu ; Kumar, L. ; Hegde, Rajesh M.
Author_Institution :
Indian Inst. of Technol., Kanpur, India
fDate :
May 30 2011-June 1 2011
Abstract :
Conventional sub space based approaches for source localization use the spectral magnitude of MUSIC. In this paper, a group delay based method for source localization of spatially close speech sources over circular arrays, with minimal number of sensors is proposed. This approach is based on the MUSIC-Group delay spectrum and can be used to accurately estimate both azimuth and elevation angles of spatially close sources. Both simulated and real speech signal measurements are acquired over a circular array and the DOA estimation is carried out for several trials. The accuracy of the proposed approach is illustrated by using two dimensional scatter plots for a single source, and average error distribution plots for multiple sources. The high resolution property of this method is explained using the additive property of the MUSIC-Group delay spectrum. The proposed method is also evaluated under sensor perturbation errors. Experiments on distant speech recognition are conducted using the proposed approach on sentences from the TIMIT database acquired over circular arrays. The MUSIC-Group delay method indicates reasonable reduction in word error rates when compared to the standard MUSIC-Magnitude method as noted from these experiments.
Keywords :
array signal processing; delays; direction-of-arrival estimation; signal classification; speech recognition; DOA estimation; MUSIC-group delay spectrum; MUSIC-magnitude method; TIMIT database; average error distribution plots; azimuth estimation; circular arrays; distant speech recognition; elevation angle estimation; group delay method; sensor perturbation errors; spectral magnitude; speech signal measurements; speech source localization; speech sources; subspace based approaches; two dimensional scatter plots; Azimuth; Delay; Direction of arrival estimation; Estimation; Microphones; Multiple signal classification; Sensors; Azimuth; DOA; Distant Speech Recognition; Elevation; Group delay; MUSIC; Source localization; UCA; ULA;
Conference_Titel :
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2011 Joint Workshop on
Conference_Location :
Edinburgh
Print_ISBN :
978-1-4577-0997-5
DOI :
10.1109/HSCMA.2011.5942411