DocumentCode
1525697
Title
Clustered Blind Beamforming From Ad-Hoc Microphone Arrays
Author
Himawan, Ivan ; McCowan, Iain ; Sridharan, Sridha
Author_Institution
Speech & Audio Res. Lab., Queensland Univ. of Technol., Brisbane, QLD, Australia
Volume
19
Issue
4
fYear
2011
fDate
5/1/2011 12:00:00 AM
Firstpage
661
Lastpage
676
Abstract
Microphone arrays have been used in various applications to capture conversations, such as in meetings and teleconferences. In many cases, the microphone and likely source locations are known a priori, and calculating beamforming filters is therefore straightforward. In ad-hoc situations, however, when the microphones have not been systematically positioned, this information is not available and beamforming must be achieved blindly. In achieving this, a commonly neglected issue is whether it is optimal to use all of the available microphones, or only an advantageous subset of these. This paper commences by reviewing different approaches to blind beamforming, characterizing them by the way they estimate the signal propagation vector and the spatial coherence of noise in the absence of prior knowledge of microphone and speaker locations. Following this, a novel clustered approach to blind beamforming is motivated and developed. Without using any prior geometrical information, microphones are first grouped into localized clusters, which are then ranked according to their relative distance from a speaker. Beamforming is then performed using either the closest microphone cluster, or a weighted combination of clusters. The clustered algorithms are compared to the full set of microphones in experiments on a database recorded on different ad-hoc array geometries. These experiments evaluate the methods in terms of signal enhancement as well as performance on a large vocabulary speech recognition task.
Keywords
array signal processing; blind source separation; microphone arrays; speech recognition; statistical analysis; ad-hoc array; beamforming filters; blind beamforming; clustered algorithms; microphone arrays; signal propagation vector; spatial coherence; speaker locations; speech recognition; Array signal processing; speech enhancement; speech recognition;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2010.2055560
Filename
5497098
Link To Document