DocumentCode
111760
Title
Raking the Cocktail Party
Author
Dokmanic, Ivan ; Scheibler, Robin ; Vetterli, Martin
Author_Institution
LCAV-EPFL, Lausanne, Switzerland
Volume
9
Issue
5
fYear
2015
fDate
Aug. 2015
Firstpage
825
Lastpage
836
Abstract
We present the concept of an acoustic rake receiver-a microphone beamformer that uses echoes to improve the noise and interference suppression. The rake idea is well-known in wireless communications; it involves constructively combining different multipath components that arrive at the receiver antennas. Unlike spread-spectrum signals used in wireless communications, speech signals are not orthogonal to their shifts. Therefore, we focus on the spatial structure, rather than the temporal. Instead of explicitly estimating the channel, we create correspondences between early echoes in time and image sources in space. These multiple sources of the desired and the interfering signal offer additional spatial diversity that we can exploit in the beamformer design. We present several “intuitive” and optimal formulations of acoustic rake receivers, and show theoretically and numerically that the rake formulation of the maximum signal-to-interference-and-noise ratio beamformer offers significant performance boosts in terms of noise and interference suppression. Beyond signal-to-noise ratio, we observe gains in terms of the perceptual evaluation of speech quality (PESQ) metric for the speech quality. We accompany the paper by the complete simulation and processing chain written in Python. The code and the sound samples are available online at http://lcav.github.io/AcousticRake Receiver/.
Keywords
acoustic signal processing; array signal processing; interference suppression; microphones; multipath channels; radio receivers; radiofrequency interference; radiowave propagation; receiving antennas; speech processing; PESQ metric; Python; acoustic rake receiver; cocktail party raking; image sources; interference suppression improvement; maximum signal-to-interference-and-noise ratio beamformer; microphone beamformer; multipath components; multipath propagation; noise suppression improvement; perceptual evaluation-of-speech quality metric; receiver antennas; signal-to-noise ratio; time sources; wireless communications; Acoustics; Array signal processing; Interference; Microphones; Signal to noise ratio; Vectors; Acoustic rake receiver; beamforming; echo sorting; interference cancellation; noise suppression; perceptual evaluation of speech quality (PESQ); room impulse response;
fLanguage
English
Journal_Title
Selected Topics in Signal Processing, IEEE Journal of
Publisher
ieee
ISSN
1932-4553
Type
jour
DOI
10.1109/JSTSP.2015.2415761
Filename
7065205
Link To Document