DocumentCode :
61947
Title :
Phase Processing for Single-Channel Speech Enhancement: History and recent advances
Author :
Gerkmann, Timo ; Krawczyk-Becker, Martin ; Le Roux, Jonathan
Author_Institution :
Dept. Med. Phys. & Acoust., Univ. Oldenburg, Oldenburg, Germany
Volume :
32
Issue :
2
fYear :
2015
fDate :
Mar-15
Firstpage :
55
Lastpage :
66
Abstract :
With the advancement of technology, both assisted listening devices and speech communication devices are becoming more portable and also more frequently used. As a consequence, users of devices such as hearing aids, cochlear implants, and mobile telephones expect their devices to work robustly anywhere and at any time. This holds in particular for challenging noisy environments like a cafeteria, a restaurant, a subway, a factory, or in traffic. One way to make assisted listening devices robust to noise is to apply speech enhancement algorithms. To improve the corrupted speech, spatial diversity can be exploited by a constructive combination of microphone signals (so-called beamforming), and by exploiting the different spectro-temporal properties of speech and noise. Here, we focus on single-channel speech enhancement algorithms which rely on spectro-temporal properties. On the one hand, these algorithms can be employed when the miniaturization of devices only allows for using a single microphone. On the other hand, when multiple microphones are available, single-channel algorithms can be employed as a postprocessor at the output of a beamformer. To exploit the short-term stationary properties of natural sounds, many of these approaches process the signal in a time-frequency representation, most frequently the short-time discrete Fourier transform (STFT) domain. In this domain, the coefficients of the signal are complex-valued, and can therefore be represented by their absolute value (referred to in the literature both as STFT magnitude and STFT amplitude) and their phase. While the modeling and processing of the STFT magnitude has been the center of interest in the past three decades, phase has been largely ignored.
Keywords :
array signal processing; discrete Fourier transforms; hearing aids; speech enhancement; assisted listening devices; beamforming; cochlear implants; corrupted speech; mobile telephones; phase processing; short-time discrete Fourier transform domain; single-channel speech enhancement; spatial diversity; speech communication devices; time-frequency representation; Acoustic noise; Acoustic signal processing; Assistive devices; Cochlear implants; Implants; Noise measurement; Spectrogram; Speech enhancement; Time-domain analysis; Time-frequency analysis
fLanguage :
English
Journal_Title :
Signal Processing Magazine, IEEE
Publisher :
IEEE
ISSN :
1053-5888
Type :
jour
DOI :
10.1109/MSP.2014.2369251
Filename :
7038277