DocumentCode :
3412815
Title :
Multi-channel linear prediction-based speech dereverberation with low-rank power spectrogram approximation
Author :
Jukic, Ante ; Mohammadiha, Nasser ; van Waterschoot, Toon ; Gerkmann, Timo ; Doclo, Simon
Author_Institution :
Dept. of Med. Phys. & Acoust., Univ. of Oldenburg, Oldenburg, Germany
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
96
Lastpage :
100
Abstract :
In many acoustic conditions the recorded speech signals may be severely affected by reverberation, leading to a reduced speech quality and intelligibility. In this paper we focus on a blind speech dereverberation method based on multi-channel linear prediction (MCLP) in the short-time Fourier transform domain, which is typically performed in each frequency bin independently without taking into account the spectral structure of the speech signal. Since it is widely accepted that a speech spectrogram can be well approximated with a low-rank matrix, e.g., using a spectral dictionary, in this paper we propose to incorporate a low-rank matrix approximation of the speech spectrogram into the MCLP-based speech dereverberation. The low-rank approximation is obtained using nonnegative matrix factorization with Itakura-Saito divergence. Experimental results for several measured acoustic systems show that incorporating a low-rank approximation improves the dereverberation performance in terms of instrumental speech quality measures.
Keywords :
Fourier transforms; approximation theory; matrix algebra; reverberation; speech processing; Itakura-Saito divergence; MCLP; acoustic conditions; blind speech dereverberation method; low rank power spectrogram approximation; low-rank matrix approximation; multichannel linear prediction; nonnegative matrix factorization; recorded speech signals; short-time Fourier transform domain; spectral structure; speech dereverberation; speech signal; speech spectrogram; Approximation methods; Dictionaries; Estimation; Gold; Noise; Optimization; Spectrogram; low-rank approximation; multi-channel linear prediction; nonnegative matrix factorization; speech dereverberation; speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7177939
Filename :
7177939
Link To Document :
بازگشت