Multi-channel linear prediction-based speech dereverberation with low-rank power spectrogram approximation

Author

Jukic, Ante ; Mohammadiha, Nasser ; van Waterschoot, Toon ; Gerkmann, Timo ; Doclo, Simon

Author_Institution

Dept. of Med. Phys. & Acoust., Univ. of Oldenburg, Oldenburg, Germany

fYear

2015

fDate

19-24 April 2015

Firstpage

96

Lastpage

100

Abstract

In many acoustic conditions the recorded speech signals may be severely affected by reverberation, leading to a reduced speech quality and intelligibility. In this paper we focus on a blind speech dereverberation method based on multi-channel linear prediction (MCLP) in the short-time Fourier transform domain, which is typically performed in each frequency bin independently without taking into account the spectral structure of the speech signal. Since it is widely accepted that a speech spectrogram can be well approximated with a low-rank matrix, e.g., using a spectral dictionary, in this paper we propose to incorporate a low-rank matrix approximation of the speech spectrogram into the MCLP-based speech dereverberation. The low-rank approximation is obtained using nonnegative matrix factorization with Itakura-Saito divergence. Experimental results for several measured acoustic systems show that incorporating a low-rank approximation improves the dereverberation performance in terms of instrumental speech quality measures.

Keywords

Fourier transforms; approximation theory; matrix algebra; reverberation; speech processing; Itakura-Saito divergence; MCLP; acoustic conditions; blind speech dereverberation method; low rank power spectrogram approximation; low-rank matrix approximation; multichannel linear prediction; nonnegative matrix factorization; recorded speech signals; short-time Fourier transform domain; spectral structure; speech dereverberation; speech signal; speech spectrogram; Approximation methods; Dictionaries; Estimation; Gold; Noise; Optimization; Spectrogram; low-rank approximation; multi-channel linear prediction; nonnegative matrix factorization; speech dereverberation; speech enhancement;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location

South Brisbane, QLD

Type

conf

DOI

10.1109/ICASSP.2015.7177939

Filename

7177939