• DocumentCode
    3412815
  • Title

    Multi-channel linear prediction-based speech dereverberation with low-rank power spectrogram approximation

  • Author

    Jukic, Ante ; Mohammadiha, Nasser ; van Waterschoot, Toon ; Gerkmann, Timo ; Doclo, Simon

  • Author_Institution
    Dept. of Med. Phys. & Acoust., Univ. of Oldenburg, Oldenburg, Germany
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    96
  • Lastpage
    100
  • Abstract
    In many acoustic conditions the recorded speech signals may be severely affected by reverberation, leading to a reduced speech quality and intelligibility. In this paper we focus on a blind speech dereverberation method based on multi-channel linear prediction (MCLP) in the short-time Fourier transform domain, which is typically performed in each frequency bin independently without taking into account the spectral structure of the speech signal. Since it is widely accepted that a speech spectrogram can be well approximated with a low-rank matrix, e.g., using a spectral dictionary, in this paper we propose to incorporate a low-rank matrix approximation of the speech spectrogram into the MCLP-based speech dereverberation. The low-rank approximation is obtained using nonnegative matrix factorization with Itakura-Saito divergence. Experimental results for several measured acoustic systems show that incorporating a low-rank approximation improves the dereverberation performance in terms of instrumental speech quality measures.
  • Keywords
    Fourier transforms; approximation theory; matrix algebra; reverberation; speech processing; Itakura-Saito divergence; MCLP; acoustic conditions; blind speech dereverberation method; low rank power spectrogram approximation; low-rank matrix approximation; multichannel linear prediction; nonnegative matrix factorization; recorded speech signals; short-time Fourier transform domain; spectral structure; speech dereverberation; speech signal; speech spectrogram; Approximation methods; Dictionaries; Estimation; Gold; Noise; Optimization; Spectrogram; low-rank approximation; multi-channel linear prediction; nonnegative matrix factorization; speech dereverberation; speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177939
  • Filename
    7177939