• DocumentCode
    1294719
  • Title

    Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction

  • Author

    Nakatani, Tomohiro ; Yoshioka, Takuya ; Kinoshita, Keisuke ; Miyoshi, Masato ; Juang, Biing-Hwang

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
  • Volume
    18
  • Issue
    7
  • fYear
    2010
  • Firstpage
    1717
  • Lastpage
    1731
  • Abstract
    This paper proposes a statistical model-based speech dereverberation approach that can cancel the late reverberation of a reverberant speech signal captured by distant microphones without prior knowledge of the room impulse responses. With this approach, the generative model of the captured signal is composed of a source process, which is assumed to be a Gaussian process with a time-varying variance, and an observation process modeled by a delayed linear prediction (DLP). The optimization objective for the dereverberation problem is derived to be the sum of the squared prediction errors normalized by the source variances; hence, this approach is referred to as variance-normalized delayed linear prediction (NDLP). Inheriting the characteristic of DLP, NDLP can robustly estimate an inverse system for late reverberation in the presence of noise without greatly distorting a direct speech signal. In addition, owing to the use of variance normalization, NDLP allows us to improve the dereverberation result especially with relatively short (of the order of a few seconds) observations. Furthermore, NDLP can be implemented in a computationally efficient manner in the time-frequency domain. Experimental results demonstrate the effectiveness and efficiency of the proposed approach in comparison with two existing approaches.
  • Keywords
    Gaussian processes; speech processing; Gaussian process; NDLP; delayed linear prediction; dereverberation problem; distant microphones; speech dereverberation; speech signal; time-varying variance; variance-normalized delayed linear prediction; Correlation; Microphones; Optimization; Predictive models; Reverberation; Speech; Speech processing; Dereverberation; inverse filtering; multichannel linear prediction; speech enhancement; statistical model-based signal processing;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2052251
  • Filename
    5547558