• DocumentCode
    82135
  • Title

    Harmonic Phase Estimation in Single-Channel Speech Enhancement Using Phase Decomposition and SNR Information

  • Author

    Mowlaee, Pejman ; Kulmer, Josef

  • Author_Institution
    Dept. of Electr. Eng., Graz Univ. of Technol., Graz, Austria
  • Volume
    23
  • Issue
    9
  • fYear
    2015
  • fDate
    Sept. 2015
  • Firstpage
    1521
  • Lastpage
    1532
  • Abstract
    In conventional single-channel speech enhancement, typically the noisy spectral amplitude is modified while the noisy phase is used to reconstruct the enhanced signal. Several recent attempts have shown the effectiveness of utilizing an improved spectral phase for phase-aware speech enhancement and consequently its positive impact on the perceived speech quality. In this paper, we present a harmonic phase estimation method relying on fundamental frequency and signal-to-noise ratio (SNR) information estimated from noisy speech. The proposed method relies on SNR-based time-frequency smoothing of the unwrapped phase obtained from the decomposition of the noisy phase. To incorporate the uncertainty in the estimated phase due to unreliable voicing decision and SNR estimate, we propose a binary hypothesis test assuming speech-present and speech-absent classes representing high and low SNRs. The effectiveness of the proposed phase estimation method is evaluated for both phase-only enhancement of noisy speech and in combination with an amplitude-only enhancement scheme. We show that by enhancing the noisy phase both perceived speech quality as well as speech intelligibility are improved as predicted by the instrumental metrics and justified by subjective listening tests.
  • Keywords
    phase estimation; smoothing methods; speech enhancement; speech intelligibility; time-frequency analysis; SNR information; SNR-based time-frequency smoothing; amplitude-only enhancement scheme; binary hypothesis test; harmonic phase estimation method; noisy spectral amplitude; phase decomposition; phase-aware speech enhancement; phase-only enhancement; signal reconstruction; single-channel speech enhancement; speech intelligibility; speech-absent classes; speech-present classes; Harmonic analysis; Noise measurement; Phase estimation; Smoothing methods; Speech; Speech enhancement; Harmonic model; noise reduction; phase decomposition; phase estimation; von Mises distribution;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2439038
  • Filename
    7115078