• DocumentCode
    3697459
  • Title

    F0 estimation for noisy speech based on exploring local time-frequency segment

  • Author

    Dongmei Wang;John H. L. Hansen;Emily Tobey

  • Author_Institution
    Dept. Electrical Engineering, University of Texas at Dallas
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper, we propose a fundamental frequency (F0) estimation algorithm for noisy speech based on exploring the local time-frequency (TF) segment. Our algorithm is motivated by the fact that the full band speech signal is redundant for pitch perception. We assume that in the same time region, the TF segment least affected by noise interference is more reliable for F0 estimation than the full band spectrum. Our algorithm consists of two main stages. Firstly, the overall TF plane is divided into overlapped TF segments, and then the F0 candidates are estimated from each single TF segment. Secondly, the optimal F0 value is selected from F0 candidates based on signal to noise ratio (SNR) estimation and dynamic programming. The experimental results show that the proposed algorithm outperforms several non-parametric state-of-the-art F0 estimation techniques.
  • Keywords
    "Estimation","Speech","Signal to noise ratio","Harmonic analysis","Noise measurement","Time-frequency analysis","Signal processing algorithms"
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
  • Type

    conf

  • DOI
    10.1109/WASPAA.2015.7336942
  • Filename
    7336942