• DocumentCode
    3424895
  • Title

    Comparative evaluations of robust and accurate F0 estimates in reverberant environments

  • Author

    Unoki, Masashi ; Hosorogiya, Toshihiro ; Ishimoto, Yuichi

  • Author_Institution
    Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Ishikawa
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4569
  • Lastpage
    4572
  • Abstract
    This paper reports comparative evaluations of the method we previously proposed of estimating fundamental frequency (F0) based on complex cepstrum analysis with nine typical methods over huge speech-sound datasets in both artificial and realistic reverberant environments (in room acoustics). They involve several classic algorithms (Cepstrum, AMDF, TPC, and modified autocorrelation) and a few modern algorithms (TEMPO, YIN, and PHIA). The comparative results revealed that the percentage correct rates of the estimated FOs using them were drastically reduced as the reverberation time increased while Fo estimated with the proposed method was completely robust and accurate. They also demonstrated that homomorphic analysis and the concept of a source-filter model were relatively effective for estimating Fo. The results also demonstrated that it was much better than the previously reported methods in terms of robustness and providing accurate F0 estimates in both artificial and realistic reverberant environments.
  • Keywords
    cepstral analysis; frequency estimation; reverberation; speech processing; F0 estimation; cepstrum analysis; fundamental frequency estimation; homomorphic analysis; reverberant environments; reverberant speech; reverberation time; source-filter model; speech-sound datasets; Acoustic noise; Cepstral analysis; Cepstrum; Frequency estimation; Noise robustness; Power harmonic filters; Reverberation; Speech analysis; Speech enhancement; Working environment noise; F0 estimation; MTF concept; complex cepstrum analysis; reverberant speech; source-filter model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518673
  • Filename
    4518673