• DocumentCode
    3093875
  • Title

    Listen to the parrot: Demonstrating the quality of online pitch and formant extraction via feature-based resynthesis

  • Author

    Heckmann, Martin ; Glaser, Claudius ; Vaz, Miguel ; Rodemann, Tobias ; Joublin, Frank ; Goerick, Christian

  • Author_Institution
    Honda Res. Inst. Eur. GmbH, Offenbach am Main
  • fYear
    2008
  • fDate
    22-26 Sept. 2008
  • Firstpage
    1699
  • Lastpage
    1704
  • Abstract
    We present a system for online extraction of the fundamental frequency and the first four formant frequencies from a speech signal. In order to evaluate the performance of the extraction a resynthesis of the speech signal is performed. The resynthesis is based on the extracted frequencies and the energy of the input signal at the formant locations. The extraction of the fundamental frequency and the formants is robust against room echoes and interfering noise. In order to improve the robustness against background noise a noise reduction was implemented. Tests in three rooms of different size at varying distances to the system (up to 8 m yielding an SNR of approx. 0 dB) were performed.
  • Keywords
    feature extraction; speech enhancement; speech synthesis; background noise; feature-based resynthesis; formant extraction; noise reduction; online extraction; online pitch quality; speech signal; Bayesian methods; Distance measurement; Filter bank; Harmonic analysis; Power harmonic filters; Speech; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
  • Conference_Location
    Nice
  • Print_ISBN
    978-1-4244-2057-5
  • Type

    conf

  • DOI
    10.1109/IROS.2008.4650923
  • Filename
    4650923