• DocumentCode
    394242
  • Title

    Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation

  • Author

    Kawahara, Hideki ; Matsui, H.

  • Author_Institution
    Fac. of Syst. Eng., Wakayama Univ., Japan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    An elastic spectral distance measure based on a F0 adaptive pitch synchronous spectral estimation and selective elimination of periodicity interferences, that was developed for a high-quality speech modification procedure STRAIGHT [1], is introduced to provide a basis for auditory morphing. The proposed measure is implemented on a low dimensional piecewise bilinear time-frequency mapping between the target and the original speech representations. A preliminary test results of morphing emotional speech samples indicated that proposed procedure provides perceptually monotonic and high-quality interpolation and extrapolation of CD quality speech samples.
  • Keywords
    extrapolation; interference (signal); interpolation; signal representation; signal sampling; spectral analysis; speech intelligibility; speech processing; time-frequency analysis; CD quality speech samples; F0 adaptive pitch synchronous spectral estimation; STRAIGHT; auditory morphing; elastic perceptual distance metric; emotional speech samples morphing; fundamental frequency; high-quality extrapolation; high-quality interpolation; high-quality speech modification; interference-free time-frequency representation; low dimensional bilinear time-frequency mapping; perceptually monotonic interpolation; periodicity interference elimination; piecewise bilinear time-frequency mapping; speech representations; Application software; Automatic control; Extrapolation; Human computer interaction; Interference; Interpolation; Speech; Systems engineering and theory; Testing; Time frequency analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198766
  • Filename
    1198766