• DocumentCode
    3426293
  • Title
    Perceptual similarity measurement of speech by combination of acoustic features
  • Author
    Adachi, Yoshihiro ; Kawamoto, Shinichi ; Morishima, Shigeo ; Nakamura, Satoshi
  • Author_Institution
    Spoken Language Commun. Res. Labs., ATR, Kyoto
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4861
  • Lastpage
    4864
  • Abstract
    The Future Cast system is a new entertainment system in which a participant's face is captured and rendered into a movie as an instant computer graphics (CG) movie star; it was first exhibited at the 2005 World Exposition in Aichi, Japan. We are working to add new functionality that maps not only faces but also speech individualities to the cast. Our approach is to find a speaker with the closest speech individuality and apply voice conversion. This paper investigates acoustic features for estimating the perceptual similarity of speech individuality. We propose a method that linearly combines eight acoustic features related to the perception of speech individuality. The proposed method optimizes the weights of the acoustic features with respect to perceptual similarities. We evaluated the method by computing Spearman's rank correlation coefficient against perceptual similarities. The experiments show that the proposed method achieves a correlation coefficient of 0.66.
  • Keywords
    acoustic signal processing; speech processing; speech recognition; acoustic features; cast system; computer graphics; entertainment system; perceptual similarity; speech; speech individuality; voice conversion; Acoustic measurements; Cepstrum; Character generation; Databases; Loudspeakers; Mel frequency cepstral coefficient; Motion pictures; Particle measurements; Speaker recognition; Speech; Acoustic correlators; Speech analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type
    conf
  • DOI
    10.1109/ICASSP.2008.4518746
  • Filename
    4518746