DocumentCode
3426293
Title
Perceptual similarity measurement of speech by combination of acoustic features
Author
Adachi, Yoshihiro ; Kawamoto, Shinichi ; Morishima, Shigeo ; Nakamura, Satoshi
Author_Institution
Spoken Language Commun. Res. Labs., ATR, Kyoto
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
4861
Lastpage
4864
Abstract
The Future Cast System is a new entertainment system in which a participant's face is captured and rendered into a movie as an instant computer graphics (CG) movie star; it was first exhibited at the 2005 World Exposition in Aichi, Japan. We are working to add new functionality that maps not only faces but also speech individualities to the cast. Our approach is to find a speaker with the closest speech individuality and apply voice conversion. This paper investigates acoustic features for estimating the perceptual similarity of speech individuality. We propose a method that linearly combines eight acoustic features related to the perception of speech individuality. The proposed method optimizes the weights of the acoustic features with respect to perceptual similarities. We evaluated the method using Spearman's rank correlation coefficient against perceptual similarities. The experiments showed that the proposed method achieves a correlation coefficient of 0.66.
Keywords
acoustic signal processing; speech processing; speech recognition; acoustic features; cast system; computer graphics; entertainment system; perceptual similarity; speech; speech individuality; voice conversion; Acoustic measurements; Cepstrum; Character generation; Databases; Loudspeakers; Mel frequency cepstral coefficient; Motion pictures; Particle measurements; Speaker recognition; Speech; Acoustic correlators; Speech analysis
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518746
Filename
4518746