DocumentCode
1856080
Title
Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach
Author
Guner, Ekrem ; Mohammadi, Amir ; Demiroglu, Cenk
Author_Institution
Ozyegin Univ., Istanbul, Turkey
fYear
2012
fDate
27-31 Aug. 2012
Firstpage
2055
Lastpage
2059
Abstract
Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection systems, can be generated with the SSS approach. However, a well-known issue with SSS is the lack of voice similarity to the target speaker. The issue arises both in speaker-dependent models and models that are adapted from average voices. Moreover, in speaker adaptation, similarity to the target speaker does not increase significantly after around one minute of adaptation data which potentially indicates inherent bottleneck(s) in the system. Here, we propose using the hybrid speech synthesis approach to understand the key factors behind the speaker similarity problem. To that end, we try to answer the following question: which segments and parameters of speech, if generated/synthesized better, would have a substantial improvement on speaker similarity? In this work, our hybrid methods are described and listening test results are presented and discussed.
Keywords
speaker recognition; speech synthesis; statistical analysis; SSS approach; adaptation data; hybrid approach; speaker similarity analysis; speaker-dependent models; speech transitions; statistical speech synthesis systems; target speaker; unit selection systems; Acoustics; Hidden Markov models; Hybrid power systems; Speech; Speech synthesis; Training; Trajectory; hybrid synthesis; speaker adaptation; speaker similarity; speech synthesis; statistical speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
Conference_Location
Bucharest
ISSN
2219-5491
Print_ISBN
978-1-4673-1068-0
Type
conf
Filename
6334238
Link To Document