DocumentCode
607915
Title
Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis
Author
Mohammadi, Arash ; Demiroglu, Cenk
Author_Institution
Electr. & Electron. Eng., Ozyegin Univ., Istanbul, Turkey
fYear
2013
fDate
24-26 April 2013
Firstpage
1
Lastpage
4
Abstract
Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection systems, can be generated with the SSS approach. Another advantage is the ability to adapt to a target speaker with a couple of minutes of adaptation data. However, many applications, especially in consumer electronics, require adaptation with only a few adaptation utterances. Here, we propose a rapid adaptation technique that first attempt to select a reference model that is close to the target speaker given a distance measure. Then, as opposed to adapting to target speaker from an average model, as typically done in most systems, adaptation is performed from the new reference model. The proposed system significantly outperformed a state-of-the-art baseline system both in objective and subjective tests especially only when one utterance is available for adaptation.
Keywords
hidden Markov models; speech synthesis; statistical analysis; HMM-based speech synthesis; SSS approach; adaptation utterances; average model; consumer electronics; distance measure; nearest neighbor approach; rapid adaptation technique; reference model; smooth speech transitions; speaker adaptation data; statistical speech synthesis approach; unit selection systems; Adaptation models; Decision trees; Hidden Markov models; Speech; Speech synthesis; Training; Vectors; speaker adaptation; speaker similarity; speech synthesis; statistical speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Communications Applications Conference (SIU), 2013 21st
Conference_Location
Haspolat
Print_ISBN
978-1-4673-5562-9
Electronic_ISBN
978-1-4673-5561-2
Type
conf
DOI
10.1109/SIU.2013.6531576
Filename
6531576
Link To Document