DocumentCode
1522908
Title
Synthesis of Child Speech With HMM Adaptation and Voice Conversion
Author
Watts, Oliver ; Yamagishi, Junichi ; King, Simon ; Berkling, Kay
Author_Institution
Centre for Speech Technology Research, University of Edinburgh, Edinburgh, UK
Volume
18
Issue
5
fYear
2010
fDate
7/1/2010
Firstpage
1005
Lastpage
1016
Abstract
The synthesis of child speech presents challenges both in the collection of data and in the building of a synthesizer from that data. We chose to build a statistical parametric synthesizer using the hidden Markov model (HMM)-based system HTS, as this technique has previously been shown to perform well with limited amounts of data and with data collected under imperfect conditions. Six different configurations of the synthesizer were compared, using both speaker-dependent and speaker-adaptive modeling techniques and varying amounts of data. For comparison with HMM adaptation, techniques from voice conversion were used to transform existing synthesizers to the characteristics of the target speaker. Speaker-adaptive voices generally outperformed child speaker-dependent voices in the evaluation. HMM adaptation outperformed voice-conversion-style techniques when using the full target speaker corpus; with less adaptation data, however, no significant listener preference for either HMM adaptation or voice conversion methods was found.
Keywords
hidden Markov models (HMMs); speech synthesis; HMM adaptation techniques; child speech synthesis; hidden Markov model; speaker adaptive modeling technique; speaker dependent technique; speaker-adaptive voice; statistical parametric synthesizer; target speaker corpus; voice conversion; Children
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2035029
Filename
5299003