DocumentCode
2132493
Title
Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model
Author
Hirose, Keikichi ; Matsuda, Tatsuya ; Hashimoto, Hiroya ; Minematsu, Nobuaki
Author_Institution
Dept. of Inf. & Commun. Eng., Univ. of Tokyo, Tokyo, Japan
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
1
Lastpage
6
Abstract
Frame-by-frame representation is not appropriate for prosodic features, which are tightly related to speech units spreading a wide time span, such as words, phrases and so on. This causes an inherit problem in fundamental frequency (F0) contour generation by HMM-based speech synthesis. A method is developed to modify F0 contours in the framework of a generation process model by referring to linguistic information of input text (word boundary and accent type). It takes F0 variances obtained through HMM-based speech synthesis into account during the process. Through a listening experiment on synthetic speech, the method is proved to generate better quality as compared to the HMM-based speech synthesis on average. Since the generation process model can clearly relate its commands and linguistic (and para-/non- linguistic) information, the method has an additional advantage; changing speech styles, and /or adding further information (such as emphasis) can be easily done through manipulating the commands.
Keywords
hidden Markov models; speech synthesis; HMM; accent type; command manipulation; fundamental frequency contour generation; fundamental frequency contour representation; generation process model; speech synthesis; word boundary; Frequency synthesizers; Hidden Markov models; Mathematical model; Pragmatics; Speech; Speech synthesis; HMM-based speech synthesis; flexible control; fundamental frequency contour; generation process model; linguistic information;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning for Signal Processing (MLSP), 2011 IEEE International Workshop on
Conference_Location
Santander
ISSN
1551-2541
Print_ISBN
978-1-4577-1621-8
Electronic_ISBN
1551-2541
Type
conf
DOI
10.1109/MLSP.2011.6064596
Filename
6064596
Link To Document