DocumentCode :
1761512
Title :
Optimal state duration assignment in hidden Markov model-based text-to-speech synthesis system
Author :
Khan, Najeeb Ullah ; Jung-Chul Lee
Author_Institution :
Sch. of Electr. Eng., Univ. of Ulsan, Ulsan, South Korea
Volume :
51
Issue :
12
fYear :
2015
fDate :
6 11 2015
Firstpage :
941
Lastpage :
943
Abstract :
In state-of-the-art text-to-speech (TTS) systems, the state durations for each phoneme are generated so as to maximise the state sequence probability, subject to the constraint that the sum of all state durations equals the phoneme duration. This maximisation sometimes yields negative state durations when the specified phoneme duration is less than the sum of the means of all the states of the phoneme. This discrepancy implicitly violates the equality constraint, which has implications for speech research problems in which each phoneme duration is specified. One such problem is the use of a TTS synthesis system for singing voice synthesis research. An algorithm for state duration assignment is derived that maximises the probability of the state sequence under two constraints: the sum of the state durations must equal the total duration of the phoneme, and every state duration must be greater than or equal to 1. Experimental results show that the proposed algorithm always produces state durations greater than or equal to 1 while satisfying the equality constraint.
Keywords :
hidden Markov models; probability; speech synthesis; HMM-based text-to-speech synthesis system; TTS synthesis system; equality constraint; optimal state duration assignment; phoneme duration; singing voice synthesis; state sequence probability;
fLanguage :
English
Journal_Title :
Electronics Letters
Publisher :
IET
ISSN :
0013-5194
Type :
jour
DOI :
10.1049/el.2015.0539
Filename :
7122460
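The constrained assignment described in the abstract can be illustrated with a minimal sketch. This is not taken from the paper: it assumes Gaussian state duration models, treats durations as real numbers (practical systems work with integer frame counts), and solves the problem with a simple clamp-and-resolve iteration. The function name `assign_state_durations` and its parameters are hypothetical. Without the lower bound, the familiar solution is d_k = m_k + rho * v_k with rho = (T - sum of m_k) / (sum of v_k), which can go negative when T is small.

```python
def assign_state_durations(means, variances, total, min_dur=1.0):
    """Sketch: maximise a product of Gaussian duration likelihoods subject to
    sum(d) == total and d[k] >= min_dur. Assumes all variances are positive.
    Illustrative only; not the algorithm published in the paper."""
    n = len(means)
    if total < n * min_dur:
        raise ValueError("total duration too short for the minimum per state")

    durations = [min_dur] * n
    free = set(range(n))       # states not yet clamped to min_dur
    remaining = float(total)   # duration left for the free states
    eps = 1e-12                # tolerance against floating-point round-off

    while free:
        # Unconstrained optimum restricted to the free states.
        rho = (remaining - sum(means[k] for k in free)) / sum(
            variances[k] for k in free
        )
        below = [k for k in free if means[k] + rho * variances[k] < min_dur - eps]
        if not below:
            for k in free:
                durations[k] = means[k] + rho * variances[k]
            break
        # Clamp violating states to the minimum and solve again for the rest.
        for k in below:
            free.remove(k)
            remaining -= min_dur
    return durations


# Hypothetical example: the unconstrained formula would give the middle
# state a negative duration here (10 - 14/12 * 10 < 0).
example = assign_state_durations([5.0, 10.0, 5.0], [1.0, 10.0, 1.0], total=6)
```

In this example the high-variance middle state is clamped to 1 and the remaining duration is shared between the other two states, so the durations still sum to the requested phoneme duration.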