Title :
Prediction of tone naturalness perception using geometric model
Author :
Trakulsuk, Konlawachara ; Suchato, Atiwong ; Punyabukkana, Proadpran ; Wutiwiwatchai, Chai
Author_Institution :
Dept. of Comput. Eng., Chulalongkorn Univ., Bangkok, Thailand
Abstract :
Naturalness is an important issue in the Text-To-Speech (TTS) system. To support arbitrarily defined pitch contours for any synthesized syllables, a TTS should be able to maintain the naturalness of the synthetic speech. This work proposed an automatic evaluation of pitch contours in order to determine the level of naturalness of synthesized syllables when perceived by human listeners. By analyzing results, tone perception experiments conducted on human listeners in this work, a syllable tone naturalness prediction model based on the midpoint and endpoint of the syllable´s rhyme part was proposed. The model was then used for developing a tone naturalness prediction algorithm using geometric models of pitch contours. The evaluation of the tone naturalness prediction algorithm involved human listeners perceiving the naturalness of syllables with 45 pitch contour patterns, each of which with 2 repetitions. The proposed algorithm achieved approximately 80% consistency rate compared against human listeners´ decisions on tone naturalness of the syllables.
Keywords :
geometry; speech processing; speech synthesis; TTS; geometric model; pitch contours; syllable tone naturalness prediction model; synthesized syllable naturalness; synthetic speech naturalness; text-to-speech system; tone naturalness perception prediction; tone naturalness prediction algorithm; Thai tones; fundamental frequencies; naturalness model; pitch contours; speech naturalness; tone perception;
Conference_Titel :
Computer Science and Software Engineering (JCSSE), 2014 11th International Joint Conference on
Conference_Location :
Chon Buri
Print_ISBN :
978-1-4799-5821-4
DOI :
10.1109/JCSSE.2014.6841845