Title :
Prosody modification in Filipino speech synthesis using dynamic time warping
Author :
Co, Melvin O. ; Guevara, Rowena Cristina L
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of the Philippines, Quezon, Philippines
Abstract :
Prosody is composed of two components: microprosody and macroprosody. Microprosody is solely influenced by individual speech sounds while macroprosody is subject to the speaker´s choice of intonation. The paper deals with the latter and describes the result of using dynamic time warping (DTW) for changing the macroprosody of speech segments in a concatenative synthesizer. Prerecorded Filipino words uttered in isolation are stored in a corpus. When a text is typed, the utterances of the words are searched in the corpus. The acoustical features of macroprosody are extracted from prerecorded utterances of selected Filipino sentences and modified through DTW. The acoustic features are embedded in the speech segments by using the TD-PSOLA. This synthesis process achieves an average MOS of 2.64 in the acceptability test. In a separate test procedure, results showed 98% of the synthesized Filipino sentences were accurately distinguished as either declarative or interrogative sentences.
Keywords :
linguistics; speech processing; speech synthesis; Filipino speech synthesis; acoustic features; concatenative synthesizer; declarative sentences; dynamic time warping; interrogative sentences; intonation; macroprosody; microprosody; prosody modification; Acoustic testing; Acoustical engineering; Cities and towns; Concatenated codes; Fatigue; Loudspeakers; Natural languages; Speech synthesis; Stress; Synthesizers;
Conference_Titel :
TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
Print_ISBN :
0-7803-8162-9
DOI :
10.1109/TENCON.2003.1273353