DocumentCode
404825
Title
Prosody modification in Filipino speech synthesis using dynamic time warping
Author
Co, Melvin O. ; Guevara, Rowena Cristina L
Author_Institution
Dept. of Electr. & Electron. Eng., Univ. of the Philippines, Quezon, Philippines
Volume
1
fYear
2003
fDate
15-17 Oct. 2003
Firstpage
397
Abstract
Prosody is composed of two components: microprosody and macroprosody. Microprosody is solely influenced by individual speech sounds while macroprosody is subject to the speaker´s choice of intonation. The paper deals with the latter and describes the result of using dynamic time warping (DTW) for changing the macroprosody of speech segments in a concatenative synthesizer. Prerecorded Filipino words uttered in isolation are stored in a corpus. When a text is typed, the utterances of the words are searched in the corpus. The acoustical features of macroprosody are extracted from prerecorded utterances of selected Filipino sentences and modified through DTW. The acoustic features are embedded in the speech segments by using the TD-PSOLA. This synthesis process achieves an average MOS of 2.64 in the acceptability test. In a separate test procedure, results showed 98% of the synthesized Filipino sentences were accurately distinguished as either declarative or interrogative sentences.
Keywords
linguistics; speech processing; speech synthesis; Filipino speech synthesis; acoustic features; concatenative synthesizer; declarative sentences; dynamic time warping; interrogative sentences; intonation; macroprosody; microprosody; prosody modification; Acoustic testing; Acoustical engineering; Cities and towns; Concatenated codes; Fatigue; Loudspeakers; Natural languages; Speech synthesis; Stress; Synthesizers;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2003. Conference on Convergent Technologies for the Asia-Pacific Region
Print_ISBN
0-7803-8162-9
Type
conf
DOI
10.1109/TENCON.2003.1273353
Filename
1273353
Link To Document