Title :
Quantitative analysis of the local speech rate and its application to speech synthesis
Author :
Ohno, Sumio ; Fukumiya, Masamichi ; Fujisaki, Hiroya
Author_Institution :
Dept. of Applied Electronics, Science University of Tokyo, Japan
Abstract :
On the basis of the short-time relative speech rate defined by the authors, this paper examines the optimum width of the smoothing window by perceptual experiments on the naturalness of re-synthesized speech. With the optimum window of 270 ms, relative speech rates are obtained for both 'fast' and 'slow' utterances of the same sentence, using an utterance produced at a 'normal' speech rate as the reference. The averaged results show that the speech rate control function for an utterance can be approximately decomposed into a global component for each sentence and local components for each bunsetsu and each major syntactic boundary. Based on these results, a scheme is presented for controlling the local speech rate of a reference utterance to obtain a synthetic utterance of an arbitrary global speech rate.
Keywords :
speech processing; speech synthesis; 270 ms; bunsetsu; fast utterances; global speech rate; local speech rate; quantitative analysis; reference utterance; resynthesized speech naturalness perception; sentences; short-time relative speech rate; slow utterances; smoothing window optimal width; speech rate control function; syntactic boundary; Acoustic measurements; Frequency measurement; Natural languages; Size measurement; Smoothing methods; Speech analysis; Stress; Time measurement; Timing
Conference_Title :
Proceedings of the Fourth International Conference on Spoken Language (ICSLP 96), 1996
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607255