DocumentCode
2262601
Title
Quantitative analysis of the local speech rate and its application to speech synthesis
Author
OHNO, Sumio ; Fukumiya, Masamichi ; Fujisaki, Hiroya
Author_Institution
Dept. of Appl. Electron., Sci. Univ. of Tokyo, Japan
Volume
4
fYear
1996
fDate
3-6 Oct 1996
Firstpage
2254
Abstract
On the basis of the short-time relative speech rate defined by the authors, this paper examines the optimum width of the smoothing window by perceptual experiments on the naturalness of re-synthesized speech. With the optimum window of 270 ms, relative speech rates are obtained both for `fast´ and `slow´ utterances of the same sentence, using an utterance produced at a `normal´ speech rate. The averaged results show that the speech rate control function for an utterance can be approximately decomposed into a global component for each sentence and local components for each bunsetsu and each major syntactic boundary. Based on these results, a scheme is presented for controlling the local speech rate of a reference utterance to obtain a synthetic utterance of an arbitrary global speech rate
Keywords
speech processing; speech synthesis; 270 ms; bunsetsu; fast utterances; global speech rate; local speech rate; quantitative analysis; reference utterance; resynthesized speech naturalness perception; sentences; short-time relative speech rate; slow utterances; smoothing window optimal width; speech rate control function; speech synthesis; syntactic boundary; Acoustic measurements; Frequency measurement; Natural languages; Size measurement; Smoothing methods; Speech analysis; Speech synthesis; Stress; Time measurement; Timing;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607255
Filename
607255
Link To Document