• DocumentCode
    2262601
  • Title

    Quantitative analysis of the local speech rate and its application to speech synthesis

  • Author

    OHNO, Sumio ; Fukumiya, Masamichi ; Fujisaki, Hiroya

  • Author_Institution
    Dept. of Appl. Electron., Sci. Univ. of Tokyo, Japan
  • Volume
    4
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    2254
  • Abstract
    On the basis of the short-time relative speech rate defined by the authors, this paper examines the optimum width of the smoothing window by perceptual experiments on the naturalness of re-synthesized speech. With the optimum window of 270 ms, relative speech rates are obtained both for `fast´ and `slow´ utterances of the same sentence, using an utterance produced at a `normal´ speech rate. The averaged results show that the speech rate control function for an utterance can be approximately decomposed into a global component for each sentence and local components for each bunsetsu and each major syntactic boundary. Based on these results, a scheme is presented for controlling the local speech rate of a reference utterance to obtain a synthetic utterance of an arbitrary global speech rate
  • Keywords
    speech processing; speech synthesis; 270 ms; bunsetsu; fast utterances; global speech rate; local speech rate; quantitative analysis; reference utterance; resynthesized speech naturalness perception; sentences; short-time relative speech rate; slow utterances; smoothing window optimal width; speech rate control function; speech synthesis; syntactic boundary; Acoustic measurements; Frequency measurement; Natural languages; Size measurement; Smoothing methods; Speech analysis; Speech synthesis; Stress; Time measurement; Timing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607255
  • Filename
    607255