• DocumentCode
    2853072
  • Title

    Unit selection algorithm for Japanese speech synthesis based on both phoneme unit and diphone unit

  • Author

    Toda, Tomoki ; Kawai, Hisashi ; Tsuzaki, Minoru ; Shikano, Kiyohiro

  • Author_Institution
    ATR Spoken Language Translation Research Laboratories, 2-2-2 Hikaridai Seika-cho Soraku-gun Kyoto, 619-0288 Japan
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    This paper proposes a novel unit selection algorithm for Japanese Text-To-Speech (TTS) systems. Since Japanese syllables consist of CV (C: Consonant, V: Vowel) or V, except when a vowel is devoiced, CV units are basic to concatenative TTS systems for Japanese. However, speech synthesized with CV units sometimes have discontinuities due to V-V concatenation; In order to alleviate such discontinuities, longer units (CV* or non-uniform units) have been proposed. However, the concatenation between V and V is still unavoidable. To address this problem, we propose a novel unit selection algorithm that incorporates not only phoneme units but also diphone units. The concatenation in the proposed algorithm is performed at the vowel center as well as at the phoneme boundary. Results of evaluation experiments clarify that the proposed algorithm outperforms the conventional algorithm.
  • Keywords
    Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743755
  • Filename
    5743755