• DocumentCode
    310557
  • Title

    Automatic generation of speech synthesis units based on closed loop training

  • Author

    Kagoshima, Takehiko ; Akamine, Masami

  • Author_Institution
    Res. & Dev. Center, Toshiba Corp., Kawasaki, Japan
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    963
  • Abstract
    This paper proposes a new method for automatically generating speech synthesis units. A small set of synthesis units is selected from a large speech database by the proposed closed loop training method (CLT). Because CLT is based on the evaluation and minimization of the distortion caused by the synthesis process such as prosodic modification: the selected synthesis units are most suitable for synthesizers. The CLT is applied to a waveform concatenation based synthesizer, whose basic unit is CV/VC (diphone). It is shown that synthesis units can be efficiently generated by CLT from a labeled speech database with a small amount of computation. Moreover, the synthesized speech is clear and smooth even though the storage size of the waveform dictionary is small
  • Keywords
    speech processing; speech synthesis; waveform analysis; automatic generation; closed loop training; diphone; distortion evaluation; distortion minimization; labeled speech database; prosodic modification; speech synthesis units; speech synthesizers; storage size; synthesis process; waveform concatenation based synthesizer; waveform dictionary; Clustering methods; Databases; Degradation; Dictionaries; Minimization methods; Research and development; Speech synthesis; Synthesizers; Training data; Virtual colonoscopy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596098
  • Filename
    596098