• DocumentCode
    703516
  • Title

    Comparison of two different text-to-speech alignment systems: Speech synthesis based vs. hybrid HMM/ANN

  • Author

    Deroo, O. ; Malfrere, F. ; Dutoit, T.

  • Author_Institution
    Dept. of Circuits Theor. & Signal Process., Facult Polytech. de Mons, Mons, Belgium
  • fYear
    1998
  • fDate
    8-11 Sept. 1998
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper we compare two different methods for phonetically labeling a speech database. The first approach is based on the alignment of the speech signal on a high quality synthetic speech pattern, and the second one uses a hybrid HMM/ANN system. Both systems have been evaluated on French read utterances from a speaker never seen in the training stage of the HMM/ANN system and manually segmented. This study outlines the advantages and drawbacks of both methods. The high quality speech synthetic system has the great advantage that no training stage is needed, while the classical HMM/ANN system easily allows multiple phonetic transcriptions. We deduce a method for the automatic constitution of phonetically labeled speech databases based on using the synthetic speech segmentation tool to bootstrap the training process of our hybrid HMM/ANN system. The importance of such segmentation tools will be a key point for the development of improved speech synthesis and recognition systems.
  • Keywords
    hidden Markov models; neural nets; speech processing; speech recognition; speech synthesis; French read utterances; hybrid HMM-ANN system; multiple phonetic transcriptions; phonetic labeling; segmentation tools; speech database; speech recognition systems; speech synthesis; synthetic speech pattern; synthetic speech segmentation tool; text-to-speech alignment systems; training process bootstrap; Artificial neural networks; Cepstral analysis; Hidden Markov models; Speech; Speech recognition; Speech synthesis; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO 1998), 9th European
  • Conference_Location
    Rhodes
  • Print_ISBN
    978-960-7620-06-4
  • Type

    conf

  • Filename
    7089987