• DocumentCode
    156458
  • Title

    Statistical parametric speech synthesis for Arabic language using ANN

  • Author

    Ilyes, Rebai ; Ben Ayed, Yassine

  • Author_Institution
    MIRACL : Multimedia Inf. Syst. & Adv. Comput. Lab., Sfax Univ., Sfax, Tunisia
  • fYear
    2014
  • fDate
    17-19 March 2014
  • Firstpage
    452
  • Lastpage
    457
  • Abstract
    Statistical parametric approach for speech synthesis becomes more popular over the concatenative approach due to the low size of the system and the high-quality speech. Moreover, few researches have been done in the field of speech synthesis for Arabic language with a poor quality of speech. In this paper, we propose a statistical parametric synthesis system for Arabic based on Artificial Neural Networks (ANN). Mel frequency Cepstral coefficients (MFCC), F0, energy and duration are the main components of our system. Speech waveform is generated from the predicted parameters F0, energy and MFCC. Different methods are proposed for this development process. In addition, we propose a method to solve the problem of discontinuities between neighboring segment boundaries in order to improve the speech quality. Experimental results of cepstral and prosodic parameters are given in this paper as well as the subjective evaluation.
  • Keywords
    cepstral analysis; natural language processing; neural nets; speech synthesis; statistical analysis; ANN; Arabic language; MFCC; artificial neural networks; concatenative approach; high-quality speech; mel frequency cepstral coefficients; neighboring segment boundaries; speech quality; speech waveform; statistical parametric speech synthesis; Artificial neural networks; Databases; Mel frequency cepstral coefficient; Speech; Speech synthesis; Statistical parametric; neural networks; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Technologies for Signal and Image Processing (ATSIP), 2014 1st International Conference on
  • Conference_Location
    Sousse
  • Type

    conf

  • DOI
    10.1109/ATSIP.2014.6834654
  • Filename
    6834654