DocumentCode :
156458
Title :
Statistical parametric speech synthesis for Arabic language using ANN
Author :
Ilyes, Rebai ; Ben Ayed, Yassine
Author_Institution :
MIRACL : Multimedia Inf. Syst. & Adv. Comput. Lab., Sfax Univ., Sfax, Tunisia
fYear :
2014
fDate :
17-19 March 2014
Firstpage :
452
Lastpage :
457
Abstract :
Statistical parametric approach for speech synthesis becomes more popular over the concatenative approach due to the low size of the system and the high-quality speech. Moreover, few researches have been done in the field of speech synthesis for Arabic language with a poor quality of speech. In this paper, we propose a statistical parametric synthesis system for Arabic based on Artificial Neural Networks (ANN). Mel frequency Cepstral coefficients (MFCC), F0, energy and duration are the main components of our system. Speech waveform is generated from the predicted parameters F0, energy and MFCC. Different methods are proposed for this development process. In addition, we propose a method to solve the problem of discontinuities between neighboring segment boundaries in order to improve the speech quality. Experimental results of cepstral and prosodic parameters are given in this paper as well as the subjective evaluation.
Keywords :
cepstral analysis; natural language processing; neural nets; speech synthesis; statistical analysis; ANN; Arabic language; MFCC; artificial neural networks; concatenative approach; high-quality speech; mel frequency cepstral coefficients; neighboring segment boundaries; speech quality; speech waveform; statistical parametric speech synthesis; Artificial neural networks; Databases; Mel frequency cepstral coefficient; Speech; Speech synthesis; Statistical parametric; neural networks; speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Technologies for Signal and Image Processing (ATSIP), 2014 1st International Conference on
Conference_Location :
Sousse
Type :
conf
DOI :
10.1109/ATSIP.2014.6834654
Filename :
6834654
Link To Document :
بازگشت