Title :
Context-dependent deterministic plus stochastic model
Author :
Khorram, Soheil ; Sameti, Hossein ; Bahmaninezhad, Fahimeh
Author_Institution :
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran, Iran
Abstract :
This article proposes a method to improve the performance of deterministic plus stochastic model (DSM-) based feature extraction by integrating the contextual information. One precious advantage of speech synthesis over speech recognition is that in both training and testing phases of synthesis, contextual information is available. However, similar to recognition, this invaluable knowledge has been forgotten during acoustic feature extraction of speech synthesis. DSM expresses the residual of Mel-cepstral analysis through a summation of two components, namely deterministic and stochastic. This study proposes to model the deterministic component through a novel context-dependent principal component analysis (CD-PCA), and the stochastic component through the conventional high-pass filtered noise. Furthermore, due to the high dependency of the proposed feature extraction on state boundaries, the feature analysis and HMM-based modeling are performed in an iterative manner. Subjective evaluations conducted on a Persian speech database confirm the effectiveness of the proposed synthesis system.
Keywords :
feature extraction; hidden Markov models; speech recognition; speech synthesis; ubiquitous computing; CD-PCA; DSM- based feature extraction; HMM-based modeling; Persian speech database; acoustic feature extraction; context-dependent deterministic plus stochastic model; context-dependent principal component analysis; contextual information; deterministic plus stochastic model; feature analysis; high-pass filtered noise; speech recognition; speech synthesis; stochastic component; synthesis system; Context modeling; Databases; Principal component analysis; Speech; Speech processing; Trajectory; Vocoders; HMM-based speech synthesis; context-dependent PCA; context-dependent residual modeling; excitation modeling; statistical parametric speech synthesis;
Conference_Titel :
Signal Processing (ICSP), 2014 12th International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4799-2188-1
DOI :
10.1109/ICOSP.2014.7015067