Title :
Stress predicition for Mandarin text-to-speech system using discourse context feature
Author :
Hao Che ; Jianhua Tao
Author_Institution :
Inst. of Autom., Beijing, China
Abstract :
Stress prediction is a vital factor for both speech synthesis and natural speech understanding. In this paper, we investigate how to improve the performance of Mandarin stress predictor by introducing discourse context features. Two widely accepted statistical methods are employed to evaluate given/new status and informativeness of a word which are two major discourse context features in the stress prediction. Syntactic and other sentence context features are used in the method as well. The machine learning algorithm, Maximum Entropy model, is adopted to predict which syllable will be stressed with the textual features. The experimental result shows that the performance of the predictor with the discourse context features is better than the methods using the sentence features and even close to the methods using both the acoustic and textual features.
Keywords :
learning (artificial intelligence); maximum entropy methods; natural language processing; speech synthesis; statistical analysis; Mandarin stress predictor performance improvement; Mandarin text-to-speech system; acoustic features; discourse context feature; machine learning algorithm; maximum entropy model; natural speech understanding; sentence context features; speech synthesis; statistical methods; syllable; syntactic features; textual features; word informativeness evaluation; word status evaluation; Acoustics; Context; Correlation; Feature extraction; Predictive models; Semantics; Stress; Mandarin discourse; Maximum Entropy model; discourse context feature; word informativeness;
Conference_Titel :
Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013 International Conference
Conference_Location :
Gurgaon
DOI :
10.1109/ICSDA.2013.6709908