Title :
Active learning for the prediction of prosodic phrase boundaries in Chinese speech synthesis systems using conditional random fields
Author :
Ziping Zhao ; Xirong Ma
Author_Institution :
Coll. of Comput. & Inf. Eng., Tianjin Normal Univ., Tianjin, China
Abstract :
Prosodic structure contributes to speech production and comprehension. One of the crucial problems in achieving natural-sounding synthesized speech is the prediction of appropriate phrase boundaries. Unfortunately, obtaining human annotations of prosodic phrases to train a supervised system can be laborious and costly. Active learning has been proven effective in reducing labeling efforts for supervised learning. This study explores active learning techniques with the objective to reduce the amount of human-annotated data needed to attain a given level of performance. It presents an approach based on active learning to predict the Chinese prosodic phrase boundaries in unrestricted Chinese text. Experiments show that for most of the cases considered, the active selection strategies for labeling the prosodic phrase boundaries are as good as or exceed the performance of random data selection.
Keywords :
learning (artificial intelligence); speech synthesis; text analysis; Chinese prosodic phrase boundaries; Chinese speech synthesis systems; active learning; conditional random fields; human-annotated data; natural-sounding synthesized speech; prosodic phrase boundary prediction; random data selection; supervised learning; unrestricted Chinese text; Entropy; Hidden Markov models; Labeling; Predictive models; Speech; Training; Uncertainty; Active Learning; Conditional Random Fields (CRFs); Prosodic Phrase; Speech Synthesis system;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on
Conference_Location :
Takamatsu
DOI :
10.1109/SNPD.2015.7176201