DocumentCode :
427030
Title :
Active learning for story segmentation of spoken documents
Author :
Chen, Shun-Chuan
Author_Institution :
Speech Lab., Nat. Taiwan Univ., Taipei
Volume :
1
fYear :
2004
fDate :
30-30 June 2004
Firstpage :
607
Abstract :
In story segmentation, it is often difficult to gather the segmented data to train a new model for the purpose of supervised learning. Therefore, how to gather the useful data and to reduce the human effort on segmenting stories is an important issue. We apply active learning to selecting the most informative examples to train a supervised model more efficiently. Active learning aims to minimize the number of segmented examples by automatically selecting the examples that are most informative for the story segmenter. By this method, we can decrease the labor effort in boring story segmentation and get the same or better performance in automatic story segmentation. We also consider another problem, namely whether training data in story segmentation can be reusable or not and how the different special structures of the language influence the performance of active learning for story segmentation. The experimental results show that active learning can reduce the number of segmented examples to reach a given level of performance and that reusable training data in story segmentation still contain information that improves the performance. It is a satisfactory result
Keywords :
learning (artificial intelligence); linguistics; minimisation; natural languages; speech processing; text analysis; active learning; automatic story segmentation; language structures; minimization; reusable training data; spoken documents; supervised learning; Educational institutions; Hidden Markov models; Humans; Laboratories; Learning systems; NIST; Sampling methods; Speech; Supervised learning; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7803-8603-5
Type :
conf
DOI :
10.1109/ICME.2004.1394265
Filename :
1394265
Link To Document :
بازگشت