DocumentCode
1858005
Title
Automatic topic segmentation and labeling in multiparty dialogue
Author
Hsueh, P.-Y. ; Moore, J.D.
Author_Institution
Sch. of Inf., Edinburgh Univ., Edinburgh
fYear
2006
fDate
10-13 Dec. 2006
Firstpage
98
Lastpage
101
Abstract
This study concerns how to segment a scenario-driven multiparty dialogue and how to label these segments automatically. We apply approaches that have been proposed for identifying topic boundaries at a coarser level to the problem of identifying agenda-based topic boundaries in scenario-based meetings. We also develop conditional models to classify segments into topic classes. Experiments in topic segmentation show that a supervised classification approach that combines lexical and conversational features outperforms the unsupervised lexical chain-based approach, achieving 20% and 12% improvement on segmentating top-level and sub-topic segments respectively. Experiments in topic classification suggest that it is possible to automatically categorize segments into appropriate topic classes given only the transcripts. Training with features selected using the Log Likelihood ratio improves the results by 13.3%.
Keywords
pattern classification; speech processing; agenda-based topic boundaries; automatic topic labeling; automatic topic segmentation; log likelihood ratio; scenario-based meetings; scenario-driven multiparty dialogue; supervised classification; topic boundaries; Ambient intelligence; Data mining; Decision trees; Frequency; Informatics; Labeling; Predictive models; Speech; Text categorization; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language Technology Workshop, 2006. IEEE
Conference_Location
Palm Beach
Print_ISBN
1-4244-0872-5
Type
conf
DOI
10.1109/SLT.2006.326826
Filename
4123371
Link To Document