Title :
A Supervised Framework for Keyword Extraction From Meeting Transcripts
Author :
Liu, Fei ; Liu, Feifan ; Liu, Yang
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX, USA
fDate :
3/1/2011 12:00:00 AM
Abstract :
This paper presents a supervised framework for extracting keywords from meeting transcripts, a genre that differs significantly from written text and from other speech domains such as broadcast news. In addition to traditional frequency- and position-based cues, we investigate a variety of novel features, including linguistically motivated term specificity features, decision-making sentence-related features, prosodic prominence scores, and a group of features derived from summary sentences. To generate better system summaries, we propose a feedback loop mechanism under a supervised framework that leverages the relationship between keywords and summary sentences. Experiments are performed on the ICSI meeting corpus using both human transcripts and automatic speech recognition (ASR) outputs. Results show that the proposed supervised framework outperforms both unsupervised term frequency-inverse document frequency (TF-IDF) weighting and a supervised keyphrase extraction system known for its satisfactory performance on written text. We conduct extensive analysis to demonstrate the effectiveness of the newly proposed features and of the feedback mechanism used to generate summaries. Furthermore, we show promising results using n-best recognition output to address the problem of recognition errors.
Keywords :
decision making; feature extraction; feedback; learning (artificial intelligence); speech recognition; automatic speech recognition; decision making; feedback loop mechanism; human transcripts; inverse document frequency; keyword extraction; meeting transcripts; prosodic prominence scores; sentence related features; summary sentences; supervised framework; supervised learning; unsupervised term frequency; Automatic speech recognition; Data mining; Digital audio broadcasting; Feedback loop; Frequency; Humans; Natural languages; Supervised learning; USA Councils; Voice mail; Automatic speech recognition (ASR); keyword extraction; meeting transcripts; summarization; supervised learning; term frequency $\times$ inverse document frequency (TF-IDF)
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Conference_Location :
6/7/2010 12:00:00 AM
DOI :
10.1109/TASL.2010.2052119