مرکز منطقه ای اطلاع رساني علوم و فناوري - A Supervised Framework for Keyword Extraction From Meeting Transcripts

DocumentCode :

3561106

Title :

A Supervised Framework for Keyword Extraction From Meeting Transcripts

Author :

Liu, Fei ; Liu, Feifan ; Liu, Yang

Author_Institution :

Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX, USA

Volume :

Issue :

fYear :

2011

fDate :

3/1/2011 12:00:00 AM

Firstpage :

538

Lastpage :

548

Abstract :

This paper presents a supervised framework for extracting keywords from meeting transcripts, a genre that is significantly different from written text or other speech domains such as broadcast news. In addition to the traditional frequency- or position-based clues, we investigate a variety of novel features, including linguistically motivated term specificity features, decision-making sentence-related features, prosodic prominence scores, as well as a group of features derived from summary sentences. To generate better system summaries, we propose a feedback loop mechanism under a supervised framework to leverage the relationship between keywords and summary sentences. Experiments are performed on the ICSI meeting corpus using both human transcripts and automatic speech recognition (ASR) outputs. Results have shown that our proposed supervised framework is able to outperform both unsupervised term frequency inverse document frequency (TF-IDF) weighting and a supervised keyphrase extraction system which is known for its satisfying performance on written text. We conduct extensive analysis to demonstrate the effectiveness of the newly proposed features and the feedback mechanism used to generate summaries. Furthermore, we show promising results using n-best recognition output to address the problems of recognition errors.

Keywords :

decision making; feature extraction; feedback; learning (artificial intelligence); speech recognition; automatic speech recognition; decision making; feedback loop mechanism; human transcripts; inverse document frequency; keyword extraction; meeting transcripts; prosodic prominence scores; sentence related features; summary sentences; supervised framework; supervised learning; unsupervised term frequency; Automatic speech recognition; Data mining; Digital audio broadcasting; Feedback loop; Frequency; Humans; Natural languages; Supervised learning; USA Councils; Voice mail; Automatic speech recognition (ASR); keyword extraction; meeting transcripts; summarization; supervised learning; term frequency $ times $ inverse document frequency (TF-IDF);

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

Conference_Location :

6/7/2010 12:00:00 AM

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2052119

Filename :

5482003

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3561106