مرکز منطقه ای اطلاع رساني علوم و فناوري - Using N-Best Lists and Confusion Networks for Meeting Summarization

DocumentCode :

1341424

Title :

Using N-Best Lists and Confusion Networks for Meeting Summarization

Author :

Xie, Shasha ; Liu, Yang

Author_Institution :

Comput. Sci. Dept., Univ. of Texas at Dallas, Richardson, TX, USA

Volume :

Issue :

fYear :

2011

fDate :

7/1/2011 12:00:00 AM

Firstpage :

1160

Lastpage :

1169

Abstract :

The incorrect speech recognition results usually have a negative impact on the speech summarization task, especially on the meeting domain where the word error rate is often higher than other speech genres. In this paper we investigate using rich speech recognition results to improve meeting summarization performance. Two kinds of structures are considered, n-best hypotheses and confusion networks. We develop methods to utilize multiple word and sentence candidates and their recognition confidence for summarization under an unsupervised framework. Our experimental results on the ICSI meeting corpus show that our proposed method can significantly improve summarization performance over using 1-best recognition output, evaluated by both ROUGE-1 and ROUGE-2 scores. We also find that if the task is to generate speech summaries or identify salient segments, using rich speech recognition output is just as effective as using human transcripts. In addition, we discuss the difference between n-best lists and confusion networks, and analyze the word error rate in the exacted summary sentences.

Keywords :

speech recognition; 1-best recognition output; ICSI meeting corpus; ROUGE-1 and ROUGE-2 scores; ROUGE-2 scores; confusion networks; human transcripts; meeting summarization; n-best lists; rich speech recognition output; speech summarization task; word error rate; Breast; Equations; Humans; Mathematical model; Noise; Speech; Speech recognition; Extractive meeting summarization; confusion networks; maximum marginal relevance (MMR); n-best hypotheses;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2010.2082534

Filename :

5593872

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1341424