DocumentCode
768582
Title
A new approach to automatic speech summarization
Author
Hori, Chiori ; Furui, Sadaoki
Author_Institution
Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
Volume
5
Issue
3
fYear
2003
Firstpage
368
Lastpage
378
Abstract
This paper proposes a new automatic speech summarization method. In this method, a set of words maximizing a summarization score is extracted from automatically transcribed speech. This extraction is performed according to a target compression ratio using a dynamic programming (DP) technique. The extracted set of words is then connected to build a summarization sentence. The summarization score consists of a word significance measure, a confidence measure, linguistic likelihood, and a word concatenation probability. The word concatenation score is determined by a dependency structure in the original speech given by stochastic dependency context free grammar (SDCFG). Japanese broadcast news speech transcribed using a large-vocabulary continuous-speech recognition (LVCSR) system is summarized using our proposed method and compared with manual summarization by human subjects. The manual summarization results are combined to build a word network. This word network is used to calculate the word accuracy of each automatic summarization result using the most similar word string in the network. Experimental results show that the proposed method effectively extracts relatively important information by removing redundant and irrelevant information.
Keywords
dynamic programming; natural languages; text analysis; DP; Japanese broadcast news speech; LVCSR system; SDCFG; automatic speech summarization; confidence measure; dependency structure; dynamic programming; irrelevant information removal; large-vocabulary continuous-speech recognition system; linguistic likelihood; most similar word string; redundant information removal; stochastic dependency context free grammar; summarization score maximization; target compression ratio; word concatenation probability; word network; word set; word significance measure; Automatic speech recognition; Broadcasting; Computer science; Data mining; Dynamic programming; Indexing; Information science; Speech processing; Speech recognition; Stochastic processes;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2003.813274
Filename
1223564
Link To Document