DocumentCode :
2236302
Title :
Spoken document summarization using acoustic, prosodic and semantic information
Author :
Huang, Chien-Lin ; Hsieh, Chia-Hsin ; Wu, Chung-Hsien
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear :
2005
fDate :
6-8 July 2005
Abstract :
This paper presents a spoken document summarization scheme using acoustic, prosodic, and semantic information. First, speech recognition confidence is estimated to choose reliable words from the speech transcription. Prosodic information, including pitch and energy, is used for stressed word selection. Latent semantic indexing (LSI) is adopted to identify significant words. Finally, word trigram and semantic dependency is measured to include the syntactic and semantic information for speech summarization. The dynamic programming (DP) algorithm is used to find the best summarization result according to the summarization score estimated from the above five measures. Finally, the summarized result is presented by the concatenation of the summarized speech words. Experimental results indicate that the proposed approach effectively extracts important words and gives a promising speech summary.
Keywords :
dynamic programming; indexing; natural languages; speech processing; speech recognition; word processing; LSI; acoustic information; dynamic programming algorithm; latent semantic indexing; prosodic information; semantic information; speech recognition; speech transcription; spoken document summarization; stressed word selection; Acoustical engineering; Computer science; Data mining; Dynamic programming; Heuristic algorithms; Indexing; Large scale integration; Power engineering and energy; Reliability engineering; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Print_ISBN :
0-7803-9331-7
Type :
conf
DOI :
10.1109/ICME.2005.1521453
Filename :
1521453
Link To Document :
بازگشت