DocumentCode :
2770437
Title :
Soundbite identification using reference and automatic transcripts of broadcast news speech
Author :
Liu, Feifan ; Liu, Yang
Author_Institution :
Univ. of Texas at Dallas, Dallas
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
653
Lastpage :
658
Abstract :
Soundbite identification in broadcast news is important for locating information useful for question answering, mining opinions of a particular person, and enriching speech recognition output with quotation marks. This paper presents a systematic study of this problem under a classification framework, including problem formulation for classification, feature extraction, and the effect of using automatic speech recognition (ASR) output and automatic sentence boundary detection. Our experiments on a Mandarin broadcast news speech corpus show that the three-way classification framework outperforms the binary classification. The entropy-based feature weighting method generally performs better than others. Using ASR output degrades system performance, with more degradation observed from using automatic sentence segmentation than speech recognition errors for this task, especially on the recall rate.
Keywords :
broadcasting; feature extraction; signal classification; speech recognition; Mandarin broadcast news speech corpus; automatic sentence boundary detection; automatic sentence segmentation; automatic speech recognition; automatic transcript; binary classification; broadcast news speech; entropy-based feature weighting method; feature extraction; question answering; soundbite identification; Automatic speech recognition; Broadcasting; Degradation; Entropy; Feature extraction; Hidden Markov models; Loudspeakers; Speech processing; Speech recognition; System performance; sentence boundary detection; soundbite identification; term weighting; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430189
Filename :
4430189
Link To Document :
بازگشت