DocumentCode :
417174
Title :
A multimedia approach for audio segmentation in TV broadcast news
Author :
Perez-Freire, Luis ; Garcia-Mateo, Carmen
Author_Institution :
ETSI Telecomunicacion, Vigo Univ., Spain
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The paper deals with the task of audio segmentation in TV broadcast news. A multimedia approach for this purpose, by means of audio and video processing, is proposed. Thus, the segmentation system is composed of two differentiated parts: one analyzes the audio stream and is based on the well-known Bayesian information criterion (BIC), whereas the other extracts useful information from the video stream to improve the performance of BIC. An investigation of the parameters involved in the BIC formulation is also carried out, in order to achieve the best possible results in our experimental framework: the Transcrigal-DB database. The final system provides significant improvements in both overall performance and robustness.
Keywords :
Bayes methods; audio signal processing; multimedia computing; speech processing; speech recognition; video signal processing; Bayesian information criterion; TV broadcast news; audio processing; audio segmentation; automatic speech recognition; multimedia approach; nonspeech fragments; speech fragments; video processing; Automatic speech recognition; Bayesian methods; Data mining; Digital multimedia broadcasting; Loudspeakers; Speech recognition; Statistics; Streaming media; TV broadcasting; Telecommunication standards
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325999
Filename :
1325999