DocumentCode
417174
Title
A multimedia approach for audio segmentation in TV broadcast news
Author
Perez-Freire, Luis ; Garcia-Mateo, Carmen
Author_Institution
ETSI Telecomunicacion, Vigo Univ., Spain
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper deals with the task of audio segmentation in TV broadcast news. A multimedia approach for this purpose, by means of audio and video processing, is proposed. Thus, the segmentation system is composed by two differentiated parts: one analyzes the audio stream, and is based on the well-known Bayesian information criterion (BIC), whereas the other part extracts useful information from the video stream to improve the performance of BIC. An investigation of parameters involved in BIC formulation is also accomplished, in order to achieve the best results possible in our experimental framework: the database Transcrigal-DB. The final system provides significative improvements in both overall performance and robustness.
Keywords
Bayes methods; audio signal processing; multimedia computing; speech processing; speech recognition; video signal processing; Bayesian information criterion; TV broadcast news; audio processing; audio segmentation; automatic speech recognition; multimedia approach; nonspeech fragments; speech fragments; video processing; Automatic speech recognition; Bayesian methods; Data mining; Digital multimedia broadcasting; Loudspeakers; Speech recognition; Statistics; Streaming media; TV broadcasting; Telecommunication standards;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325999
Filename
1325999
Link To Document