Title :
A multimedia approach for audio segmentation in TV broadcast news
Author :
Perez-Freire, Luis ; Garcia-Mateo, Carmen
Author_Institution :
ETSI Telecomunicacion, Vigo Univ., Spain
Abstract :
The paper deals with the task of audio segmentation in TV broadcast news. A multimedia approach for this purpose, by means of audio and video processing, is proposed. Thus, the segmentation system is composed by two differentiated parts: one analyzes the audio stream, and is based on the well-known Bayesian information criterion (BIC), whereas the other part extracts useful information from the video stream to improve the performance of BIC. An investigation of parameters involved in BIC formulation is also accomplished, in order to achieve the best results possible in our experimental framework: the database Transcrigal-DB. The final system provides significative improvements in both overall performance and robustness.
Keywords :
Bayes methods; audio signal processing; multimedia computing; speech processing; speech recognition; video signal processing; Bayesian information criterion; TV broadcast news; audio processing; audio segmentation; automatic speech recognition; multimedia approach; nonspeech fragments; speech fragments; video processing; Automatic speech recognition; Bayesian methods; Data mining; Digital multimedia broadcasting; Loudspeakers; Speech recognition; Statistics; Streaming media; TV broadcasting; Telecommunication standards;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325999