DocumentCode :
431597
Title :
Broadcast news segmentation by audio type analysis
Author :
Nwe, Tin Lay ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res., Singapore
Volume :
2
fYear :
2005
fDate :
18-23 March 2005
Abstract :
It is common for us to define audio types according to human perception instead of audio spectral properties. In this paper, we analyse the spectral properties of audio types and propose the acoustic features based on spectral properties and harmonic enhancement, to classify audio. By analyzing the spectral properties of sound types, a multi-model HMM is proposed to integrate the primitive spectral properties in statistical modeling. To validate the approach, we build a classifier to segment audio streams into speech, commercials, environmental sound, physical violence and silence in multiple steps. It is shown that the proposed approach outperforms conventional methods. Experimental evaluations on 20 audio tracks of the TRECVID broadcast news database have shown the effectiveness of the proposed approach.
Keywords :
audio signal processing; feature extraction; hidden Markov models; signal classification; spectral analysis; acoustic features; audio classification; audio stream classification; audio type analysis; audio type spectral properties; broadcast news segmentation; energy distribution analysis; harmonic enhancement; human audio perception; multimodel HMM; statistical modeling; Broadcasting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1415592
Filename :
1415592
Link To Document :
بازگشت