DocumentCode
699341
Title
Audio source segmentation using spectral correlation features for automatic indexing of broadcast news
Author
Matsunaga, Shoichi ; Mizuno, Osamu ; Ohtsuki, Katsutoshi ; Hayashi, Yoshihiko
Author_Institution
NTT Cyber Space Labs., NTT Corporationing, Yokosuka, Japan
fYear
2004
fDate
6-10 Sept. 2004
Firstpage
2103
Lastpage
2106
Abstract
This paper proposes a new segmentation procedure to detect audio source intervals for automatic indexing of broadcast news. The procedure is composed of an audio source detection part and a part that smoothes the detected sequences. The detection part uses three new acoustic feature parameters that are based on spectral cross-correlation: spectral stability, white noise similarity, and sound spectral shape. These parameters make it possible to capture the audio sources more accurately than can be done with conventional parameters. The smoothing part has a new merging method that drops erroneous detection results of short duration. Audio source classification experiments are conducted on broadcast news segments. Performance is increased by 6.6% when the proposed parameters are used and by 3.1% when the proposed merging method is used, showing the usefulness of our approach. Experiments confirm the impact of this proposal on broadcast news indexing.
Keywords
audio signal processing; signal classification; signal detection; smoothing methods; acoustic feature parameters; audio source classification; audio source detection part; audio source segmentation procedure; automatic indexing; broadcast news; merging method; sound spectral shape; spectral correlation features; spectral stability; white noise similarity; Abstracts; Correlation; Noise; Smoothing methods; Speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2004 12th European
Conference_Location
Vienna
Print_ISBN
978-320-0001-65-7
Type
conf
Filename
7079871
Link To Document