DocumentCode :
2321738
Title :
Strategies for automatic segmentation of audio data
Author :
Kemp, Thomas ; Schmidt, Michael ; Westphal, Martin ; Waibel, Alen
Author_Institution :
Interactive Syst. Labs., Karlsruhe Univ., Germany
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1423
Abstract :
In many applications, like indexing of broadcast news or surveillance applications, the input data consists of a continuous, unsegmented audio stream. Speech recognition technology, however, usually requires segments of relatively short length as input. For such applications, effective methods to segment continuous audio streams into homogeneous segments are required. In this paper, three different segmenting strategies (model-based, metric-based and energy-based) are compared on the same broadcast news test data. It is shown that model-based and metric-based techniques outperform the simpler energy-based algorithms. While model based segmenters achieve very high level of segment boundary precision, the metric-based segmenter preforms better in terms of segment boundary recall (RCL). To combine the advantages of both strategies, a new hybrid algorithm is introduced. For this, the results of a preliminary metric-based segmentation are used to construct the models for the final model-based segmenter run. The new hybrid approach is shown to outperform the other segmenting strategies
Keywords :
indexing; speech recognition; audio data; automatic segmentation; broadcast news; broadcast news test data; continuous unsegmented audio stream; energy-based algorithms; homogeneous segments; hybrid algorithm; input data; metric-based technique; model-based technique; segment boundary precision; segment boundary recall; speech recognition technology; surveillance application; Automatic speech recognition; Indexing; Interactive systems; Laboratories; Loudspeakers; Multimedia communication; Speech recognition; Streaming media; TV broadcasting; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.861862
Filename :
861862
Link To Document :
بازگشت