DocumentCode
1742195
Title
Feature fluctuation absorption for a quick audio retrieval from long recordings
Author
Kashino, Kunio ; Kurozumi, Takayuki ; Murase, Hiroshi
Author_Institution
NTT Commun. Sci. Labs., Kanagawa, Japan
Volume
3
fYear
2000
fDate
2000
Firstpage
98
Abstract
Kashino et al. proposed (1999) a histogram-based quick signal search method called time-series active search (TAS). TAS has only been effective in the exact matching case, where the segments to be detected are assumed to be exactly same as the reference signal. Here, we extend the method so that it is applicable even if the features fluctuate. In addition to the feature modification, feature dithering is discussed to absorb feature fluctuations. Efficient time-scaled search is also investigated to cope with variations of the reference signal duration. Tests using broadcast recordings show that the extended method improves the accuracy in nonexact-matching tasks such as hand-clap detection and word spotting in a single-speaker´s narration. The tests also show the speed-ups by pruning introduced in the time-scaled search
Keywords
audio signal processing; pattern recognition; time series; TAS; efficient time-scaled search; exact matching case; feature dithering; feature fluctuation absorption; feature modification; hand-clap detection; histogram-based quick signal search method; long recordings; nonexact-matching tasks; quick audio retrieval; single-speaker narration; time-scaled search; time-series active search; word spotting; Absorption; Audio recording; Broadcasting; Fluctuations; Hafnium; Histograms; Laboratories; Search methods; Signal detection; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location
Barcelona
ISSN
1051-4651
Print_ISBN
0-7695-0750-6
Type
conf
DOI
10.1109/ICPR.2000.903494
Filename
903494
Link To Document