DocumentCode
2826440
Title
Commercial mining basedon temporal recurrence hashing algorithm and bag-of-fingerprints model
Author
Wu, Xiaomeng ; Satoh, Shin´ichi
Author_Institution
Digital Content & Media Sci. Res. Div., Nat. Inst. of Inf., Tokyo, Japan
fYear
2011
fDate
11-14 Sept. 2011
Firstpage
2529
Lastpage
2532
Abstract
We propose two novel algorithms for fully-unsupervised, super-fast, and cross-channel TV commercial mining in this paper. The tasks involved in the process include: 1) mining commercial clusters from streams of individual channels, and 2) grouping identical commercial clusters across multiple channels. The first process is achieved with a dual-stage hashing algorithm, which searches for recurring short segments by hashing frames, and it assembles these short segments into sets of commercial sequences by hashing temporal recurrences. The algorithm mined commercials from a one-month stream in less than 42 minutes, which was ten times faster than that in related studies. A new bag-of-fingerprints model is proposed for the second process to encode the temporal clues of local fingerprints. The model is abundantly robust against framing and fingerprinting errors in recurring sequences, and discovers false matches of local fingerprints. A five-month database was used for comprehensively demonstrating the effectiveness and efficiency of the model.
Keywords
advertising; cryptography; data mining; fingerprint identification; image sequences; bag-of-fingerprint model; commercial cluster mining; crosschannel TV commercial mining; false match discovery; fingerprinting error; five-month database; identical commercial cluster; temporal clue encoding; temporal recurrence hashing algorithm; Clustering algorithms; Conferences; Databases; Histograms; Robustness; Streaming media; TV; Duplicate Detection; Fingerprinting;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2011 18th IEEE International Conference on
Conference_Location
Brussels
ISSN
1522-4880
Print_ISBN
978-1-4577-1304-0
Electronic_ISBN
1522-4880
Type
conf
DOI
10.1109/ICIP.2011.6116177
Filename
6116177
Link To Document