• DocumentCode
    2826440
  • Title

    Commercial mining basedon temporal recurrence hashing algorithm and bag-of-fingerprints model

  • Author

    Wu, Xiaomeng ; Satoh, Shin´ichi

  • Author_Institution
    Digital Content & Media Sci. Res. Div., Nat. Inst. of Inf., Tokyo, Japan
  • fYear
    2011
  • fDate
    11-14 Sept. 2011
  • Firstpage
    2529
  • Lastpage
    2532
  • Abstract
    We propose two novel algorithms for fully-unsupervised, super-fast, and cross-channel TV commercial mining in this paper. The tasks involved in the process include: 1) mining commercial clusters from streams of individual channels, and 2) grouping identical commercial clusters across multiple channels. The first process is achieved with a dual-stage hashing algorithm, which searches for recurring short segments by hashing frames, and it assembles these short segments into sets of commercial sequences by hashing temporal recurrences. The algorithm mined commercials from a one-month stream in less than 42 minutes, which was ten times faster than that in related studies. A new bag-of-fingerprints model is proposed for the second process to encode the temporal clues of local fingerprints. The model is abundantly robust against framing and fingerprinting errors in recurring sequences, and discovers false matches of local fingerprints. A five-month database was used for comprehensively demonstrating the effectiveness and efficiency of the model.
  • Keywords
    advertising; cryptography; data mining; fingerprint identification; image sequences; bag-of-fingerprint model; commercial cluster mining; crosschannel TV commercial mining; false match discovery; fingerprinting error; five-month database; identical commercial cluster; temporal clue encoding; temporal recurrence hashing algorithm; Clustering algorithms; Conferences; Databases; Histograms; Robustness; Streaming media; TV; Duplicate Detection; Fingerprinting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2011 18th IEEE International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1522-4880
  • Print_ISBN
    978-1-4577-1304-0
  • Electronic_ISBN
    1522-4880
  • Type

    conf

  • DOI
    10.1109/ICIP.2011.6116177
  • Filename
    6116177