• DocumentCode
    2062652
  • Title

    Audio-visual fingerprinting and cross-modal aggregation: Components and applications

  • Author

    Dunker, Peter ; Gruhne, Matthias

  • Author_Institution
    Fraunhofer Inst. for Digital Media Technol. IDMT, Ilmenau
  • fYear
    2008
  • fDate
    14-16 April 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Within the last years the amount of digital media has been spread due to efficient media encoding algorithms. Hence, a large number of audio and video files are stored on the users hard disks and on popular video community platforms. Due to the lack of suitable or disobeyed metadata standards, the description of these data is often missing or misleading. Therefore, audio and visual identification algorithms have been developed, which identify videos or pieces of music and provide a suitable metadata description or copyright information based on a content database. Integrating both information, the visual and the audio part of the video for simultaneous identification is called cross-modal processing. In this paper the principle structure of an audio and a visual identification system is identified and different state-of-the-art algorithms are discussed. Furthermore, a cross-modal system is presented and especially the cross aggregation is discussed. Finally, current use cases for audio, visual and cross-modal search and retrieval are depicted.
  • Keywords
    audio coding; meta data; video coding; video retrieval; audio-visual fingerprinting; audio-visual identification algorithm; content database; copyright information; cross-modal aggregation; cross-modal retrieval; cross-modal search; digital media encoding; metadata description; metadata standards; state-of-the-art algorithm; Audio databases; Data mining; Feature extraction; Fingerprint recognition; Image converters; Multiple signal classification; Spatial databases; Spectrogram; Video compression; Visual databases; audio identification; cross-modal aggregation; visual identification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Consumer Electronics, 2008. ISCE 2008. IEEE International Symposium on
  • Conference_Location
    Vilamoura
  • Print_ISBN
    978-1-4244-2422-1
  • Electronic_ISBN
    978-1-4244-2422-1
  • Type

    conf

  • DOI
    10.1109/ISCE.2008.4559483
  • Filename
    4559483