  • DocumentCode
    2452258
  • Title
    Automatic instrument and environmental sound recognition for media annotation of TV content
  • Author
    Cavaco, Sofia ; Malheiro, Frederico ; Mateus, João ; Jesus, Rui ; Correia, Nuno

  • Author_Institution
    Dept. de Inf., Univ. Nova de Lisboa, Caparica, Portugal
  • fYear
    2012
  • fDate
    16-18 July 2012
  • Firstpage
    1125
  • Lastpage
    1130
  • Abstract
    Due to the lack of annotation of their large video archives, multimedia content provider companies and television channels do not use the data in their archives to their full extent. In order to contribute with a solution to this problem, we have developed a tool that combines audio and visual information to annotate video. In particular, this tool has been used by a video production company that has given us positive feedback. The main innovation of this tool is the use of environmental sound recognition to annotate video. Here we focus on the tool's audio information extraction method, which consists of a sound recognizer that learns a small set of spectral features from the data using non-negative matrix factorization. The recognizer can be used for different purposes such as to classify musical instruments, to identify the notes that are played and to distinguish environmental sounds like water, traffic, trains and people.
  • Keywords
    audio signal processing; feature extraction; matrix decomposition; multimedia systems; spectral analysis; television; video signal processing; TV content; audio information; automatic instrument recognition; environmental sound recognition; media annotation; musical instrument classification; nonnegative matrix factorization; note identification; sound recognizer; spectral features; tool audio information extraction method; video annotation; video production company; visual information; Companies; Feature extraction; Instruments; Spectrogram; Training; Vectors; Visualization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2012 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4673-0173-2
  • Type
    conf
  • DOI
    10.1109/ICALIP.2012.6376785
  • Filename
    6376785