• DocumentCode
    3265147
  • Title

    Efficient general genre video abstraction scheme for embedded devices using pure audio cues

  • Author

    Bhatt, Rajen B. ; Krishnamoorthy, P. ; Kumar, Sarvesh

  • Author_Institution
    Samsung India Software R&D Center, Noida, India
  • fYear
    2009
  • fDate
    1-2 Dec. 2009
  • Firstpage
    63
  • Lastpage
    67
  • Abstract
    In this paper, we propose a framework of general genre (e.g., action, comedy, drama, documentary, musical, etc...) movie video abstraction scheme for embedded devices based on pure audio. The proposed algorithm does chaptering of multi-genre movie videos by detecting silence, environmental noise, pure speech, music (pure instrumental music and music with vocals, i.e., songs), and speech with back ground music (or music without vocals but with speech). Various audio features along with supervised classification strategies have been used for the abstraction. The current system has been evaluated with Gaussian Mixture Model (GMM) and Fuzzy Decision Tree (FDT) classifiers. The silence and environmental noise have been detected using the threshold approach with certain combination of audio features. Various optimizations done at algorithm and program level have made the scheme highly suitable for embedded devices.
  • Keywords
    Gaussian processes; audio signal processing; decision trees; Gaussian mixture model; audio cues; background music; embedded devices; environmental noise; fuzzy decision tree; movie genre video abstraction; noise classification; silence detection; Decision trees; Embedded software; Fuzzy systems; Instruments; Motion pictures; Regions; Research and development; Speech analysis; Speech enhancement; Working environment noise; Video abstraction; audio content classification; general genre video;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    ICT and Knowledge Engineering, 2009 7th International Conference on
  • Conference_Location
    Bangkok
  • Print_ISBN
    978-1-4244-4513-4
  • Electronic_ISBN
    978-1-4244-4514-1
  • Type

    conf

  • DOI
    10.1109/ICTKE.2009.5397324
  • Filename
    5397324