DocumentCode
3265147
Title
Efficient general genre video abstraction scheme for embedded devices using pure audio cues
Author
Bhatt, Rajen B. ; Krishnamoorthy, P. ; Kumar, Sarvesh
Author_Institution
Samsung India Software R&D Center, Noida, India
fYear
2009
fDate
1-2 Dec. 2009
Firstpage
63
Lastpage
67
Abstract
In this paper, we propose a framework for a general-genre (e.g., action, comedy, drama, documentary, musical) movie video abstraction scheme for embedded devices, based purely on audio. The proposed algorithm chapters multi-genre movie videos by detecting silence, environmental noise, pure speech, music (pure instrumental music and music with vocals, i.e., songs), and speech with background music (or music without vocals but with speech). Various audio features, along with supervised classification strategies, are used for the abstraction. The current system has been evaluated with Gaussian Mixture Model (GMM) and Fuzzy Decision Tree (FDT) classifiers. Silence and environmental noise are detected using a threshold approach applied to a combination of audio features. Optimizations at the algorithm and program levels make the scheme highly suitable for embedded devices.
Keywords
Gaussian processes; audio signal processing; decision trees; Gaussian mixture model; audio cues; background music; embedded devices; environmental noise; fuzzy decision tree; movie genre video abstraction; noise classification; silence detection; Decision trees; Embedded software; Fuzzy systems; Instruments; Motion pictures; Regions; Research and development; Speech analysis; Speech enhancement; Working environment noise; Video abstraction; audio content classification; general genre video
fLanguage
English
Publisher
IEEE
Conference_Titel
ICT and Knowledge Engineering, 2009 7th International Conference on
Conference_Location
Bangkok
Print_ISBN
978-1-4244-4513-4
Electronic_ISBN
978-1-4244-4514-1
Type
conf
DOI
10.1109/ICTKE.2009.5397324
Filename
5397324