DocumentCode
312779
Title
Enhanced video handling based on audio analysis
Author
Minami, Kenichi ; Akutsu, Akihito ; Hamada, Hiroshi ; Tonomura, Yoshinobu
Author_Institution
NTT Human Interface Labs., Kanagawa, Japan
fYear
1997
fDate
3-6 Jun 1997
Firstpage
219
Lastpage
226
Abstract
Soundtracks of videos contain a rich source of content-based information. In this paper, we propose an audio-based approach to video indexing and handling. Audio data is analysed by means of frequency analysis, and music and voice are independently detected even if they occur together. The method is implemented on a system called Video in Time as an example of creating reasonable condensed versions of dramas or movies by excerpting meaningful video segments. Users can select the desired replaying time from several different levels, depending on how much time can be afforded for viewing. Detection rates for music and voice are evaluated and experiences with the system are mentioned.
Keywords
audio-visual systems; data analysis; image segmentation; indexing; multimedia computing; music; speech processing; Video in Time; audio analysis; content-based information; detection rates; drama; enhanced video handling; frequency analysis; movies; time; video indexing; video segments; video soundtracks; voice; Data mining; Feature extraction; Humans; Indexing; Laboratories; Layout; Motion pictures; Speech analysis; Telegraphy; Telephony
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Computing and Systems '97. Proceedings., IEEE International Conference on
Conference_Location
Ottawa, Ont.
Print_ISBN
0-8186-5530-5
Type
conf
DOI
10.1109/MMCS.1997.609596
Filename
609596