Title :
SVM-based audio classification for instructional video analysis
Author :
Li, Ying ; Dorai, Chitra
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
Automatic content analysis and annotation for efficient search and browsing of topics in instructional videos are current challenges in the management of e-learning content repositories. This paper presents our current work on classifying the soundtrack of instructional videos into seven distinct audio classes using the support vector machine (SVM) technology. The classification results are then used to partition a video into homogeneous audio segments, which forms the fundamental basis for its higher-level content analysis and exploration. Initial experiments carried out on three education and four training videos totalling 185 minutes have yielded an average 97.9% classification accuracy. The performance comparisons between the SVM-based, the decision tree (DT)-based and the threshold-based audio classification schemes further demonstrates the superiority of the proposed scheme.
Keywords :
audio signal processing; classification; content-based retrieval; educational technology; feature extraction; multimedia systems; support vector machines; video signal processing; SVM technology; SVM-based audio classification; audio classes; audio feature extraction; automatic content analysis; automatic content annotation; classification accuracy; e-learning content repositories; educational videos; homogeneous audio segments; instructional video analysis; instructional video browsing; multimedia data; soundtrack classification; support vector machines; topic searching; training videos; video partitioning; Classification tree analysis; Decision trees; Educational programs; Hidden Markov models; Industrial training; Multiple signal classification; Music; Speech; Support vector machine classification; Support vector machines;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1327256