Title :
Noise robust keyword spotting for user generated video blogs
Author :
Barakat, M.S. ; Ritz, C.H. ; Stirling, D.A.
Author_Institution :
ICT Res. Inst., Univ. of Wollongong, Wollongong, NSW, Australia
Abstract :
This paper presents a template-based system for speaker independent key word spotting (KWS) in continuous speech that can help in automatic analysis, indexing, search and retrieval of user generated videos by content. Extensive experiments on clean speech confirm that the proposed approach is superior to a HMM approach when applied to noisy speech with different signal-to-noise ratio (SNR) levels. Experiments conducted to detect swear words, personal names and product names within a set of online user generated video blogs shows significantly better recall and precision results compared to a traditional ASR-based approach.
Keywords :
Web sites; speech processing; video retrieval; HMM approach; automatic analysis; indexing analysis; noise robust keyword spotting; noisy speech; online user generated video blogs; signal-to-noise ratio levels; speaker independent key word spotting; template-based system; user generated video retrieval; user generated video search; Blogs; Hidden Markov models; Histograms; Noise measurement; Signal to noise ratio; Speech; Keyword Spotting (KWS); Noise Robustness; Social Networks; Template Matching (TM); Users Video Blogs;
Conference_Titel :
Multimedia and Expo (ICME), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
DOI :
10.1109/ICME.2013.6607589