DocumentCode :
3153537
Title :
Noise robust keyword spotting for user generated video blogs
Author :
Barakat, M.S. ; Ritz, C.H. ; Stirling, D.A.
Author_Institution :
ICT Res. Inst., Univ. of Wollongong, Wollongong, NSW, Australia
fYear :
2013
fDate :
15-19 July 2013
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents a template-based system for speaker independent key word spotting (KWS) in continuous speech that can help in automatic analysis, indexing, search and retrieval of user generated videos by content. Extensive experiments on clean speech confirm that the proposed approach is superior to a HMM approach when applied to noisy speech with different signal-to-noise ratio (SNR) levels. Experiments conducted to detect swear words, personal names and product names within a set of online user generated video blogs shows significantly better recall and precision results compared to a traditional ASR-based approach.
Keywords :
Web sites; speech processing; video retrieval; HMM approach; automatic analysis; indexing analysis; noise robust keyword spotting; noisy speech; online user generated video blogs; signal-to-noise ratio levels; speaker independent key word spotting; template-based system; user generated video retrieval; user generated video search; Blogs; Hidden Markov models; Histograms; Noise measurement; Signal to noise ratio; Speech; Keyword Spotting (KWS); Noise Robustness; Social Networks; Template Matching (TM); Users Video Blogs;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2013 IEEE International Conference on
Conference_Location :
San Jose, CA
ISSN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2013.6607589
Filename :
6607589
Link To Document :
بازگشت