DocumentCode
2449828
Title
Detecting offensive user video blogs: An adaptive keyword spotting approach
Author
Barakat, M.S. ; Ritz, C.H. ; Stirling, D.A.
Author_Institution
Sch. of Electr., Comput. & Telecommun. Eng., Univ. of Wollongong, Wollongong, NSW, Australia
fYear
2012
fDate
16-18 July 2012
Firstpage
419
Lastpage
425
Abstract
This paper proposes a speaker independent keyword spotting (KWS) approach applied to the audio track of user video blogs that can help in their automatic analysis, indexing, search and retrieval. The approach, which relies on matching of keyword templates to speech segments using an adaptive similarity threshold that is estimated automatically for each utterance, does not require training data or language model as required in existing approaches such as those based on the Hidden Markov Model (HMM). This is a particular advantage for user video blogs since they usually contain words of interest that have not been adequately represented in a training database. Experiments conducted to detect offensive words in video blogs achieved much higher accuracy than existing speech-to-text based approaches.
Keywords
audio signal processing; indexing; information retrieval; social networking (online); speaker recognition; speech synthesis; KWS approach; adaptive keyword spotting approach; adaptive similarity threshold; audio track; automatic analysis; indexing; keyword template matching; offensive user video blog detection; retrieval; search; speaker independent keyword spotting approach; speech segments; speech-to-text based approach; Blogs; Databases; Hidden Markov models; Histograms; Speech; Speech recognition; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Audio, Language and Image Processing (ICALIP), 2012 International Conference on
Conference_Location
Shanghai
Print_ISBN
978-1-4673-0173-2
Type
conf
DOI
10.1109/ICALIP.2012.6376654
Filename
6376654
Link To Document