DocumentCode
323529
Title
A fast vocabulary independent algorithm for spotting words in speech
Author
Dharanipragada, S. ; Roukos, S.
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
1
fYear
1998
fDate
12-15 May 1998
Firstpage
233
Abstract
In applications such as audio-indexing, spoken message retrieval and video-browsing, it is necessary to have the ability to detect spoken words that are outside the vocabulary of the speech recognizer used in these systems, in large amounts of speech at speeds many times faster than real-time. We present a fast, vocabulary independent, algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired accuracy and speed in word spotting. Overall, the algorithm has a speed of execution that is 2400 times faster than real-time
Keywords
acoustic signal detection; indexing; information retrieval; signal representation; speech processing; speech recognition; accuracy; acoustic match; audio-indexing; coarse match; coarse search; coarse-to-detailed search; fast vocabulary independent algorithm; keywords; out of vocabulary words detection; phone sequence spotting; phone-level representation; phone-ngram matching; preprocessing stage; putative word hits; speech recognizer; spoken message retrieval; video-browsing; word spotting; Computer networks; Decoding; Information retrieval; Lattices; Real time systems; Speech recognition; Text recognition; Viterbi algorithm; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.674410
Filename
674410
Link To Document