DocumentCode
2980886
Title
New word detection in audio-indexing
Author
Dharanipragada, Satya ; Roukos, Salim
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear
1997
fDate
14-17 Dec 1997
Firstpage
551
Lastpage
557
Abstract
For an audio indexing system that uses a speech recognizer with a fixed vocabulary to be practical, one needs the ability to detect out of vocabulary or new words at query time. We present a fast, vocabulary independent algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse to detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired speed in wordspotting
Keywords
audio systems; indexing; pattern matching; speech recognition; word processing; acoustic match; audio indexing system; coarse search; fixed vocabulary; new word detection; phone level representation; phone ngram matching; preprocessing method; preprocessing stage; putative hits; putative word hits; query time; search strategy; speech recognizer; vocabulary independent algorithm; word spotting; word/phone sequenc; Computer networks; Concurrent computing; Decoding; Hidden Markov models; Information retrieval; Lattices; Speech recognition; Text recognition; Viterbi algorithm; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location
Santa Barbara, CA
Print_ISBN
0-7803-3698-4
Type
conf
DOI
10.1109/ASRU.1997.659135
Filename
659135
Link To Document