DocumentCode :
2980886
Title :
New word detection in audio-indexing
Author :
Dharanipragada, Satya ; Roukos, Salim
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
1997
fDate :
14-17 Dec 1997
Firstpage :
551
Lastpage :
557
Abstract :
For an audio indexing system that uses a speech recognizer with a fixed vocabulary to be practical, one needs the ability to detect out-of-vocabulary or new words at query time. We present a fast, vocabulary-independent algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone-ngram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired speed in wordspotting.
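The coarse-to-detailed strategy described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the n-gram order, the voting threshold, and the toy phone string are all assumptions made for illustration, and a plain edit distance over phone labels stands in for the detailed acoustic match, which the abstract does not specify.

```python
from collections import defaultdict


def build_ngram_index(phones, n=3):
    """Preprocessing (assumed form): map each phone n-gram to its start positions."""
    index = defaultdict(list)
    for i in range(len(phones) - n + 1):
        index[tuple(phones[i:i + n])].append(i)
    return index


def coarse_search(index, query, n=3, min_hits=1):
    """Coarse pass: each matching query n-gram votes for an implied query start."""
    votes = defaultdict(int)
    for j in range(len(query) - n + 1):
        for pos in index.get(tuple(query[j:j + n]), ()):
            votes[max(pos - j, 0)] += 1
    return sorted(start for start, v in votes.items() if v >= min_hits)


def edit_distance(a, b):
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def spot_word(phones, query, n=3, max_dist=1):
    """Run the detailed check only at the putative hits from the coarse pass.

    Assumption: edit distance is a stand-in for a real acoustic match.
    """
    index = build_ngram_index(phones, n)
    hits = []
    for start in coarse_search(index, query, n):
        window = phones[start:start + len(query)]
        dist = edit_distance(window, query)
        if dist <= max_dist:
            hits.append((start, dist))
    return hits


# Toy phone-level representation (hypothetical data).
phones = "sil dh ax k ae t s ae t sil".split()
query = "k ae t".split()
print(spot_word(phones, query, n=3))
print(spot_word(phones, query, n=2))
```

With a smaller n-gram order the coarse pass proposes more putative hits, so the detailed check runs at more positions; the cost saving comes from never running the detailed check outside the regions the coarse pass selects.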
Keywords :
audio systems; indexing; pattern matching; speech recognition; word processing; acoustic match; audio indexing system; coarse search; fixed vocabulary; new word detection; phone level representation; phone ngram matching; preprocessing method; preprocessing stage; putative hits; putative word hits; query time; search strategy; speech recognizer; vocabulary independent algorithm; word spotting; word/phone sequence; Computer networks; Concurrent computing; Decoding; Hidden Markov models; Information retrieval; Lattices; Speech recognition; Text recognition; Viterbi algorithm; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-7803-3698-4
Type :
conf
DOI :
10.1109/ASRU.1997.659135
Filename :
659135