New word detection in audio-indexing

Author

Dharanipragada, Satya ; Roukos, Salim

Author_Institution

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

fYear

1997

fDate

14-17 Dec 1997

Firstpage

551

Lastpage

557

Abstract

For an audio indexing system that uses a speech recognizer with a fixed vocabulary to be practical, one needs the ability to detect out-of-vocabulary or new words at query time. We present a fast, vocabulary-independent algorithm for spotting words in speech. The algorithm consists of a preprocessing stage and a coarse-to-detailed search strategy for spotting a word/phone sequence in speech. The preprocessing method provides a phone-level representation of the speech that can be searched efficiently. The coarse search, consisting of phone n-gram matching, identifies regions of speech as putative word hits. The detailed acoustic match is then conducted only at the putative hits identified in the coarse match. This gives us the desired speed in wordspotting.
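
The coarse-to-detailed strategy described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, the phone stream is a toy list of phone labels, and a plain edit distance over phone labels stands in for the detailed acoustic match, which the abstract does not specify.

```python
from collections import defaultdict


def build_ngram_index(phones, n=3):
    """Preprocessing: map each phone n-gram to the positions where it starts."""
    index = defaultdict(list)
    for i in range(len(phones) - n + 1):
        index[tuple(phones[i:i + n])].append(i)
    return index


def coarse_search(index, query, n=3, min_hits=1):
    """Coarse match: vote for the start positions implied by matching n-grams."""
    votes = defaultdict(int)
    for j in range(len(query) - n + 1):
        for pos in index.get(tuple(query[j:j + n]), []):
            # An n-gram at offset j in the query implies the word starts at pos - j.
            votes[max(pos - j, 0)] += 1
    return sorted(start for start, v in votes.items() if v >= min_hits)


def edit_distance(a, b):
    """Levenshtein distance between two phone sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def spot_word(phones, query, n=3, max_dist=1):
    """Run the detailed match (here: edit distance, an assumption) only at putative hits."""
    index = build_ngram_index(phones, n)
    hits = []
    for start in coarse_search(index, query, n):
        window = phones[start:start + len(query)]
        dist = edit_distance(query, window)
        if dist <= max_dist:
            hits.append((start, dist))
    return hits


# Toy phone-level transcript and a query phone sequence (both made up for illustration).
phones = ["sil", "k", "ae", "t", "sil", "s", "ae", "t", "ax", "n",
          "sil", "k", "ae", "p", "sil"]
query = ["k", "ae", "t"]
exact_hits = spot_word(phones, query, n=3)
```

Because the n-gram index is built once over the phone-level representation, the more expensive detailed comparison runs only at the few positions the coarse search proposes. Lowering `n` makes the coarse stage more permissive, which yields more putative hits for the detailed stage to filter.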

Keywords

audio systems; indexing; pattern matching; speech recognition; word processing; acoustic match; audio indexing system; coarse search; fixed vocabulary; new word detection; phone level representation; phone ngram matching; preprocessing method; preprocessing stage; putative hits; putative word hits; query time; search strategy; speech recognizer; vocabulary independent algorithm; word spotting; word/phone sequence; Computer networks; Concurrent computing; Decoding; Hidden Markov models; Information retrieval; Lattices; Speech recognition; Text recognition; Viterbi algorithm; Vocabulary

fLanguage

English

Publisher

IEEE

Conference_Titel

Automatic Speech Recognition and Understanding, 1997. Proceedings., 1997 IEEE Workshop on

Conference_Location

Santa Barbara, CA

Print_ISBN

0-7803-3698-4

Type

conf

DOI

10.1109/ASRU.1997.659135

Filename

659135