Title :
Fast implementation methods for Viterbi-based word-spotting
Author :
Knill, K.M. ; Young, S.J.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
This paper explores methods of increasing the speed of a Viterbi-based word-spotting system for audio document retrieval. Fast processing is essential since the user expects to receive the results of a keyword search in far less time than the duration of the speech being searched. A number of computational short-cuts to the standard Viterbi word-spotter are presented. These exploit the background Viterbi phone recognition path, which is computed to provide a normalisation base. An initial approximation using the phone transition boundaries reduces the retrieval time by a factor of 5, while achieving a slight improvement in word-spotting performance. To further reduce retrieval time, pattern matching, feature selection, and Gaussian selection techniques are applied to this approximate pass to give a total 50-fold increase in speed with little loss in performance. In addition, a low memory requirement means that these approaches can be implemented on any platform, including hand-held devices.
Keywords :
Gaussian processes; Viterbi detection; information retrieval systems; pattern matching; speech recognition; Gaussian selection techniques; Viterbi based word spotting; approximation; audio document retrieval; background Viterbi phone recognition path; feature selection; hand-held devices; keyword search; normalisation base; phone transition boundaries; retrieval time reduction; word spotting performance; Audio databases; Computer networks; Concatenated codes; Hidden Markov models; Lattices; Speech processing; Viterbi algorithm; Vocabulary
Conference_Title :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
0-7803-3192-3
DOI :
10.1109/ICASSP.1996.541148