Title :
Phonetic recognition for spoken document retrieval
Author :
Ng, Kenney ; Zue, Victor W.
Author_Institution :
Spoken Language Syst. Group, MIT, Cambridge, MA, USA
Abstract :
This paper describes the development and application of a phonetic recognition system to the task of spoken document retrieval. The recognizer is used to generate phonetic transcriptions of the speech messages which are then processed to produce subword unit representations for indexing and retrieval. Subword units are used as an alternative to words units generated by either keyword spotting or word recognition. We first investigate the use of different acoustic and language models in the speech recognizer in an effort to improve phonetic recognition performance. Then we examine a variety of subword unit indexing terms and measure their ability to perform effective spoken document retrieval. Finally, we look at some simple robust indexing and retrieval methods that take into account the characteristics of the recognition errors in an attempt to improve retrieval performance
Keywords :
acoustic signal processing; indexing; information retrieval; natural languages; speech recognition; speech synthesis; acoustic models; indexing; keyword spotting; language models; phonetic recognition performance; phonetic transcriptions; recognition errors; robust methods; speech messages; spoken document retrieval; subword unit indexing terms; subword unit representations; word recognition; Application software; Computer science; Indexing; Information retrieval; Laboratories; Natural languages; Performance evaluation; Robustness; Speech processing; Speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674433