DocumentCode :
2660405
Title :
Open vocabulary spoken document retrieval by subword sequence obtained from speech recognizer
Author :
Kuriki, Go ; Itoh, Yoshio ; Kojima, Kazunori ; Ishigame, Masaaki ; Tanaka, Kazuyo ; Lee, Shi-wook
Author_Institution :
Fac. of Software & Inf., Iwate Prefectural Univ., Takizawa
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
301
Lastpage :
304
Abstract :
We present a method for open vocabulary retrieval based on a spoken document retrieval (SDR) system using subword models. The present paper proposes a new approach to open vocabulary SDR system using subword models which do not require subword recognition. Instead, subword sequences are obtained from the phone sequence outputted containing an out of vocabulary (OOV) word, a speech recognizer outputs a word sequence whose phone sequence is considered to be similar to the OOV word. When OOV words are provided in a query, the proposed system is able to retrieve the target section by comparing the phone sequences of the query and the word sequence generated by the speech recognizer.
Keywords :
information retrieval; speech recognition; vocabulary; open vocabulary retrieval; out-of-vocabulary word; phone sequences; speech recognizer; spoken document retrieval; subword sequence; Speech recognition; Vocabulary; open vocabulary; spoken document retrieval; subword; subword sequence;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
Type :
conf
DOI :
10.1109/SLT.2008.4777900
Filename :
4777900
Link To Document :
بازگشت