DocumentCode
1320084
Title
Interactive Spoken Document Retrieval With Suggested Key Terms Ranked by a Markov Decision Process
Author
Pan, Yi-Cheng ; Lee, Hung-yi ; Lee, Lin-shan
Author_Institution
MediaTek, Inc., Hsinchu, Taiwan
Volume
20
Issue
2
fYear
2012
Firstpage
632
Lastpage
645
Abstract
Interaction with users is a powerful strategy that potentially yields better information retrieval for all types of media, including text, images, and videos. While spoken document retrieval (SDR) is a crucial technology for multimedia access in the network era, it is also more challenging than text information retrieval because of the inevitable recognition errors. It is therefore reasonable to consider interactive functionalities for SDR systems. We propose an interactive SDR approach in which given the user´s query, the system returns not only the retrieval results but also a short list of key terms describing distinct topics. The user selects these key terms to expand the query if the retrieval results are not satisfactory. The entire retrieval process is organized around a hierarchy of key terms that define the allowable state transitions; this is modeled by a Markov decision process, which is popularly used in spoken dialogue systems. By reinforcement learning with simulated users, the key terms on the short list are properly ranked such that the retrieval success rate is maximized while the number of interactive steps is minimized. Significant improvements over existing approaches were observed in preliminary experiments performed on information needs provided by real users. A prototype system was also implemented.
Keywords
Markov processes; information needs; information retrieval; interactive systems; learning (artificial intelligence); multimedia computing; text analysis; Markov decision process; SDR system; inevitable recognition error; information needs; interactive functionality; interactive spoken document retrieval success rate; multimedia access; reinforcement learning; spoken dialogue system; text information retrieval process; user query; Economics; Information retrieval; Markov processes; Navigation; Prototypes; Speech; Videos; Spoken document retrieval (SDR); dialogue system;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2011.2163512
Filename
6018284
Link To Document