DocumentCode :
1503024
Title :
Integrating Recognition and Retrieval With Relevance Feedback for Spoken Term Detection
Author :
Lee, Hung-yi ; Chen, Chia-ping ; Lee, Lin-shan
Author_Institution :
Dept. of Communication Engineering, National Taiwan University, Taipei, Taiwan
Volume :
20
Issue :
7
fYear :
2012
Firstpage :
2095
Lastpage :
2110
Abstract :
Recognition and retrieval are typically viewed as two cascaded, independent modules for spoken term detection (STD). Retrieval techniques are assumed to be applied on top of automatic speech recognition (ASR) output, with performance depending on ASR accuracy. We propose a framework that integrates recognition and retrieval and considers them jointly in order to yield better STD performance. This can be achieved either by adjusting the acoustic model parameters (model-based) or by considering detected examples (example-based), using relevance information provided by the user (user relevance feedback) or inferred by the system (pseudo-relevance feedback), either for a given query (short-term context) or by taking into account many previous queries (long-term context). Such relevance feedback approaches have long been used in text information retrieval, but they are rarely considered for spoken content and cannot be directly applied to its retrieval. The proposed relevance feedback approaches are specific to spoken content retrieval and hence differ substantially from those developed for text retrieval, which operate only on text symbols. We not only present these relevance feedback scenarios and approaches for STD, but also propose a framework that integrates them. Preliminary experiments showed significant improvements in each case.
Keywords :
acoustic signal processing; relevance feedback; speech recognition; text analysis; acoustic model parameter; automatic speech recognition; long-term context; pseudo-relevance feedback; short-term context; spoken content retrieval; spoken term detection; text information retrieval; text retrieval; user relevance feedback; Accuracy; Acoustics; Information retrieval; Lattices; Multimedia communication; Speech; Speech recognition; Relevance feedback; spoken term detection;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2196514
Filename :
6189746
Link To Document :