DocumentCode :
2660342
Title :
Latent semantic retrieval of spoken documents over position specific posterior lattices
Author :
Chang, Hung-lin ; Pan, Yi-Cheng ; Lee, Lin-shan
Author_Institution :
Grad. Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei
fYear :
2008
fDate :
15-19 Dec. 2008
Firstpage :
285
Lastpage :
288
Abstract :
This paper presents a new approach of latent semantic retrieval of spoken documents over Position Specific Posterior Lattices (PSPL). This approach performs concept matching instead of literal term matching during retrieval based on the Probabilistic Latent Semantic Analysis (PLSA), so as to solve the problem of term mismatch between the query and the desired spoken documents. This approach is performed over PSPL to consider the multiple hypotheses generated by ASR process, as well as the position information for these hypotheses, so as to alleviate the problem of relatively poor ASR accuracy. We establish a framework to evaluate semantic relevance between terms and the relevance score between a query and a PSPL, both based on the latent topic information from PLSA. Preliminary experiments on Chinese broadcast news segments showed significant improvements can be obtained with the proposed approach.
Keywords :
content-based retrieval; information retrieval; Chinese broadcast news segments; Probabilistic Latent Semantic Analysis; concept matching; latent semantic retrieval; position specific posterior lattices; semantic relevance; spoken documents; Automatic speech recognition; Computer science; Content based retrieval; IP networks; Information analysis; Information retrieval; Lattices; Material storage; Multimedia communication; Performance analysis; Semantics; Spoken Document Retrieval;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
Type :
conf
DOI :
10.1109/SLT.2008.4777896
Filename :
4777896
Link To Document :
بازگشت