Title :
Latent semantic retrieval of spoken documents over position specific posterior lattices
Author :
Chang, Hung-lin ; Pan, Yi-Cheng ; Lee, Lin-shan
Author_Institution :
Grad. Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei
Abstract :
This paper presents a new approach to latent semantic retrieval of spoken documents over Position Specific Posterior Lattices (PSPL). Based on Probabilistic Latent Semantic Analysis (PLSA), the approach performs concept matching instead of literal term matching during retrieval, so as to solve the problem of term mismatch between the query and the desired spoken documents. It operates over PSPL in order to consider the multiple hypotheses generated by the automatic speech recognition (ASR) process, as well as the position information for these hypotheses, so as to alleviate the problem of relatively poor ASR accuracy. We establish a framework to evaluate the semantic relevance between terms and the relevance score between a query and a PSPL, both based on the latent topic information from PLSA. Preliminary experiments on Chinese broadcast news segments showed that significant improvements can be obtained with the proposed approach.
Keywords :
content-based retrieval; information retrieval; Chinese broadcast news segments; Probabilistic Latent Semantic Analysis; concept matching; latent semantic retrieval; position specific posterior lattices; semantic relevance; spoken documents; Automatic speech recognition; Computer science; Content based retrieval; IP networks; Information analysis; Information retrieval; Lattices; Material storage; Multimedia communication; Performance analysis; Semantics; Spoken Document Retrieval
Conference_Title :
Spoken Language Technology Workshop, 2008. SLT 2008. IEEE
Conference_Location :
Goa
Print_ISBN :
978-1-4244-3471-8
Electronic_ISBN :
978-1-4244-3472-5
DOI :
10.1109/SLT.2008.4777896