Title :
Faceted topic retrieval of news video using joint topic modeling of visual features and speech transcripts
Author :
Wan, Kong-Wah ; Tan, Ah-Hwee ; Lim, Joo-Hwee ; Chia, Liang-Tien
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Abstract :
Because of the inherent ambiguity in user queries, an important task of modern retrieval systems is faceted topic retrieval (FTR), which relates to the goal of returning diverse or novel information elucidating the wide range of topics or facets of the query need. We introduce a generative model for hypothesizing facets in the (news) video domain by combining the complementary information in the visual keyframes and the speech transcripts. We evaluate the efficacy of our multimodal model on the standard TRECVID-2005 video corpus annotated with facets. We find that: (1) the joint modeling of the visual and text (speech transcripts) information can achieve significant F-score improvement over a text-alone system; (2) our model compares favorably with standard diverse ranking algorithms such as the MMR. Our FTR model has been implemented on a news search prototype that is undergoing commercial trial.
Keywords :
query processing; video retrieval; FTR model; MMR; complementary information; diverse ranking algorithms; faceted topic retrieval; joint topic modeling; modern retrieval systems; multimodal model; news video; query need; significant F-score improvement; speech transcripts information; standard TRECVID-2005 video corpus; text information; text-alone system; user query; visual features; visual information; visual keyframes; Computational modeling; Frequency division multiplexing; Joints; Measurement; Speech; Speech recognition; Visualization; Faceted Topic Retrieval; Latent Dirichlet Allocation; Multimedia Topic Modeling;
Conference_Titel :
Multimedia and Expo (ICME), 2010 IEEE International Conference on
Conference_Location :
Suntec City
Print_ISBN :
978-1-4244-7491-2
DOI :
10.1109/ICME.2010.5583061