Regional Information Center for Science and Technology - Transductive Inference with Hierarchical Clustering for Video Annotation

DocumentCode :

3197151

Title :

Transductive Inference with Hierarchical Clustering for Video Annotation

Author :

Qi, Guo-Jun ; Hua, Xian-Sheng ; Song, Yan ; Tang, Jinhui ; Zhang, Hong-Jiang

Author_Institution :

Univ. of Sci. & Technol. of China, Hefei

fYear :

2007

fDate :

2-5 July 2007

Firstpage :

643

Lastpage :

646

Abstract :

In this paper, we present a novel framework for video semantic detection based on transductive inference and hierarchical clustering. It focuses directly on predicting the labels of the samples available in the current unlabeled pool, instead of trying to build a classifier that works for arbitrary unseen data. In this framework, a number of hierarchical clustering results are constructed from the entire video dataset, which contains both labeled and unlabeled examples. We aim to make the clusters as pure as possible, i.e., samples in the same cluster mostly share the same label. To further purify these hierarchical clustering results, an EM-based cluster-tuning algorithm is applied iteratively. Based on these clustering results, several hypotheses are generated by probability voting among the labeled samples in the obtained clusters. One of these hypotheses is chosen according to the Vapnik combined bound and is then applied to predict the labels of the unlabeled samples. The selected transductive hypothesis, which is concerned only with predicting the available unlabeled samples in the test set rather than producing a general classifier as in inductive learning, exploits the structure and distribution of the unlabeled pool to achieve a minimal test error bound. It can therefore generalize better for video annotation, which is supported both theoretically and by our experimental results.
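
The pipeline described in the abstract (cluster labeled and unlabeled samples together, vote within clusters, then pick one hypothesis by a bound) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy 2-D data, and the single-linkage clustering are assumptions made for illustration, the EM-based cluster-tuning step is omitted, and the selection score is a simple Occam-style penalty standing in for the Vapnik combined bound, whose exact form the record does not give.

```python
from collections import Counter
from math import dist, log, sqrt


def agglomerate(points, k):
    """Naive single-linkage agglomerative clustering, cut at k clusters."""
    clusters = [[i] for i in range(len(points))]
    while len(clusters) > k:
        best = None  # (distance, index_a, index_b) of the closest cluster pair
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = min(dist(points[i], points[j])
                        for i in clusters[a] for j in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] += clusters[b]
        del clusters[b]
    return clusters


def vote(clusters, labels):
    """Give every sample the majority label of the labeled members of its cluster."""
    fallback = Counter(labels.values()).most_common(1)[0][0]
    pred = {}
    for cluster in clusters:
        votes = Counter(labels[i] for i in cluster if i in labels)
        winner = votes.most_common(1)[0][0] if votes else fallback
        for i in cluster:
            pred[i] = winner
    return pred


def select_hypothesis(points, labels, ks):
    """Build one hypothesis per cut level and keep the one with the lowest score.

    ASSUMPTION: score = labeled-sample error + sqrt(k * ln 2 / (2 m)),
    a stand-in penalty, NOT the Vapnik combined bound used in the paper.
    """
    m = len(labels)
    best = None
    for k in ks:
        pred = vote(agglomerate(points, k), labels)
        err = sum(pred[i] != y for i, y in labels.items()) / m
        score = err + sqrt(k * log(2) / (2 * m))
        if best is None or score < best[0]:
            best = (score, k, pred)
    return best[1], best[2]


# Hypothetical toy data: two well-separated groups, two labeled samples in each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]
labels = {0: 1, 3: 1, 4: 0, 7: 0}  # sample index -> known label
best_k, pred = select_hypothesis(points, labels, ks=(1, 2, 4))
```

On this toy data the two-cluster cut is selected, and the unlabeled samples inherit the label of the group they fall in. The prediction is made only for the samples in the pool, which is the transductive setting the abstract describes.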

Keywords :

content-based retrieval; expectation-maximisation algorithm; inference mechanisms; iterative methods; learning (artificial intelligence); pattern clustering; probability; video retrieval; video signal processing; EM based cluster-tuning algorithm; Vapnik combined bound; content-based video search; generalization ability; hierarchical clustering; probability voting; transductive hypothesis; transductive inference learning; unlabeled samples; video annotation; video semantic detection; Asia; Clustering algorithms; Computational efficiency; Humans; Inference algorithms; Iterative algorithms; Labeling; Testing; Voting;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Multimedia and Expo, 2007 IEEE International Conference on

Conference_Location :

Beijing

Print_ISBN :

1-4244-1016-9

Electronic_ISBN :

1-4244-1017-7

Type :

conf

DOI :

10.1109/ICME.2007.4284732

Filename :

4284732

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3197151