DocumentCode
3673613
Title
Negative-Based Sampling for Multimedia Retrieval
Author
Hsin-Yu Ha;Shu-Ching Chen;Mei-Ling Shyu
Author_Institution
Sch. of Comput. &
fYear
2015
Firstpage
64
Lastpage
71
Abstract
Today, a profusion of multimedia data is produced and propagated around the world. One of the major challenges in identifying meaningful semantic concepts from such large amounts of data is the data imbalance problem. Data imbalance occurs when the number of positive instances (i.e., instances that contain the target concept) is far smaller than the number of negative instances (i.e., instances that do not contain the target concept). In other words, the ratio of positive to negative instances is extremely low. The usual remedy is to rebalance the dataset by sampling or data pruning. In this paper, we propose a sampling method that consists of three stages: selecting features to identify the negative instances, producing negative ranking scores, and performing sampling. The method is compared with several existing methods on the TRECVID dataset and is shown to perform better.
Keywords
"Semantics","Sampling methods","Feature extraction","Correlation","Training data","Multimedia communication","Videos"
Publisher
ieee
Conference_Titel
Information Reuse and Integration (IRI), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/IRI.2015.20
Filename
7300956