DocumentCode
2775243
Title
MSRA-MM 2.0: A Large-Scale Web Multimedia Dataset
Author
Li, Hao ; Wang, Meng ; Hua, Xian-Sheng
Author_Institution
Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
6-6 Dec. 2009
Firstpage
164
Lastpage
169
Abstract
In this paper, we introduce the second version of Microsoft Research Asia Multimedia (MSRA-MM), a dataset that aims to facilitate research in multimedia information retrieval and related areas. The images and videos in the dataset are collected from a commercial search engine with more than 1000 queries. It contains about 1 million images and 20,000 videos. We also provide the surrounding texts that are obtained from more than 1 million Web pages. The images and videos have been comprehensively annotated, including their relevance levels to corresponding queries, semantic concepts of images, and category and quality information of videos. We define six standard tasks on the dataset: (1) image search reranking; (2) image annotation; (3) query-by-example image search; (4) video search reranking; (5) video categorization; and (6) video quality assessment.
Keywords
Internet; information retrieval; multimedia computing; video signal processing; MSRA-MM 2.0; Microsoft Research Asia Multimedia; Web pages; commercial search engine; image annotation; image search reranking; large-scale Web multimedia dataset; multimedia information retrieval; query-by-example image search; video categorization; video quality assessment; video search reranking; Asia; Conferences; Data mining; Information retrieval; Large-scale systems; Multimedia computing; Quality assessment; Search engines; Videos; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on
Conference_Location
Miami, FL
Print_ISBN
978-1-4244-5384-9
Electronic_ISBN
978-0-7695-3902-7
Type
conf
DOI
10.1109/ICDMW.2009.46
Filename
5360509
Link To Document