Author :
Xian-Sheng Hua ; Jin Li
Author_Institution :
Microsoft Res., Redmond, WA, USA
Abstract :
“Tell Me What” is smart phone based image recognition system, and it is also an automatic pipeline for generating image recognition systems to recognize an arbitrary set of entities. For any given set of entities, “Tell Me What” backend system automatically fetches related image data from the Internet for each entity, and then run a comprehensive data cleaning process to purify the data. A multi-class classifier and inverted index are then built based on the cleaned data. For an unknown new image captured by a camera, the user is allowed to optionally highlight regions and then a classification process and a search process are applied to get recognition results. Distributed computing techniques are applied to ensure that the backend model and index generation processes can be done in a few hours.
Keywords :
Internet; cameras; image classification; smart phones; Internet; Tell Me What backend system; backend model; camera; classification process; data cleaning process; data purification; distributed computing techniques; index generation process; inverted index; multiclass classifier; search process; smart phone based image recognition system; Buildings; Feature extraction; Image recognition; Indexes; Internet; Pipelines; Training data; Image recognition; image understanding;
Conference_Titel :
Multimedia and Expo Workshops (ICMEW), 2014 IEEE International Conference on
Conference_Location :
Chengdu
DOI :
10.1109/ICMEW.2014.6890616