مرکز منطقه ای اطلاع رساني علوم و فناوري - Cross-Domain Feature Learning in Multimedia

DocumentCode :

66312

Title :

Cross-Domain Feature Learning in Multimedia

Author :

Xiaoshan Yang ; Tianzhu Zhang ; Changsheng Xu

Author_Institution :

Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China

Volume :

Issue :

fYear :

2015

fDate :

Jan. 2015

Firstpage :

Lastpage :

Abstract :

In the Web 2.0 era, a huge number of media data, such as text, image/video, and social interaction information, have been generated on the social media sites (e.g., Facebook, Google, Flickr, and YouTube). These media data can be effectively adopted for many applications (e.g., image/video annotation, image/video retrieval, and event classification) in multimedia. However, it is difficult to design an effective feature representation to describe these data because they have multi-modal property (e.g., text, image, video, and audio) and multi-domain property (e.g., Flickr, Google, and YouTube). To deal with these issues, we propose a novel cross-domain feature learning (CDFL) algorithm based on stacked denoising auto-encoders. By introducing the modal correlation constraint and the cross-domain constraint in conventional auto-encoder, our CDFL can maximize the correlations among different modalities and extract domain invariant semantic features simultaneously. To evaluate our CDFL algorithm , we apply it to three important applications: sentiment classification, spam filtering, and event classification. Comprehensive evaluations demonstrate the encouraging performance of the proposed approach.

Keywords :

learning (artificial intelligence); multimedia systems; social networking (online); Facebook; Flickr; Google; Web 2.0; YouTube; cross-domain constraint; cross-domain feature learning; event classification; feature representation; modal correlation constraint; multidomain property; multimedia; multimodal property; sentiment classification; social media sites; spam filtering; stacked denoising auto-encoder; Correlation; Google; Media; Multimedia communication; Noise reduction; Semantics; Streaming media; Cross-domain; deep learning; feature learning; multi-modal;

fLanguage :

English

Journal_Title :

Multimedia, IEEE Transactions on

Publisher :

ieee

ISSN :

1520-9210

Type :

jour

DOI :

10.1109/TMM.2014.2375793

Filename :

6971165

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=66312