مرکز منطقه ای اطلاع رساني علوم و فناوري - Filtering spam in Weibo using ensemble imbalanced classification and knowledge expansion

DocumentCode :

725730

Title :

Filtering spam in Weibo using ensemble imbalanced classification and knowledge expansion

Author :

Zhipeng Jin ; Qiudan Li ; Zeng, Daniel ; Lei Wang

Author_Institution :

State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China

fYear :

2015

fDate :

27-29 May 2015

Firstpage :

132

Lastpage :

134

Abstract :

Weibo has become an important information sharing platform in our daily life in China. Many applications utilize Weibo data to analyze hot topic and opinion evolution patterns to gain insights into user behavior. However, various spam messages degrade the performance of these applications and thus are essential to be filtered. In this paper, we propose a unified spam detection approach, which utilizes external knowledge sources to expand keywords features and applies an ensemble under-sampling based strategy to handle the class-imbalance problem. The experimental results show the effectiveness and robustness of our approach in Weibo data.

Keywords :

information filtering; learning (artificial intelligence); pattern classification; security of data; social networking (online); unsolicited e-mail; China; Weibo data; class-imbalance problem; ensemble imbalanced classification; external knowledge source; knowledge expansion; spam filtering; unified spam detection approach; Feature extraction; Information filtering; Logistics; Twitter; Unsolicited electronic mail; class-imbalance learning; ensemble learning; external knowledge expansion; spam detection;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligence and Security Informatics (ISI), 2015 IEEE International Conference on

Conference_Location :

Baltimore, MD

Print_ISBN :

978-1-4799-9888-3

Type :

conf

DOI :

10.1109/ISI.2015.7165952

Filename :

7165952

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=725730