Title :
Filtering spam in Weibo using ensemble imbalanced classification and knowledge expansion
Author :
Zhipeng Jin ; Qiudan Li ; Zeng, Daniel ; Lei Wang
Author_Institution :
State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
Abstract :
Weibo has become an important information sharing platform in our daily life in China. Many applications utilize Weibo data to analyze hot topic and opinion evolution patterns to gain insights into user behavior. However, various spam messages degrade the performance of these applications and thus are essential to be filtered. In this paper, we propose a unified spam detection approach, which utilizes external knowledge sources to expand keywords features and applies an ensemble under-sampling based strategy to handle the class-imbalance problem. The experimental results show the effectiveness and robustness of our approach in Weibo data.
Keywords :
information filtering; learning (artificial intelligence); pattern classification; security of data; social networking (online); unsolicited e-mail; China; Weibo data; class-imbalance problem; ensemble imbalanced classification; external knowledge source; knowledge expansion; spam filtering; unified spam detection approach; Feature extraction; Information filtering; Logistics; Twitter; Unsolicited electronic mail; class-imbalance learning; ensemble learning; external knowledge expansion; spam detection;
Conference_Titel :
Intelligence and Security Informatics (ISI), 2015 IEEE International Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
978-1-4799-9888-3
DOI :
10.1109/ISI.2015.7165952