DocumentCode :
2183528
Title :
Civil Transportation Event Extraction from Chinese Microblog
Author :
Jiaxi Xiong ; Yonggang Hao ; Zheng Huang
Author_Institution :
Dept. of Inf. Security & Eng., Shanghai Jiaotong Univ., Shanghai, China
fYear :
2013
fDate :
16-19 Dec. 2013
Firstpage :
577
Lastpage :
582
Abstract :
People produce hundreds of millions of microblogs everyday. With its 140-character message, Microblog has yielded an enormous corpus of information, which is noisy but informative in some way. However, previous work with standard NLP tools of event extraction performs poorly on Microblog. In this paper, we adopt a series of methods to extract events from Chinese microblogs. In particular, we grab the chatters from Sina Weibo to extract civil transportation information. We eliminate buzz in Weibo, use CRF methods to filter microblogs so as to focus on transportation, and we also use CRF to recognize named entities and to extract events.
Keywords :
information retrieval; social networking (online); CRF methods; Chinese microblog; NLP tools; Sina Weibo; civil transportation event extraction; conditional random fields; named entity recognition; natural language processing; Data mining; Noise; Standardization; Tagging; Testing; Training; Transportation; CRF; Chinese Microblog; Event Extraction; NER; NLP;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on
Conference_Location :
Fuzhou
Print_ISBN :
978-1-4799-2829-3
Type :
conf
DOI :
10.1109/CLOUDCOM-ASIA.2013.48
Filename :
6821052
Link To Document :
بازگشت