DocumentCode :
1791855
Title :
A challenge of authorship identification for ten-thousand-scale microblog users
Author :
Okuno, Syunya ; Asai, Hiroki ; Yamana, Hayato
Author_Institution :
Dept. of Comput. Sci. & Eng., Waseda Univ., Tokyo, Japan
fYear :
2014
fDate :
27-30 Oct. 2014
Firstpage :
52
Lastpage :
54
Abstract :
Internet security issues require authorship identification for all kinds of internet contents; however, authorship identification for microblog users is much harder than other documents because microblog texts are too short. Moreover, when the number of candidates becomes large, i.e., big data, it will take long time to identify. Our proposed method solves these problems. The experimental results show that our method successfully identifies the authorship with 53.2% of precision out of 10,000 microblog users in the almost half execution time of previous method.
Keywords :
Big Data; security of data; social networking (online); Internet security issues; authorship identification; big data; microblog texts; ten-thousand-scale microblog users; Big data; Blogs; Computers; Distance measurement; Internet; Security; Training; Twitter; authorship attribution; authorship detection; authorship identification; microblog;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
Type :
conf
DOI :
10.1109/BigData.2014.7004491
Filename :
7004491
Link To Document :
بازگشت