Title :
Finding Participants in a Chat: Authorship Attribution for Conversational Documents
Author :
Inches, Giacomo ; Harvey, Matthew ; Crestani, Fabio
Author_Institution :
Fac. of Inf., Univ. of Lugano (USI), Lugano, Switzerland
Abstract :
In this work we study the problem of Authorship Attribution for a novel set of documents, namely online chats. Although the problem of Authorship Attribution has been extensively investigated for different document types, from books to letters and from emails to blog posts, to the best of our knowledge this is the first study of Authorship Attribution for conversational documents (IRC chat logs) using statistical models. We experimentally demonstrate the unsuitability of the classical statistical models for conversational documents and propose a novel approach which is able to achieve a high accuracy rate (up to 95%) for hundreds of authors.
Keywords :
document handling; interactive programming; social networking (online); statistical analysis; IRC chat logs; authorship attribution; conversational documents; online chats; statistical models; Equations; Maximum likelihood estimation; Media; Smoothing methods; Standards; Training; Vocabulary; authorship attribution; conversational documents; irc chats; text mining;
Conference_Titel :
Social Computing (SocialCom), 2013 International Conference on
Conference_Location :
Alexandria, VA
DOI :
10.1109/SocialCom.2013.45