DocumentCode :
2691683
Title :
Investigating hybrids of evolutionary search and linear discriminant analysis for authorship attribution
Author :
Shaker, Kareem ; Corne, David ; Everson, Richard
Author_Institution :
Heriot-Watt Univ., Edinburgh
fYear :
2007
fDate :
25-28 Sept. 2007
Firstpage :
2071
Lastpage :
2077
Abstract :
Authorship attribution is the problem of determining who is (or was) the author of one or more texts, in cases where authorship is disputed. There are many well- known cases of disputed authorship; in this paper we consider the Federalist papers, and the 15th Book of Oz. We treat the problem as a supervised classification problem, and use evolutionary algorithms to search through subsets of function words, which in turn form the basis of predicting authorship via linear discriminant analysis. We compare two approaches (due to the size of the text corpora in dispute, extensive experimentation is difficult), both centred around the optimization of ROC curves. On both datasets, the hybrid EA approach was able to classify the disputed works with 100% accuracy, using small sets of function words comparable to or better than previous works on these cases.
Keywords :
evolutionary computation; text analysis; Federalist papers; authorship attribution; evolutionary algorithms; evolutionary search-linear discriminant analysis; function words; supervised classification problem; Books; Evolutionary computation; Linear discriminant analysis; Principal component analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Evolutionary Computation, 2007. CEC 2007. IEEE Congress on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-1339-3
Electronic_ISBN :
978-1-4244-1340-9
Type :
conf
DOI :
10.1109/CEC.2007.4424728
Filename :
4424728
Link To Document :
بازگشت