DocumentCode :
292050
Title :
Authorship attribution of text samples using neural networks and Bayesian classifiers
Author :
Kjell, Bradley
Author_Institution :
Dept. of Comput. Sci., Central Connecticut State Univ., New Britain, CT, USA
Volume :
2
fYear :
1994
fDate :
2-5 Oct 1994
Firstpage :
1660
Abstract :
Previous work has shown that statistics of letter pairs extracted from text samples can be effective in discriminating between two authors writing in a similar style. This paper extends that work by using n-tuples for n from 1 to 5. The features used in classification are the relative frequencies of the tuples, transformed with a KL transform. Both three layer neural network classifiers and Bayesian classifiers are used with these features to classify text samples from two similar authors. The most effective combination was 2-tuples used with a neural network classifier, although other combinations did nearly as well
Keywords :
Bayes methods; document handling; feature extraction; feedforward neural nets; pattern classification; statistical analysis; Bayesian classifiers; KL transform; authorship attribution; classification; feature extraction; multilayer neural network classifiers; text samples; tuples; writing style; Bayesian methods; Computer science; Concatenated codes; Displays; Frequency; Karhunen-Loeve transforms; Neural networks; Statistics; Testing; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics, 1994. Humans, Information and Technology., 1994 IEEE International Conference on
Conference_Location :
San Antonio, TX
Print_ISBN :
0-7803-2129-4
Type :
conf
DOI :
10.1109/ICSMC.1994.400086
Filename :
400086
Link To Document :
بازگشت