Title :
The Novel k Nearest Neighbor Algorithm
Author :
Jivani, Anjali G.
Author_Institution :
Dept. of Comput. Sci. & Eng., Maharaja Sayajirao Univ. of Baroda, Vadodara, India
Abstract :
In the field of Text Classification/Categorization, the k Nearest Neighbor algorithm (kNN) has been to date one of the oldest and most popular methods. It has been experimented upon, implemented and tested by many researchers all over the world. There have been variations in the implementation of this algorithm and I have in this paper done the same. As the name suggests the method is dependent on the parameter `k´ which can drastically change the output as we vary its values. When the training set contains classes of unequal sizes, the test data is likely to get classified to a class which has more samples than the actual class it belongs to, if that actual class has less number of samples. In the proposed method, I have added a small variation to the classic kNN and have named this method `The Novel k Nearest Neighbor Algorithm´. The parameter k in this method depends on the size of the smallest class sample.
Keywords :
data mining; pattern classification; text analysis; class sample; kNN algorithm; novel k nearest neighbor algorithm; text categorization; text classification; text mining; training set; unequal sizes; Algorithm design and analysis; Classification algorithms; Computers; Testing; Text categorization; Training; Vectors; k nearest neighbors; text categorization; text classification; text mining;
Conference_Titel :
Computer Communication and Informatics (ICCCI), 2013 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4673-2906-4
DOI :
10.1109/ICCCI.2013.6466287