Title :
Improving recall of k-nearest neighbor algorithm for classes of uneven size
Author :
Boiculese, Vasile Lucian ; Dimitriu, Gabriel ; Moscalu, Mihaela
Author_Institution :
Dept. of Med. Inf. & Biostat., Grigore T. Popa Univ. of Med. & Pharmacy, Iasi, Romania
Abstract :
The k-nearest neighbor algorithm is one of the most suitable method of classification for its simplicity, adaptability and performance. The real problem arises when classes do overlap and when samples size is unevenly distributed between categories. Many studies present optimization techniques on discriminant metrics, on weighting the features, on using probabilistic measures or adjusting the prototypes position. Classes that are represented by a small sample size are overwhelmed by the large number of prototypes of dominated groups. In this paper we describe a method of weighting the prototypes for each class of the k nearest neighbors to cope with the uneven distribution of data. The proposed method increases the classification rate in terms of recall measure.
Keywords :
feature extraction; medical computing; optimisation; pattern classification; probability; classification rate; discriminant metrics; feature weighting; k-nearest neighbor algorithm recall; optimization techniques; probabilistic measures; prototypes position; sample size; uneven data distribution; uneven size classes; Accuracy; Hip; classification; k-nearest neighbor; uneven size classes;
Conference_Titel :
E-Health and Bioengineering Conference (EHB), 2013
Conference_Location :
Iasi
Print_ISBN :
978-1-4799-2372-4
DOI :
10.1109/EHB.2013.6707403