مرکز منطقه ای اطلاع رساني علوم و فناوري - Maximizing pattern separation in discretizing continuous features for classification purposes

DocumentCode :

2495946

Title :

Maximizing pattern separation in discretizing continuous features for classification purposes

Author :

Ferrari, Enrico ; Muselli, Marco

Author_Institution :

Inst. of Electron., Comput. & Telecommun. Eng., Genoa, Italy

fYear :

2010

fDate :

18-23 July 2010

Firstpage :

Lastpage :

Abstract :

Discretization is a fundamental phase for many classification algorithms: it aims at finding a proper set of cutoffs that subdivide a continuous domain into homogeneous intervals; the points in each interval should have a high probability of belonging to the same class. This paper proposes two different approaches for discretization: the first one consists in retrieving the optimal set of separation points through the solution of a proper linear programming problem. Since the optimal solution may require an excessive computational burden, an alternative technique, based on the iterative addition of separation points, is described. The greedy algorithm is evaluated on some artificial datasets and compared with other well-known discretization techniques such as EntMDL. The results of the simulations show the good performances of the novel algorithm in terms both of accuracy of the solution and of computational effort required for its generation.

Keywords :

greedy algorithms; learning (artificial intelligence); linear programming; pattern classification; EntMDL; classification algorithms; discretizing continuous features; greedy algorithm; linear programming; pattern separation; Propulsion;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Neural Networks (IJCNN), The 2010 International Joint Conference on

Conference_Location :

Barcelona

ISSN :

1098-7576

Print_ISBN :

978-1-4244-6916-1

Type :

conf

DOI :

10.1109/IJCNN.2010.5596838

Filename :

5596838

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2495946