مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised feature selection using feature similarity

DocumentCode :

1260906

Title :

Unsupervised feature selection using feature similarity

Author :

Mitra, Pabitra ; Murthy, C.A. ; Pal, Sankar K.

Author_Institution :

Machine Intelligence Unit, Indian Stat. Inst., Calcutta, India

Volume :

Issue :

fYear :

2002

fDate :

3/1/2002 12:00:00 AM

Firstpage :

301

Lastpage :

312

Abstract :

In this article, we describe an unsupervised feature selection algorithm suitable for data sets, large in both dimension and size. The method is based on measuring similarity between features whereby redundancy therein is removed. This does not need any search and, therefore, is fast. A new feature similarity measure, called maximum information compression index, is introduced. The algorithm is generic in nature and has the capability of multiscale representation of data sets. The superiority of the algorithm, in terms of speed and performance, is established extensively over various real-life data sets of different sizes and dimensions. It is also demonstrated how redundancy and information loss in feature selection can be quantified with an entropy measure

Keywords :

computational complexity; data mining; feature extraction; entropy measure; feature similarity measure; maximum information compression index; multiscale data set representation; redundancy removal; unsupervised feature selection algorithm; Entropy; Loss measurement;

fLanguage :

English

Journal_Title :

Pattern Analysis and Machine Intelligence, IEEE Transactions on

Publisher :

ieee

ISSN :

0162-8828

Type :

jour

DOI :

10.1109/34.990133

Filename :

990133

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1260906