Data condensation in large databases by incremental learning with support vector machines

Author

Mitra, Pabitra ; Murthy, C.A. ; Pal, Sankar K.

Author_Institution

Machine Intelligence Unit, Indian Stat. Inst., Calcutta, India

Volume

2

fYear

2000

fDate

2000

Firstpage

708

Abstract

An algorithm for data condensation using support vector machines (SVM) is presented. The algorithm extracts data points lying close to the class boundaries, which form a much reduced but critical set for classification. The problem of large memory requirements for training SVM in batch mode is circumvented by adopting an active incremental learning algorithm. The learning strategy is motivated from the condensed nearest neighbor classification technique. Experimental results presented show that such active incremental learning enjoy superiority in terms of computation time and condensation ratio, over related methods

Keywords

computational complexity; data warehouses; learning (artificial intelligence); learning automata; pattern classification; SVM; active incremental learning algorithm; batch mode training; class boundaries; computation time; condensed nearest neighbor classification technique; data condensation; data point extraction; incremental learning; large databases; large memory requirements; pattern classification; support vector machines; Data mining; Databases; Machine intelligence; Machine learning; Machine learning algorithms; Nearest neighbor searches; Quadratic programming; Sampling methods; Support vector machine classification; Support vector machines;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, 2000. Proceedings. 15th International Conference on

Conference_Location

Barcelona

ISSN

1051-4651

Print_ISBN

0-7695-0750-6

Type

conf

DOI

10.1109/ICPR.2000.906173

Filename

906173