DocumentCode
2909599
Title
Data condensation in large databases by incremental learning with support vector machines
Author
Mitra, Pabitra ; Murthy, C.A. ; Pal, Sankar K.
Author_Institution
Machine Intelligence Unit, Indian Stat. Inst., Calcutta, India
Volume
2
fYear
2000
fDate
2000
Firstpage
708
Abstract
An algorithm for data condensation using support vector machines (SVM) is presented. The algorithm extracts data points lying close to the class boundaries, which form a much reduced but critical set for classification. The problem of large memory requirements for training SVM in batch mode is circumvented by adopting an active incremental learning algorithm. The learning strategy is motivated from the condensed nearest neighbor classification technique. Experimental results presented show that such active incremental learning enjoy superiority in terms of computation time and condensation ratio, over related methods
Keywords
computational complexity; data warehouses; learning (artificial intelligence); learning automata; pattern classification; SVM; active incremental learning algorithm; batch mode training; class boundaries; computation time; condensed nearest neighbor classification technique; data condensation; data point extraction; incremental learning; large databases; large memory requirements; pattern classification; support vector machines; Data mining; Databases; Machine intelligence; Machine learning; Machine learning algorithms; Nearest neighbor searches; Quadratic programming; Sampling methods; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location
Barcelona
ISSN
1051-4651
Print_ISBN
0-7695-0750-6
Type
conf
DOI
10.1109/ICPR.2000.906173
Filename
906173
Link To Document