DocumentCode
394413
Title
Extracting latent structures in numerical classification: an investigation using two factor models
Author
Choudhury, Arindum ; Ong, YewSoon ; Keane, A.J.
Author_Institution
Computational Eng. & Design Center, Southampton Univ., UK
Volume
4
fYear
2002
fDate
18-22 Nov. 2002
Firstpage
1842
Abstract
We investigate the use of SVD based two factor models for numerical data classification. Motivations for such a study include the widespread success of such models (e.g, LSI) in textual information retrieval, emerging connections with well established statistical techniques and the increasing occurrence of mixed mode (text-and-numeric) data. A direct extension as well as an efficient modification of the LSI model applied to numerical data problems are presented and the associated problems and likely remedies discussed. The techniques under investigation are shown to perform competitively with respect to popular existing numerical classification techniques on a range of synthetic and real world benchmark data. In particular, we show that the modified LSI proposed in this work avoids confronting the optimal subspace selection problem yet generalizes well and remains computationally efficient for large data.
Keywords
classification; indexing; singular value decomposition; LSI model; SVD; data modeling; information retrieval; numerical data classification; semantic indexing; similarity metric; Data engineering; Data mining; Design engineering; Filtering; Indexing; Information retrieval; Large scale integration; Multidimensional systems; Numerical models; Predictive models;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN
981-04-7524-1
Type
conf
DOI
10.1109/ICONIP.2002.1198992
Filename
1198992
Link To Document