DocumentCode :
394413
Title :
Extracting latent structures in numerical classification: an investigation using two factor models
Author :
Choudhury, Arindum ; Ong, YewSoon ; Keane, A.J.
Author_Institution :
Computational Eng. & Design Center, Southampton Univ., UK
Volume :
4
fYear :
2002
fDate :
18-22 Nov. 2002
Firstpage :
1842
Abstract :
We investigate the use of SVD based two factor models for numerical data classification. Motivations for such a study include the widespread success of such models (e.g, LSI) in textual information retrieval, emerging connections with well established statistical techniques and the increasing occurrence of mixed mode (text-and-numeric) data. A direct extension as well as an efficient modification of the LSI model applied to numerical data problems are presented and the associated problems and likely remedies discussed. The techniques under investigation are shown to perform competitively with respect to popular existing numerical classification techniques on a range of synthetic and real world benchmark data. In particular, we show that the modified LSI proposed in this work avoids confronting the optimal subspace selection problem yet generalizes well and remains computationally efficient for large data.
Keywords :
classification; indexing; singular value decomposition; LSI model; SVD; data modeling; information retrieval; numerical data classification; semantic indexing; similarity metric; Data engineering; Data mining; Design engineering; Filtering; Indexing; Information retrieval; Large scale integration; Multidimensional systems; Numerical models; Predictive models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN :
981-04-7524-1
Type :
conf
DOI :
10.1109/ICONIP.2002.1198992
Filename :
1198992
Link To Document :
بازگشت