DocumentCode
3300654
Title
Dimensionality reduction for text using LLE
Author
HE, Chuan ; DONG, Zhe ; Li, Ruifan ; Zhong, Yixin
Author_Institution
Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., Beijing
fYear
2008
fDate
19-22 Oct. 2008
Firstpage
1
Lastpage
7
Abstract
Dimensionality reduction is a necessary preprocessing step in many fields of information processing such as information retrieval, pattern recognition and data compression. Its goal is to discover the representative or the discriminative information residing in raw data. Locally linear embedding (LLE), one of effective manifold learning algorithms, addresses this problem by computing low-dimensional, neighborhood preserving embeddings of high-dimensional data. The embedding is derived from the symmetries for locally linear reconstructions. And the computation of this embedding is related to an eigen-problem in the implement. Since LLE was proposed, it has been being applied to deal with image data only because it originated from facial recognition. However, the problem of curse of dimensionality is very prevalent. Therefore, we here try to apply this algorithm for text processing. In this paper, we introduce the LLE briefly and analyze its advantage and latent disadvantages, and the relationship between LSI and LLE in the graph embedding framework is then discussed from a theoretic view. Finally, the experimental results are show with the datasets of Reuters21578 and TDT2.
Keywords
text analysis; data compression; dimensionality reduction; discriminative information; eigen-problem; facial recognition; graph embedding framework; image data; information processing; information retrieval; locally linear embedding; locally linear reconstruction; manifold learning algorithm; pattern recognition; raw data; text processing; Covariance matrix; Data compression; Embedded computing; Feature extraction; Image reconstruction; Information retrieval; Large scale integration; Pattern recognition; Principal component analysis; Text processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2008. NLP-KE '08. International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-4515-8
Electronic_ISBN
978-1-4244-2780-2
Type
conf
DOI
10.1109/NLPKE.2008.4906771
Filename
4906771
Link To Document