DocumentCode
2491163
Title
Tree Cluster of Text Data by NMF Based Neural Network
Author
Barman, Paresh Chandra ; Lee, Soo-Young
Author_Institution
Dept. of BioSystems, Korea Adv. Inst. of Sci. & Technol., Daejeon
fYear
2006
fDate
19-21 Dec. 2006
Firstpage
312
Lastpage
315
Abstract
This paper proposes the tree clustering of text documents using non-negative matrix factorization (NMF) based neural network. The main problem of the tree clustering is to find the number of branches of a parent node of the tree and to find the significant attributes which can cluster the documents of each node. In advance if we don´t know the exact number of branches then it is not applicable for tree clustering. The paper proposed the min-max correlation coefficient method for finding the number of branches and by reducing the NMF basis vectors dimension according to their probability and removing the common terms from the NMF basis vectors, the authors find the significant attributes to cluster the documents. The approach is also helpful for prepruning the tree. For the justification of the approaches the authors use the CLASSIC3 text database
Keywords
matrix decomposition; minimax techniques; neural nets; probability; trees (mathematics); CLASSIC3 text database; min-max correlation coefficient method; neural network; nonnegative matrix factorization; probability; significant attributes; text data; text documents; tree clustering; Biochemical analysis; Clustering algorithms; Computer networks; Data analysis; Data engineering; Data mining; Electronic mail; Image analysis; Neural networks; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 2006. ICECE '06. International Conference on
Conference_Location
Dhaka
Print_ISBN
98432-3814-1
Type
conf
DOI
10.1109/ICECE.2006.355634
Filename
4178470
Link To Document