DocumentCode :
2184165
Title :
A combined approach of formal concept analysis and text mining for concept based document clustering
Author :
Myat, Nyeint Nyeint ; Hla, Khin Haymar Saw
fYear :
2005
fDate :
19-22 Sept. 2005
Firstpage :
330
Lastpage :
333
Abstract :
Nowadays, the demand of conceptual document clustering is becoming increase to manage various types of vast amount of information published on the World Wide Web. In this paper, we use formal concept analysis (FCA) method for clustering documents according to their formal contexts. Concept hierarchy of documents is built using the formal concepts of the documents in the document corpus. We use tf.idf (term frequency × inverse document frequency) term weighting model to reduce less useful concepts from these formal concepts and the association and correlation mining techniques to analyze the relationship of terms in the document corpus.
Keywords :
Internet; data mining; text analysis; World Wide Web; concept based document clustering; correlation mining; formal concept analysis; inverse document frequency; term frequency; text mining; Clustering algorithms; Frequency; Information retrieval; Itemsets; Organizing; Partitioning algorithms; Publishing; Search engines; Text mining; Web sites; association; conceptual document clustering; correlation; formal concept analysis (FCA); frequent termsets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2415-X
Type :
conf
DOI :
10.1109/WI.2005.1
Filename :
1517867
Link To Document :
بازگشت