DocumentCode :
2387004
Title :
Concept Mining using Association Rules and Combinatorial Topology
Author :
Sutojo, Albert
Author_Institution :
San Jose State Univ., San Jose
fYear :
2007
fDate :
2-4 Nov. 2007
Firstpage :
387
Lastpage :
387
Abstract :
The collection of concepts in a document set can be represented by a geometric structure from combinatorial topology called a simplicial complex, in which each keyword is a vertex and each relation between keywords is a simplex. A simplex consisting of more than one keyword is a high-frequency keywordset: its keywords occur close to each other within a document, and they co-occur in this way frequently across the document set. This frequent co-occurrence indicates relations between the keywords, and these relations carry concepts. The relations can be captured by association rule mining and represented as simplices. The collection of all these simplices represents the structure of concepts within the document set. Based on this topology, documents are clustered, and the collection of simplices can serve as a document index.
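The idea in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the function names, the `min_support` threshold, the `max_dim` cap and the toy documents are all hypothetical, keyword sets are counted by brute-force enumeration rather than a real association rule miner, and the requirement that keywords occur close to each other is simplified to co-occurrence within the same document. Because every subset of a frequent keyword set is itself frequent, the resulting family of sets is closed under taking faces, which is the defining property of a simplicial complex.

```python
from collections import Counter
from itertools import combinations


def frequent_keyword_sets(docs, min_support, max_dim=3):
    """Map each keyword set found in at least `min_support` documents to its count.

    Assumption: co-occurrence is measured per document; proximity inside
    a document is ignored in this sketch.
    """
    counts = Counter()
    for doc in docs:
        keywords = sorted(set(doc))
        for size in range(1, max_dim + 1):
            for combo in combinations(keywords, size):
                counts[frozenset(combo)] += 1
    return {s: c for s, c in counts.items() if c >= min_support}


def maximal_simplices(simplices):
    """Keep only the simplices that are not a proper face of another simplex."""
    return [s for s in simplices if not any(s < t for t in simplices)]


# Hypothetical toy corpus: each document is a list of keywords.
docs = [
    ["data", "mining", "topology"],
    ["data", "mining", "rules"],
    ["data", "mining", "topology"],
    ["simplicial", "complex"],
]

complex_ = frequent_keyword_sets(docs, min_support=2)
maximal = maximal_simplices(complex_)
```

In this toy corpus the only maximal simplex is the keyword set {data, mining, topology}, which could be read as one candidate concept; documents containing that set could then be grouped together under it.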
Keywords :
data mining; topology; association rules; combinatorial topology; concept mining; document index; geometric structure; high-frequency keywordset; simplicial complex; Association rules; Data mining; Databases; Frequency measurement; Topology; USA Councils;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Granular Computing, 2007. GRC 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3032-1
Type :
conf
DOI :
10.1109/GrC.2007.154
Filename :
4403129