DocumentCode
1820046
Title
KeyGraph and WordNet hypernyms for topic detection
Author
Perera, Kasun ; Karunarathne, Damitha
Author_Institution
Sch. of Comput., Univ. of Colombo, Colombo, Sri Lanka
fYear
2015
fDate
22-24 July 2015
Firstpage
303
Lastpage
308
Abstract
The Vast number of publicly available unstructured information on web and their rapid growth pose a great challenge in understanding, managing and structuring the information. Topic modeling algorithms have been developed with the purpose of analyzing these unstructured data and obtain abstract topics and clusters from these data collections. KeyGraph is a word co-occurrence based algorithm for topic modeling. We provide an extension for KeyGraph algorithm by incorporating WordNet hypernyms for Keywords in the data collection. Our results show that incorporating hypernyms for KeyGraph algorithm would result improved topic and document clusters.
Keywords
Internet; document handling; pattern clustering; KeyGraph hypernyms; Web; WordNet hypernyms; abstract topics; cooccurrence based algorithm; data collections; document clusters; keywords; topic detection; topic modeling algorithms; unstructured data; Algorithm design and analysis; Clustering algorithms; Gold; Ontologies; Security; Sensitivity analysis; Standards; Document clustering; Information Extraction; KeyGraph; Topic Model; WordNet;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Software Engineering (JCSSE), 2015 12th International Joint Conference on
Conference_Location
Songkhla
Type
conf
DOI
10.1109/JCSSE.2015.7219814
Filename
7219814
Link To Document