• DocumentCode
    1820046
  • Title

    KeyGraph and WordNet hypernyms for topic detection

  • Author

    Perera, Kasun ; Karunarathne, Damitha

  • Author_Institution
    Sch. of Comput., Univ. of Colombo, Colombo, Sri Lanka
  • fYear
    2015
  • fDate
    22-24 July 2015
  • Firstpage
    303
  • Lastpage
    308
  • Abstract
    The Vast number of publicly available unstructured information on web and their rapid growth pose a great challenge in understanding, managing and structuring the information. Topic modeling algorithms have been developed with the purpose of analyzing these unstructured data and obtain abstract topics and clusters from these data collections. KeyGraph is a word co-occurrence based algorithm for topic modeling. We provide an extension for KeyGraph algorithm by incorporating WordNet hypernyms for Keywords in the data collection. Our results show that incorporating hypernyms for KeyGraph algorithm would result improved topic and document clusters.
  • Keywords
    Internet; document handling; pattern clustering; KeyGraph hypernyms; Web; WordNet hypernyms; abstract topics; cooccurrence based algorithm; data collections; document clusters; keywords; topic detection; topic modeling algorithms; unstructured data; Algorithm design and analysis; Clustering algorithms; Gold; Ontologies; Security; Sensitivity analysis; Standards; Document clustering; Information Extraction; KeyGraph; Topic Model; WordNet;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Software Engineering (JCSSE), 2015 12th International Joint Conference on
  • Conference_Location
    Songkhla
  • Type

    conf

  • DOI
    10.1109/JCSSE.2015.7219814
  • Filename
    7219814