• DocumentCode
    176271
  • Title

    LDA Analyzer: A Tool for Exploring Topic Models

  • Author

    Chunyao Zou ; Daqing Hou

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Clarkson Univ., Potsdam, NY, USA
  • fYear
    2014
  • fDate
    Sept. 29 2014-Oct. 3 2014
  • Firstpage
    593
  • Lastpage
    596
  • Abstract
    Online technical forums are valuable sources for mining useful software engineering information. LDA (Latent Dirichlet Allocation) is an unsupervised machine learning method which can be used for extracting underlying topics out of such large forums. However, the main output of LDA forum learning are usually huge matrices that contain millions of numbers, which is impossible for researchers to directly scrutinize the numerical distribution and semantically evaluate the relationship between the extracted topics and large collection of unorganized documents. In this paper, we present LDAAnalyzer, an LDA visualization tool that makes the hidden topic-document structures rise to the surface. From the functionality point of view, LDA Analyzer consists of (1) LDA modeling (2) LDA output analysis and (3) new corpus training. With the help of LDAAnalyzer, our semantic topic-modeling evaluation based on large technical forums becomes feasible.
  • Keywords
    data mining; data visualisation; software engineering; unsupervised learning; LDA visualization tool; LDAAnalyzer; hidden topic-document structures; latent dirichlet allocation; numerical distribution; online technical forums; semantic topic-modeling evaluation; software engineering information; unorganized documents; unsupervised machine learning method; Data visualization; Load modeling; Mathematical model; Semantics; Standards; Training; Visualization; LDA; forum; topic modeling; visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance and Evolution (ICSME), 2014 IEEE International Conference on
  • Conference_Location
    Victoria, BC
  • ISSN
    1063-6773
  • Type

    conf

  • DOI
    10.1109/ICSME.2014.103
  • Filename
    6976147