• DocumentCode
    175889
  • Title

    Topic mining for call centers based on LDA

  • Author

    Wenming Guo ; Tianlang Deng

  • Author_Institution
    Sch. of Software Eng., Beijing Univ. of Post & Telecommun., Beijing, China
  • fYear
    2014
  • fDate
    19-21 Aug. 2014
  • Firstpage
    839
  • Lastpage
    844
  • Abstract
    Latent Dirichlet Allocation, which is a non-supervised learning method, can be used for topic detection, automatic text categorization, keyword extraction and so on. It only focuses on the text itself, not considering other external correlation properties. External association property refers to some structured attributes that correspondence with the text data, for example, a paper usually has several properties like authors, publishing time etc. A telephone call usually has several properties like caller number, call time etc. To iron out flaws; we propose an improved model A-LDA based LDA. We use data sets from telephone call centers (a kind of data centers in rapid growth) to experiment on topic detection. The topic results show that A-LDA with introduce of external correlation properties, compared with the traditional LDA, is decreased in perplexity value and has better generalization performance. At the same time, we can obtain the topic that external attributes contained.
  • Keywords
    call centres; data mining; learning (artificial intelligence); telecommunication computing; LDA; automatic text categorization; correlation property; data centers; external association property; generalization performance; keyword extraction; latent Dirichlet allocation; nonsupervised learning method; perplexity value; telephone call centers; topic detection; topic mining; Algorithm design and analysis; Approximation algorithms; Bayes methods; Clustering algorithms; Data mining; Data models; Inference algorithms; A-LDA; LDA; call-centers; topic mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2014 10th International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4799-5150-5
  • Type

    conf

  • DOI
    10.1109/ICNC.2014.6975947
  • Filename
    6975947