• DocumentCode
    3106138
  • Title

    Mining Latent Associations of Objects Using a Typed Mixture Model--A Case Study on Expert/Expertise Mining

  • Author

    Bao, Shenghua ; Cao, Yunbo ; Liu, Bing ; Yu, Yong ; Li, Hang

  • Author_Institution
    Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    803
  • Lastpage
    807
  • Abstract
    This paper studies the problem of discovering latent associations among objects in text documents. Specifically, given two sets of objects and various types of co-occurrence data concerning the objects existing in texts, we aim to discover the hidden or latent associative relationships between the two sets of objects. Existing methods are not directly applicable as they are unable to consider all this information. For example, the probabilistic mixture model called Separable Mixture Model (SMM) proposed by Hofmann can use only one type of co-occurrences to mine latent associations. This paper proposes a more general probabilistic mixture model called the Typed Separable Mixture Model (TSMM), which is able to use all types of co-occurrences within a single framework. Experimental results based on the expert/expertise mining task show that TSMM outperforms SMM significantly.
  • Keywords
    data mining; document handling; probability; co-occurrence data; expert mining; expertise mining; latent associations discovery; latent associations mining; latent associative relationships; probabilistic mixture model; text documents; typed mixture model; typed separable mixture model; Asia; Collaboration; Computer science; Data mining; Information filtering; Information filters; Information retrieval; Natural language processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.109
  • Filename
    4053106