DocumentCode
3106138
Title
Mining Latent Associations of Objects Using a Typed Mixture Model--A Case Study on Expert/Expertise Mining
Author
Bao, Shenghua ; Cao, Yunbo ; Liu, Bing ; Yu, Yong ; Li, Hang
Author_Institution
Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai
fYear
2006
fDate
18-22 Dec. 2006
Firstpage
803
Lastpage
807
Abstract
This paper studies the problem of discovering latent associations among objects in text documents. Specifically, given two sets of objects and various types of co-occurrence data concerning the objects existing in texts, we aim to discover the hidden or latent associative relationships between the two sets of objects. Existing methods are not directly applicable as they are unable to consider all this information. For example, the probabilistic mixture model called Separable Mixture Model (SMM) proposed by Hofmann can use only one type of co-occurrences to mine latent associations. This paper proposes a more general probabilistic mixture model called the Typed Separable Mixture Model (TSMM), which is able to use all types of co-occurrences within a single framework. Experimental results based on the expert/expertise mining task show that TSMM outperforms SMM significantly.
Keywords
data mining; document handling; probability; co-occurrence data; expert mining; expertise mining; latent associations discovery; latent associations mining; latent associative relationships; probabilistic mixture model; text documents; typed mixture model; typed separable mixture model; Asia; Collaboration; Computer science; Data mining; Information filtering; Information filters; Information retrieval; Natural language processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location
Hong Kong
ISSN
1550-4786
Print_ISBN
0-7695-2701-7
Type
conf
DOI
10.1109/ICDM.2006.109
Filename
4053106
Link To Document