DocumentCode :
3142009
Title :
Tag-topic model for semantic knowledge acquisition from blogs
Author :
Li, Fang ; Shen, Huiyu ; He, Tingting
Author_Institution :
Eng. & Res. Center for Inf. Technol. on Educ., Huazhong Normal Univ., Wuhan, China
fYear :
2011
fDate :
27-29 Nov. 2011
Firstpage :
221
Lastpage :
226
Abstract :
This paper proposed a tag-topic model for semantic knowledge acquisition from blogs. The model extends the Latent Dirichlet Allocation by adding a tag layer between the document and topic layer, it represents each document with a mixture of tags, each tag is associated with a multinomial distribution over topics and each topic is associated with a multinomial distribution over words. After parameters estimating, the tags are regarded as concepts, the top words arranged to the top topics are selected as related words of the concepts, and PMI-IR is utilized for filtering out noisy words to improve the quality of the semantic knowledge. Experimental results show that the tag-topic model can effectively capture semantic knowledge.
Keywords :
Web sites; document handling; knowledge acquisition; parameter estimation; PMI-IR; blogs; document; latent Dirichlet allocation; multinomial distribution; parameter estimation; semantic knowledge acquisition; tag layer; tag-topic model; topic layer; Computational modeling; Semantics; Perplexity; Semantic Knowledge Acquisition; Tag; Topic Model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
Type :
conf
DOI :
10.1109/NLPKE.2011.6138198
Filename :
6138198
Link To Document :
بازگشت