• DocumentCode
    3142009
  • Title

    Tag-topic model for semantic knowledge acquisition from blogs

  • Author

    Li, Fang ; Shen, Huiyu ; He, Tingting

  • Author_Institution
    Eng. & Res. Center for Inf. Technol. on Educ., Huazhong Normal Univ., Wuhan, China
  • fYear
    2011
  • fDate
    27-29 Nov. 2011
  • Firstpage
    221
  • Lastpage
    226
  • Abstract
    This paper proposed a tag-topic model for semantic knowledge acquisition from blogs. The model extends the Latent Dirichlet Allocation by adding a tag layer between the document and topic layer, it represents each document with a mixture of tags, each tag is associated with a multinomial distribution over topics and each topic is associated with a multinomial distribution over words. After parameters estimating, the tags are regarded as concepts, the top words arranged to the top topics are selected as related words of the concepts, and PMI-IR is utilized for filtering out noisy words to improve the quality of the semantic knowledge. Experimental results show that the tag-topic model can effectively capture semantic knowledge.
  • Keywords
    Web sites; document handling; knowledge acquisition; parameter estimation; PMI-IR; blogs; document; latent Dirichlet allocation; multinomial distribution; parameter estimation; semantic knowledge acquisition; tag layer; tag-topic model; topic layer; Computational modeling; Semantics; Perplexity; Semantic Knowledge Acquisition; Tag; Topic Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
  • Conference_Location
    Tokushima
  • Print_ISBN
    978-1-61284-729-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2011.6138198
  • Filename
    6138198